WorldWideScience

Sample records for standard-accented speakers sociality

  1. Monolingual and Bilingual Children's Social Preferences for Monolingual and Bilingual Speakers

    Science.gov (United States)

    Byers-Heinlein, Krista; Behrend, Douglas A.; Said, Lyakout Mohamed; Girgis, Helana; Poulin-Dubois, Diane

    2017-01-01

    Past research has shown that young monolingual children exhibit language-based social biases: they prefer native language to foreign language speakers. The current research investigated how children's language preferences are influenced by their own bilingualism and by a speaker's bilingualism. Monolingual and bilingual 4- to 6-year-olds heard…

  2. Extending Situated Language Comprehension (Accounts with Speaker and Comprehender Characteristics: Toward Socially Situated Interpretation

    Directory of Open Access Journals (Sweden)

    Katja Münster

    2018-01-01

    Full Text Available More and more findings suggest a tight temporal coupling between (non-linguistic socially interpreted context and language processing. Still, real-time language processing accounts remain largely elusive with respect to the influence of biological (e.g., age and experiential (e.g., world and moral knowledge comprehender characteristics and the influence of the ‘socially interpreted’ context, as for instance provided by the speaker. This context could include actions, facial expressions, a speaker’s voice or gaze, and gestures among others. We review findings from social psychology, sociolinguistics and psycholinguistics to highlight the relevance of (the interplay between the socially interpreted context and comprehender characteristics for language processing. The review informs the extension of an extant real-time processing account (already featuring a coordinated interplay between language comprehension and the non-linguistic visual context with a variable (‘ProCom’ that captures characteristics of the language user and with a first approximation of the comprehender’s speaker representation. Extending the CIA to the sCIA (social Coordinated Interplay Account is the first step toward a real-time language comprehension account which might eventually accommodate the socially situated communicative interplay between comprehenders and speakers.

  3. Speaker Recognition

    DEFF Research Database (Denmark)

    Mølgaard, Lasse Lohilahti; Jørgensen, Kasper Winther

    2005-01-01

    Speaker recognition is basically divided into speaker identification and speaker verification. Verification is the task of automatically determining if a person really is the person he or she claims to be. This technology can be used as a biometric feature for verifying the identity of a person...

  4. Speaker Authentication

    CERN Document Server

    Li, Qi (Peter)

    2012-01-01

    This book focuses on use of voice as a biometric measure for personal authentication. In particular, "Speaker Recognition" covers two approaches in speaker authentication: speaker verification (SV) and verbal information verification (VIV). The SV approach attempts to verify a speaker’s identity based on his/her voice characteristics while the VIV approach validates a speaker’s identity through verification of the content of his/her utterance(s). SV and VIV can be combined for new applications. This is still a new research topic with significant potential applications. The book provides with a broad overview of the recent advances in speaker authentication while giving enough attention to advanced and useful algorithms and techniques. It also provides a step by step introduction to the current state of the speaker authentication technology, from the fundamental concepts to advanced algorithms. We will also present major design methodologies and share our experience in developing real and successful speake...

  5. Reflection on Social Cohesion of Sasak Speakers and the Trends Transformed

    OpenAIRE

    Saharudin

    2014-01-01

    This article presents the social cohesiveness of traditional Sasak reflected and expressed in their linguistics, particularly at social capital domain, and to transform it to the current social capital of Sasak people. The category and linguistic expressions are considered to be the lingual category forms and the verbal expressions in the social capital domain, in which it is kept various knowledge systems of the social cohesiveness of Sasak people. Transformation in this study is defined as ...

  6. A Comparison of Teacher-Rated Classroom Conduct, Social Skills, and Teacher-Child Relationship Quality between Preschool English Learners and Preschool English Speakers

    Science.gov (United States)

    Luchtel, Molly; Hughes, Kere; Luze, Gayle; Bruna, Katherine Richardson; Peterson, Carla

    2010-01-01

    This study examined the differences between preschool English learners and preschool English speakers in the areas of classroom conduct, social skills, and teacher-child relationship quality, as rated by their teachers. Data were taken from the Early Head Start Research and Evaluation Project. Students who were English learners were rated…

  7. Social Interactions for Academic Purposes in Literature Classes for Non English Speakers

    OpenAIRE

    Ai Chun Yen

    2012-01-01

    Problem statement: This study reports how Learning Communities (LCs) and Online Learning Communities (OLCs) can improve Taiwanese EFL studentsâ lack of Social Interactions (SIs) and acdemic skills in literature classes (Y1, N = 40) by involving freshmen in a social process that encourages student-student and student-instructor discussion, interpretation, production and negotiation. Approach: The data collected from the database of the university learning management system in National Dong Hwa...

  8. Partially supervised speaker clustering.

    Science.gov (United States)

    Tang, Hao; Chu, Stephen Mingyu; Hasegawa-Johnson, Mark; Huang, Thomas S

    2012-05-01

    Content-based multimedia indexing, retrieval, and processing as well as multimedia databases demand the structuring of the media content (image, audio, video, text, etc.), one significant goal being to associate the identity of the content to the individual segments of the signals. In this paper, we specifically address the problem of speaker clustering, the task of assigning every speech utterance in an audio stream to its speaker. We offer a complete treatment to the idea of partially supervised speaker clustering, which refers to the use of our prior knowledge of speakers in general to assist the unsupervised speaker clustering process. By means of an independent training data set, we encode the prior knowledge at the various stages of the speaker clustering pipeline via 1) learning a speaker-discriminative acoustic feature transformation, 2) learning a universal speaker prior model, and 3) learning a discriminative speaker subspace, or equivalently, a speaker-discriminative distance metric. We study the directional scattering property of the Gaussian mixture model (GMM) mean supervector representation of utterances in the high-dimensional space, and advocate exploiting this property by using the cosine distance metric instead of the euclidean distance metric for speaker clustering in the GMM mean supervector space. We propose to perform discriminant analysis based on the cosine distance metric, which leads to a novel distance metric learning algorithm—linear spherical discriminant analysis (LSDA). We show that the proposed LSDA formulation can be systematically solved within the elegant graph embedding general dimensionality reduction framework. Our speaker clustering experiments on the GALE database clearly indicate that 1) our speaker clustering methods based on the GMM mean supervector representation and vector-based distance metrics outperform traditional speaker clustering methods based on the “bag of acoustic features” representation and statistical

  9. Apology Strategy in English By Native Speaker

    Directory of Open Access Journals (Sweden)

    Mezia Kemala Sari

    2016-05-01

    Full Text Available This research discussed apology strategies in English by native speaker. This descriptive study was presented within the framework of Pragmatics based on the forms of strategies due to the coding manual as found in CCSARP (Cross-Cultural Speech Acts Realization Project.The goals of this study were to describe the apology strategies in English by native speaker and identify the influencing factors of it. Data were collected through the use of the questionnaire in the form of Discourse Completion Test, which was distributed to 30 native speakers. Data were classified based on the degree of familiarity and the social distance between speaker and hearer and then the data of native will be separated and classified by the type of strategies in coding manual. The results of this study are the pattern of apology strategies of native speaker brief with the pattern that potentially occurs IFID plus Offer of repair plus Taking on responsibility. While Alerters, Explanation and Downgrading appear with less number of percentage. Then, the factors that influence the apology utterance by native speakers are the social situation, the degree of familiarity and degree of the offence which more complicated the mistake tend to produce the most complex utterances by the speaker.

  10. From Multilingualism to Bilingualism: Changes in Language Use, Language Value, and Social Mobility among Engdewu Speakers in the Solomon Islands

    Science.gov (United States)

    Emerine Hicks, Rachel

    2017-01-01

    On the island of Santa Cruz in the Solomon Islands, the Engdewu language is facing imminent language shift because of the increasing use of the lingua franca Solomon Islands Pijin in the community. In this article, I argue that this language shift is occurring because of changes to the social structure in Baemawz, one of the villages where Engdewu…

  11. Limited data speaker identification

    Indian Academy of Sciences (India)

    The evidences from all these integrated systems can be combined to obtain a speaker identification system that provides improved performance under limited data. Hence the motivation for this work. A few attempts have been made earlier to study and provide solution to the problem of limited data in the speaker recognition ...

  12. Cultural and social change of foreign students in Indonesia: The influence of Javanese Culture in Teaching Indonesian to Speakers of Other Languages (TISOL)

    Science.gov (United States)

    Saddhono, Kundharu

    2018-03-01

    Teaching Indonesian to Speakers of Other Languages (TISOL) program is increasingly in demand by people in various parts of the world. Foreign students learn a lot of Indonesian language in major cities in Indonesia. The purpose of this study is to explain the cultural and social changes of foreign students in Indonesia, especially in Java, which is following TISOL program. This study focused on the influence of Javanese culture on foreign students studying Indonesian in Java. Research method used is descriptive qualitative with ethnography approach. This research was conducted in TISOL program organized by in Central Java, East Java, and Yogyakarta. Sources of data used are documents and informants. The sampling technique used is purposive sampling. Purposive sampling is considered more capable to obtain complete data in the face of various realities. Data collection techniques are done by reviewing documents or records using content analysis techniques. Other techniques used are interview techniques with some students and lecturers to get data about the factors that affect the cultural and social changes of foreign students in Indonesia. Also, interviews were also conducted with teachers to request a different process in TISOL. The most common way used to improve validity in qualitative research is the triangulation technique. In this study used triangulation theory, triangulation method, and review of informants. The results show that using Javanese culture is very influential in the cultural and social changes of foreign students in Indonesia. Students become more enthusiastic and active in responding to learning in TISOL that is influenced by Javanese culture. The change comes from internal and external students. This change helps foreign students to understand Indonesian language and culture more comprehensively.

  13. [Speaker discrimination in cochlear implant users].

    Science.gov (United States)

    Mühler, R; Ziese, M; Verhey, J L

    2017-03-01

    Although the word and sentence recognition skills of cochlear implant (CI) users have been studied extensively, little is known about their ability to distinguish between individuals on the basis of voice, an important skill for social communication. Speech material from the Oldenburg Logatome Corpus (OLLO) was used to build a set of 120 logatome pairs spoken by 15 male and 15 female speakers, with no overlap of the fundamental frequencies of the two groups of speakers. Each pair contained two different logatomes. For half of the pairs, the two logatomes were spoken by the same speaker, for the other half they were spoken by different speakers. Using a same-different paradigm, 13 adult normal-hearing listeners and 13 adult post-lingually deafened CI users were asked whether the pair of different logatomes were spoken by the same or by different speakers. Mean speaker discrimination score for the CI users was 74.6 % correct and for the normal-hearing listeners 89.6 % correct. A significant influence of voice gender on speaker discrimination score was found in CI users and in normal hearing listeners. The results of the CI users were significantly above the level of chance and no ceiling effect was observed for the normal-hearing listeners, i. e., the presented set of logatome pairs from the OLLO seems to be very well suited to speaker discrimination experiments in CI users and quantitative comparison to normal-hearing listeners. CI users are able to discriminate between speakers but their performance is slightly worse than that of normal-hearing listeners.

  14. Forensic speaker recognition

    NARCIS (Netherlands)

    Meuwly, Didier

    2013-01-01

    The aim of forensic speaker recognition is to establish links between individuals and criminal activities, through audio speech recordings. This field is multidisciplinary, combining predominantly phonetics, linguistics, speech signal processing, and forensic statistics. On these bases, expert-based

  15. FUNDAMENTALS OF SPEAKER RECOGNITION

    Directory of Open Access Journals (Sweden)

    Figen ERTAŞ

    2000-02-01

    Full Text Available The explosive growth of information technology in the last decade has made a considerable impact on the design and construction of systems for human-machine communication, which is becoming increasingly important in many aspects of life. Amongst other speech processing tasks, a great deal of attention has been devoted to developing procedures that identify people from their voices, and the design and construction of speaker recognition systems has been a fascinating enterprise pursued over many decades. This paper introduces speaker recognition in general and discusses its relevant parameters in relation to system performance.

  16. Role of Speaker Cues in Attention Inference

    Directory of Open Access Journals (Sweden)

    Jin Joo Lee

    2017-10-01

    Full Text Available Current state-of-the-art approaches to emotion recognition primarily focus on modeling the nonverbal expressions of the sole individual without reference to contextual elements such as the co-presence of the partner. In this paper, we demonstrate that the accurate inference of listeners’ social-emotional state of attention depends on accounting for the nonverbal behaviors of their storytelling partner, namely their speaker cues. To gain a deeper understanding of the role of speaker cues in attention inference, we conduct investigations into real-world interactions of children (5–6 years old storytelling with their peers. Through in-depth analysis of human–human interaction data, we first identify nonverbal speaker cues (i.e., backchannel-inviting cues and listener responses (i.e., backchannel feedback. We then demonstrate how speaker cues can modify the interpretation of attention-related backchannels as well as serve as a means to regulate the responsiveness of listeners. We discuss the design implications of our findings toward our primary goal of developing attention recognition models for storytelling robots, and we argue that social robots can proactively use speaker cues to form more accurate inferences about the attentive state of their human partners.

  17. Sociophonetic study of young Nigerian English speakers ...

    African Journals Online (AJOL)

    This is with a view to confirming the hypothesis that continuous speech processes (CSPs) can be socially differentiated in a speech community. A sample of 180 young educated Nigerian English (NigE) speakers, evenly stratified into gender and class, voiced 19 utterances and a short passage into digital recording devices ...

  18. Limited data speaker identification

    Indian Academy of Sciences (India)

    Speaker identification; limited training and testing data; MFSR; LPR; LPRP; LVQ; GMM–UBM and combining. ... Department of Information Science and Engineering, Siddaganga Institute of Technology, Tumkur 572 103; Department of Electronics and Communication Engineering, Indian Institute of Technology Guwahati, ...

  19. Speaker Profiles and Abstracts

    Indian Academy of Sciences (India)

    2016-11-05

    Nov 5, 2016 ... a polarised fashion. Their work thus illuminates how Cas1-2 complex and IHF coordinate bring about the preferential integration of protospacer into the leader proximal end. Speaker's Profile. B Anand, IIT ... has a realization formula where a Hilbert space makes a natural appearance. Agler in 1990 found a ...

  20. Speaker Profiles and Abstracts

    Indian Academy of Sciences (India)

    2017-06-30

    Jun 30, 2017 ... After a brief mention of some of her early work in topological field theories, the speaker will describe her ... therapy. The unique properties of Helicobacter pylori DNA replication and cell division include but are not limited to the absence of a helicase loader DnaC, polar replisome formation and assembly of ...

  1. Limited data speaker identification

    Indian Academy of Sciences (India)

    dependent or text-independent type. Speech for the same text is used for ... are practical situations like forensic investigations where the availability of speech data is limited and technology is ...... Murthy K S R, Yegnanarayana B 2006 Combining evidence from residual phase and MFCC features for speaker recognition.

  2. Speaker adaptation in the NIST Speaker Recognition Evaluation 2004

    NARCIS (Netherlands)

    Leeuwen, D.A. van

    2005-01-01

    New in the 2004 edition of the NIST Speaker Recognition Evaluation (SRE) was the condition where unsupervised adaptation of speaker models is allowed. Despite the promising results on development test material, hardly any beneficial results were obtained in the Evaluation itself. An analysis is made

  3. SPEAKER RECOGNITION OF MAGHREB DIALECTS

    OpenAIRE

    Leila Beltaifa - Zouari *1, Asma Chayeh 2

    2017-01-01

    A few studies have focused on the west Arabic (Maghreb) dialects for which resources are rare. To handle this problem, we devoped a web-based database of speech from Tunisian, Algerian and Moroccan speakers covering the diversity of Arabic dialects spoken in north Africa. Then speaker identification and verification experiments have been conducted in order to evaluate the performance of each dialect-based system. A baseline system using Timit database have also be developed for comparison pur...

  4. The 2016 NIST Speaker Recognition Evaluation

    Science.gov (United States)

    2017-08-20

    impact on system performance. Index Terms: NIST evaluation, NIST SRE, speaker detection, speaker recognition, speaker verification 1. Introduction NIST...test segments were uni- formly sampled, ranging approximately from 10s to 60s. This provided the opportunity to more precisely measure the impact of...test segment duration on speaker recognition performance. As for speaker model enrollment, unlike previous SREs, gender labels were not provided. There

  5. Differences in Sickness Allowance Receipt between Swedish Speakers and Finnish Speakers in Finland

    Directory of Open Access Journals (Sweden)

    Kaarina S. Reini

    2017-12-01

    Full Text Available Previous research has documented lower disability retirement and mortality rates of Swedish speakers as compared with Finnish speakers in Finland. This paper is the first to compare the two language groups with regard to the receipt of sickness allowance, which is an objective health measure that reflects a less severe poor health condition. Register-based data covering the years 1988-2011 are used. We estimate logistic regression models with generalized estimating equations to account for repeated observations at the individual level. We find that Swedish-speaking men have approximately 30 percent lower odds of receiving sickness allowance than Finnish-speaking men, whereas the difference in women is about 15 percent. In correspondence with previous research on all-cause mortality at working ages, we find no language-group difference in sickness allowance receipt in the socially most successful subgroup of the population.

  6. Communication Interface for Mexican Spanish Dysarthric Speakers

    Directory of Open Access Journals (Sweden)

    Gladys Bonilla-Enriquez

    2012-03-01

    Full Text Available La disartria es una discapacidad motora del habla caracterizada por debilidad o poca coordinación de los músculos del habla. Esta condición puede ser causada por un infarto, parálisis cerebral, o por una lesión severa en el cerebro. Para mexicanos con esta condición hay muy pocas, si es que hay alguna, tecnologías de asistencia para mejorar sus habilidades sociales de interacción. En este artículo presentamos nuestros avances hacia el desarrollo de una interfazde comunicación para hablantes con disartria cuya lengua materna sea el español mexicano. La metodología propuesta depende de (1 diseño especial de un corpus de entrenamiento con voz normal y recursos limitados, (2 adaptación de usuario estándar, y (3 control de la perplejidad del modelo de lenguaje para lograr alta precisión en el Reconocimiento Automático del Habla (RAH. La interfaz permite al usuario y terapéuta el realizar actividades como adaptación dinámica de usuario, adaptación de vocabulario, y síntesis de texto a voz. Pruebas en vivo fueron realizadas con un usuario con disartria leve, logrando precisiones de 93%-95% para habla espontánea.Dysarthria is a motor speech disorder due to weakness or poor coordination of the speechmuscles. This condition can be caused by a stroke, cerebral palsy, or by a traumatic braininjury. For Mexican people with this condition there are few, if any, assistive technologies to improve their social interaction skills. In this paper we present our advances towards the development of a communication interface for dysarthric speakers whose native language is Mexican Spanish. We propose a methodology that relies on (1 special design of a training normal-speech corpus with limited resources, (2 standard speaker adaptation, and (3 control of language model perplexity, to achieve high Automatic Speech Recognition (ASR accuracy. The interface allows the user and therapist to perform tasks such as dynamic speaker adaptation, vocabulary

  7. The road to understanding is paved with the speaker's intentions: cues to the speaker's attention and intentions affect pronoun comprehension.

    Science.gov (United States)

    Nappa, Rebecca; Arnold, Jennifer E

    2014-05-01

    A series of experiments explore the effects of attention-directing cues on pronoun resolution, contrasting four specific hypotheses about the interpretation of ambiguous pronouns he and she: (1) it is driven by grammatical rules, (2) it is primarily a function of social processing of the speaker's intention to communicate, (3) it is modulated by the listener's own egocentric attention, and (4) it is primarily a function of learned probabilistic cues. Experiment 1 demonstrates that pronoun interpretation is guided by the well-known N1 (first-mention) bias, which is also modulated by both the speaker's gaze and pointing gestures. Experiment 2 demonstrates that a low-level visual capture cue has no effect on pronoun interpretation, in contrast with the social cue of pointing. Experiment 3 uses a novel intentional cue: the same attention-capture flash as in Experiment 2, but with instructions that the cue is intentionally created by the speaker. This cue does modulate the N1 bias, demonstrating the importance of information about the speaker's intentions to pronoun resolution. Taken in sum, these findings demonstrate that pronoun resolution is a process best categorized as driven by an appreciation of the speaker's communicative intent, which may be subserved by a sensitivity to predictive cues in the environment. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Speaker's voice as a memory cue.

    Science.gov (United States)

    Campeanu, Sandra; Craik, Fergus I M; Alain, Claude

    2015-02-01

    Speaker's voice occupies a central role as the cornerstone of auditory social interaction. Here, we review the evidence suggesting that speaker's voice constitutes an integral context cue in auditory memory. Investigation into the nature of voice representation as a memory cue is essential to understanding auditory memory and the neural correlates which underlie it. Evidence from behavioral and electrophysiological studies suggest that while specific voice reinstatement (i.e., same speaker) often appears to facilitate word memory even without attention to voice at study, the presence of a partial benefit of similar voices between study and test is less clear. In terms of explicit memory experiments utilizing unfamiliar voices, encoding methods appear to play a pivotal role. Voice congruency effects have been found when voice is specifically attended at study (i.e., when relatively shallow, perceptual encoding takes place). These behavioral findings coincide with neural indices of memory performance such as the parietal old/new recollection effect and the late right frontal effect. The former distinguishes between correctly identified old words and correctly identified new words, and reflects voice congruency only when voice is attended at study. Characterization of the latter likely depends upon voice memory, rather than word memory. There is also evidence to suggest that voice effects can be found in implicit memory paradigms. However, the presence of voice effects appears to depend greatly on the task employed. Using a word identification task, perceptual similarity between study and test conditions is, like for explicit memory tests, crucial. In addition, the type of noise employed appears to have a differential effect. While voice effects have been observed when white noise is used at both study and test, using multi-talker babble does not confer the same results. In terms of neuroimaging research modulations, characterization of an implicit memory effect

  9. Bilingual and monolingual children prefer native-accented speakers.

    Science.gov (United States)

    Souza, André L; Byers-Heinlein, Krista; Poulin-Dubois, Diane

    2013-01-01

    Adults and young children prefer to affiliate with some individuals rather than others. Studies have shown that monolingual children show in-group biases for individuals who speak their native language without a foreign accent (Kinzler et al., 2007). Some studies have suggested that bilingual children are less influenced than monolinguals by language variety when attributing personality traits to different speakers (Anisfeld and Lambert, 1964), which could indicate that bilinguals have fewer in-group biases and perhaps greater social flexibility. However, no previous studies have compared monolingual and bilingual children's reactions to speakers with unfamiliar foreign accents. In the present study, we investigated the social preferences of 5-year-old English and French monolinguals and English-French bilinguals. Contrary to our predictions, both monolingual and bilingual preschoolers preferred to be friends with native-accented speakers over speakers who spoke their dominant language with an unfamiliar foreign accent. This result suggests that both monolingual and bilingual children have strong preferences for in-group members who use a familiar language variety, and that bilingualism does not lead to generalized social flexibility.

  10. Speaker Identity Supports Phonetic Category Learning

    Science.gov (United States)

    Mani, Nivedita; Schneider, Signe

    2013-01-01

    Visual cues from the speaker's face, such as the discriminable mouth movements used to produce speech sounds, improve discrimination of these sounds by adults. The speaker's face, however, provides more information than just the mouth movements used to produce speech--it also provides a visual indexical cue of the identity of the speaker. The…

  11. Signal modelling in speaker recognition | Sharma | Botswana ...

    African Journals Online (AJOL)

    This paper introduces recent advancements in speaker recognition system. It incorporates the background concepts and emphasizes the signal model design technique most commonly used in the state-of-the-art of speaker recognition system. An analytical description is provided of procedures applied in speaker ...

  12. Improving Speaker Recognition by Biometric Voice Deconstruction

    Directory of Open Access Journals (Sweden)

    Luis Miguel eMazaira-Fernández

    2015-09-01

    Full Text Available Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g. YouTube to broadcast its message. In this new scenario, classical identification methods (such fingerprints or face recognition have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. Through the present paper, a new methodology to characterize speakers will be shown. This methodology is benefiting from the advances achieved during the last years in understanding and modelling voice production. The paper hypothesizes that a gender dependent characterization of speakers combined with the use of a new set of biometric parameters extracted from the components resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract gender-dependent extended biometric parameters are given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

  13. Similar speaker recognition using nonlinear analysis

    International Nuclear Information System (INIS)

    Seo, J.P.; Kim, M.S.; Baek, I.C.; Kwon, Y.H.; Lee, K.S.; Chang, S.W.; Yang, S.I.

    2004-01-01

    Speech features of the conventional speaker identification system, are usually obtained by linear methods in spectral space. However, these methods have the drawback that speakers with similar voices cannot be distinguished, because the characteristics of their voices are also similar in spectral space. To overcome the difficulty in linear methods, we propose to use the correlation exponent in the nonlinear space as a new feature vector for speaker identification among persons with similar voices. We show that our proposed method surprisingly reduces the error rate of speaker identification system to speakers with similar voices

  14. Hybrid Speaker Recognition Using Universal Acoustic Model

    Science.gov (United States)

    Nishimura, Jun; Kuroda, Tadahiro

    We propose a novel speaker recognition approach using a speaker-independent universal acoustic model (UAM) for sensornet applications. In sensornet applications such as “Business Microscope”, interactions among knowledge workers in an organization can be visualized by sensing face-to-face communication using wearable sensor nodes. In conventional studies, speakers are detected by comparing energy of input speech signals among the nodes. However, there are often synchronization errors among the nodes which degrade the speaker recognition performance. By focusing on property of the speaker's acoustic channel, UAM can provide robustness against the synchronization error. The overall speaker recognition accuracy is improved by combining UAM with the energy-based approach. For 0.1s speech inputs and 4 subjects, speaker recognition accuracy of 94% is achieved at the synchronization error less than 100ms.

  15. Speaker independent acoustic-to-articulatory inversion

    Science.gov (United States)

    Ji, An

    Acoustic-to-articulatory inversion, the determination of articulatory parameters from acoustic signals, is a difficult but important problem for many speech processing applications, such as automatic speech recognition (ASR) and computer aided pronunciation training (CAPT). In recent years, several approaches have been successfully implemented for speaker dependent models with parallel acoustic and kinematic training data. However, in many practical applications inversion is needed for new speakers for whom no articulatory data is available. In order to address this problem, this dissertation introduces a novel speaker adaptation approach called Parallel Reference Speaker Weighting (PRSW), based on parallel acoustic and articulatory Hidden Markov Models (HMM). This approach uses a robust normalized articulatory space and palate referenced articulatory features combined with speaker-weighted adaptation to form an inversion mapping for new speakers that can accurately estimate articulatory trajectories. The proposed PRSW method is evaluated on the newly collected Marquette electromagnetic articulography -- Mandarin Accented English (EMA-MAE) corpus using 20 native English speakers. Cross-speaker inversion results show that given a good selection of reference speakers with consistent acoustic and articulatory patterns, the PRSW approach gives good speaker independent inversion performance even without kinematic training data.

  16. Infants' preferences for native speakers are associated with an expectation of information

    DEFF Research Database (Denmark)

    Begus, Katarina; Gliga, Teodora; Southgate, Victoria

    2016-01-01

    Humans' preference for others who share our group membership is well documented, and this heightened valuation of in-group members seems to be rooted in early development. Before 12 mo of age, infants already show behavioral preferences for others who evidence cues to same-group membership...... such as race or native language, yet the function of this selectivity remains unclear. We examine one of these social biases, the preference for native speakers, and propose that this preference may result from infants' motivation to obtain information and the expectation that interactions with native speakers...... was observed when infants could expect to receive information from the native speaker, indicating that infants were preparing to learn information from the native speaker to a greater extent than from the foreign speaker. While previous research has demonstrated that infants prefer to interact...

  17. A speaker recognition solution for identification and authentication

    OpenAIRE

    2014-01-01

    M.Com. (Informatics) A certain degree of vulnerability exists in traditional knowledge-based identification and authentication access control, as a result of password interception and social engineering techniques. This vulnerability has warranted the exploration of additional identification and authentication approaches such as physical token-based systems and biometrics. Speaker recognition is one such biometric approach that is currently not widely used due to its inherent technological...

  18. Infants' Selectively Pay Attention to the Information They Receive from a Native Speaker of Their Language.

    Science.gov (United States)

    Marno, Hanna; Guellai, Bahia; Vidal, Yamil; Franzoi, Julia; Nespor, Marina; Mehler, Jacques

    2016-01-01

    From the first moments of their life, infants show a preference for their native language, as well as toward speakers with whom they share the same language. This preference appears to have broad consequences in various domains later on, supporting group affiliations and collaborative actions in children. Here, we propose that infants' preference for native speakers of their language also serves a further purpose, specifically allowing them to efficiently acquire culture specific knowledge via social learning. By selectively attending to informants who are native speakers of their language and who probably also share the same cultural background with the infant, young learners can maximize the possibility to acquire cultural knowledge. To test whether infants would preferably attend the information they receive from a speaker of their native language, we familiarized 12-month-old infants with a native and a foreign speaker, and then presented them with movies where each of the speakers silently gazed toward unfamiliar objects. At test, infants' looking behavior to the two objects alone was measured. Results revealed that infants preferred to look longer at the object presented by the native speaker. Strikingly, the effect was replicated also with 5-month-old infants, indicating an early development of such preference. These findings provide evidence that young infants pay more attention to the information presented by a person with whom they share the same language. This selectivity can serve as a basis for efficient social learning by influencing how infants' allocate attention between potential sources of information in their environment.

  19. Speaker's presentations. Energy supply security

    International Nuclear Information System (INIS)

    Pierret, Ch.

    2000-01-01

    This document is a collection of most of the papers used by the speakers of the European Seminar on Energy Supply Security organised in Paris (at the French Ministry of Economy, Finance and Industry) on 24 November 2000 by the General Direction of Energy and Raw Materials, in co-operation with the European Commission and the French Planning Office. About 250 attendees were present, including a lot of high level Civil Servants from the 15 European State members, and their questions have allowed to create a rich debate. It took place five days before the publication, on 29 November 2000, by the European Commission, of the Green Paper 'Towards a European Strategy for the Security of Energy Supply'. This French initiative, which took place within the framework of the European Presidency of the European Union, during the second half-year 2000. will bring a first impetus to the brainstorming launched by the Commission. (author)

  20. Native Speakers in Linguistic Imperialism

    DEFF Research Database (Denmark)

    Phillipson, Robert

    2016-01-01

    learning and teaching, and the inappropriate qualifications of those sent to education systems when they are unfamiliar with the learners’ languages, cultures, and pedagogical traditions. Whether the schemes involved constitute linguistic imperialismis analysed. Whereas the need for multilingual competence......An investigation of Native English Speaking Teachers’ performance in schemes in six Asian contexts, commissioned by the British Council, and undertaken by three British academics, is subjected to critical evaluation. Key issues for exploration are the issue of a monolingual approach to English...... is recognised as desirable by some British experts, the native speakers in question seldom have this key qualification. This is even the case when the host country (Brunei) aims at bilingual education. It is unlikely that the host countries are getting value for money. Whether the UK and other ‘English...

  1. Infants Prefer Tunes Previously Introduced by Speakers of Their Native Language.

    Science.gov (United States)

    Soley, Gaye; Sebastián-Gallés, Núria

    2015-01-01

    Infants show attentional biases for certain individuals over others based on various cues. However, the role of these biases in shaping infants' preferences and learning is not clear. This study asked whether infants' preference for native speakers (Kinzler, Dupoux, & Spelke, 2007) would modulate their preferences for tunes. After getting equal exposure to two different tunes introduced by two speakers, 7-month-olds (N = 32) listened longer to the tune that was introduced by a native speaker compared to the tune that was introduced by a foreign speaker. This suggests that the social-emotional context in which exposure to stimuli occurs influences auditory preferences, and that the early emerging attentional biases might have important ramifications regarding social learning in early infancy. © 2015 The Authors. Child Development © 2015 Society for Research in Child Development, Inc.

  2. Voice aftereffects of adaptation to speaker identity.

    Science.gov (United States)

    Zäske, Romi; Schweinberger, Stefan R; Kawahara, Hideki

    2010-09-01

    While adaptation to complex auditory stimuli has traditionally been reported for linguistic properties of speech, the present study demonstrates non-linguistic high-level aftereffects in the perception of voice identity, following adaptation to voices or faces of personally familiar speakers. In Exp. 1, prolonged exposure to speaker A's voice biased the perception of identity-ambiguous voice morphs between speakers A and B towards speaker B (and vice versa). Significantly biased voice identity perception was also observed in Exp. 2 when adaptors were videos of speakers' silently articulating faces, although effects were reduced in magnitude relative to those seen in Exp. 1. By contrast, adaptation to an unrelated speaker C elicited an intermediate proportion of speaker A identifications in both experiments. While crossmodal aftereffects on auditory identification (Exp. 2) dissipated rapidly, unimodal aftereffects (Exp. 1) were still measurable a few minutes after adaptation. These novel findings suggest contrastive coding of voice identity in long-term memory, with at least two perceptual mechanisms of voice identity adaptation: one related to auditory coding of voice characteristics, and another related to multimodal coding of familiar speaker identity. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  3. Speaker-specific variability of phoneme durations

    CSIR Research Space (South Africa)

    Van Heerden, CJ

    2007-11-01

    Full Text Available The durations of phonemes varies for different speakers. To this end, the correlations between phonemes across different speakers are studied and a novel approach to predict unknown phoneme durations from the values of known phoneme durations for a...

  4. Deixis in Spontaneous Speech of Jordanian Urban Arabic Native Speakers

    Directory of Open Access Journals (Sweden)

    Zainab Sa'aida

    2017-02-01

    Full Text Available This study aims at describing types and usages of deixis in the speech of Jordanian Urban Arabic native speakers. The present study was conducted in different settings which researcher’s family members, friends, colleagues, and acquaintances took part in. Data of the study were collected through observing spontaneous speech of native speakers of Jordanian Urban Arabic. The data consist of transcriptions of deictic expressions, which were categorised into groups according to the types or usages of deictic words. The data were translated and transliterated by the researcher. The International Phonetic Alphabet symbols were used to transcribe the data. Findings of the study show that there are five types of deictic expressions in Jordanian Urban Arabic: Personal, spatial, temporal, discourse and social deixis. The current study has also described the various usages of deictic words in Jordanian Urban Arabic: gestural, symbolic, and non-deictic usage.

  5. Basque speakers and the media: opinions, habits and attitudes

    Directory of Open Access Journals (Sweden)

    Gorka Salces Alcalde

    2016-07-01

    Full Text Available This research tries to shed light on the relationship of Basque language speakers with the media, as its main aim is to help understand which social, psychological and cognitive mechanisms influence the media consumption of minoritised language speakers. To fulfill this purpose, it was used an integrated methodological perspective, which has combined conducting in-depth interviews, on the qualitative side, and the compilation of audience data, on the quantitative one. Given that both the media landscape and the Basque linguistic community are involved in a process of deep transformation, this work´s conclusions try to set out some findings and lines of reflection that could help Basque language media understand better their potential audience and anticipate the challenges of the near future.

  6. Robustness-related issues in speaker recognition

    CERN Document Server

    Zheng, Thomas Fang

    2017-01-01

    This book presents an overview of speaker recognition technologies with an emphasis on dealing with robustness issues. Firstly, the book gives an overview of speaker recognition, such as the basic system framework, categories under different criteria, performance evaluation and its development history. Secondly, with regard to robustness issues, the book presents three categories, including environment-related issues, speaker-related issues and application-oriented issues. For each category, the book describes the current hot topics, existing technologies, and potential research focuses in the future. The book is a useful reference book and self-learning guide for early researchers working in the field of robust speech recognition.

  7. On the optimization of a mixed speaker array in an enclosed space using the virtual-speaker weighting method

    Science.gov (United States)

    Peng, Bo; Zheng, Sifa; Liao, Xiangning; Lian, Xiaomin

    2018-03-01

    In order to achieve sound field reproduction in a wide frequency band, multiple-type speakers are used. The reproduction accuracy is not only affected by the signals sent to the speakers, but also depends on the position and the number of each type of speaker. The method of optimizing a mixed speaker array is investigated in this paper. A virtual-speaker weighting method is proposed to optimize both the position and the number of each type of speaker. In this method, a virtual-speaker model is proposed to quantify the increment of controllability of the speaker array when the speaker number increases. While optimizing a mixed speaker array, the gain of the virtual-speaker transfer function is used to determine the priority orders of the candidate speaker positions, which optimizes the position of each type of speaker. Then the relative gain of the virtual-speaker transfer function is used to determine whether the speakers are redundant, which optimizes the number of each type of speaker. Finally the virtual-speaker weighting method is verified by reproduction experiments of the interior sound field in a passenger car. The results validate that the optimum mixed speaker array can be obtained using the proposed method.

  8. Data requirements for speaker independent acoustic models

    CSIR Research Space (South Africa)

    Badenhorst, JAC

    2008-11-01

    Full Text Available When developing speech recognition systems in resource-constrained environments, careful design of the training corpus can play an important role in compensating for data scarcity. One of the factors to consider relates to the speaker composition...

  9. Yes, you can? A speaker's potency to act upon his words orchestrates early neural responses to message-level meaning.

    Science.gov (United States)

    Bornkessel-Schlesewsky, Ina; Krauspenhaar, Sylvia; Schlesewsky, Matthias

    2013-01-01

    Evidence is accruing that, in comprehending language, the human brain rapidly integrates a wealth of information sources-including the reader or hearer's knowledge about the world and even his/her current mood. However, little is known to date about how language processing in the brain is affected by the hearer's knowledge about the speaker. Here, we investigated the impact of social attributions to the speaker by measuring event-related brain potentials while participants watched videos of three speakers uttering true or false statements pertaining to politics or general knowledge: a top political decision maker (the German Federal Minister of Finance at the time of the experiment), a well-known media personality and an unidentifiable control speaker. False versus true statements engendered an N400 - late positivity response, with the N400 (150-450 ms) constituting the earliest observable response to message-level meaning. Crucially, however, the N400 was modulated by the combination of speaker and message: for false versus true political statements, an N400 effect was only observable for the politician, but not for either of the other two speakers; for false versus true general knowledge statements, an N400 was engendered by all three speakers. We interpret this result as demonstrating that the neurophysiological response to message-level meaning is immediately influenced by the social status of the speaker and whether he/she has the power to bring about the state of affairs described.

  10. Pitch Correlogram Clustering for Fast Speaker Identification

    Directory of Open Access Journals (Sweden)

    Nitin Jhanwar

    2004-12-01

    Full Text Available Gaussian mixture models (GMMs are commonly used in text-independent speaker identification systems. However, for large speaker databases, their high computational run-time limits their use in online or real-time speaker identification situations. Two-stage identification systems, in which the database is partitioned into clusters based on some proximity criteria and only a single-cluster GMM is run in every test, have been suggested in literature to speed up the identification process. However, most clustering algorithms used have shown limited success, apparently because the clustering and GMM feature spaces used are derived from similar speech characteristics. This paper presents a new clustering approach based on the concept of a pitch correlogram that captures frame-to-frame pitch variations of a speaker rather than short-time spectral characteristics like cepstral coefficient, spectral slopes, and so forth. The effectiveness of this two-stage identification process is demonstrated on the IVIE corpus of 110 speakers. The overall system achieves a run-time advantage of 500% as well as a 10% reduction of error in overall speaker identification.

  11. Accent attribution in speakers with Foreign Accent Syndrome

    OpenAIRE

    Verhoeven, J.; De Pauw, G.; Pettinato, M.; Hirson, A.; Van Borsel, J.; Marien, P.

    2013-01-01

    Purpose: The main aim of this experiment was to establish the extent to which the impression of foreignness in speakers with Foreign Accent Syndrome (FAS) is in any way comparable to the impression of foreignness in speakers with a real foreign accent.\\ud \\ud Method: Three groups of listeners attributed accents to conversational speech samples of 5 FAS speakers which were embedded amongst those of 5 speakers with a real foreign accent and 5 native speaker controls. The listener groups differe...

  12. When speaker identity is unavoidable : Neural processing of speaker identity cues in natural speech

    NARCIS (Netherlands)

    Tuninetti, A.; Chládková, K.; Peter, V.; Schiller, N.O.; Escudero, P.

    2017-01-01

    Speech sound acoustic properties vary largely across speakers and accents. When perceiving speech, adult listeners normally disregard non-linguistic variation caused by speaker or accent differences, in order to comprehend the linguistic message, e.g. to correctly identify a speech sound or a word.

  13. Infants’ selectively pay attention to the information they receive from a native speaker of their language

    Directory of Open Access Journals (Sweden)

    Hanna Marno

    2016-08-01

    Full Text Available AbstractFrom the first moments of their life, infants show a preference for their native language, as well as towards speakers with whom they share the same language. This preference appears to have broad consequences in various domains later on, supporting group affiliations and collaborative actions in children. Here, we propose that infants’ preference for native speakers of their language also serves a further purpose, specifically allowing them to efficiently acquire culture specific knowledge via social learning. By selectively attending to informants who are native speakers of their language and who probably also share the same cultural background with the infant, young learners can maximize the possibility to acquire cultural knowledge. To test whether infants would preferably attend the information they receive from a speaker of their native language, we familiarized 12-month-old infants with a native and a foreign speaker, and then presented them with movies where each of the speakers silently gazed towards unfamiliar objects. At test, infants’ looking behaviour to the two objects alone was measured. Results revealed that infants preferred to look longer at the object presented by the native speaker. Strikingly, the effect was replicated also with 5-month-old infants, indicating an early development of such preference. These findings provide evidence that young infants pay more attention to the information presented by a person with whom they share the same language. This selectivity can serve as a basis for efficient social learning by influencing how infants’ allocate attention between potential sources of information in their environment.

  14. The kinesthetic speaker. Putting action into words.

    Science.gov (United States)

    Morgan, N

    2001-04-01

    Speeches and presentations offer an interesting catch-22: executives don't want to spend long hours creating them, and people don't want to sit for long hours listening to them. Ultimately, though, executives can't live without them. That's because a good speech or presentation has the power to inspire people to act on the speaker's behalf and create change. Author Nick Morgan, a longtime speech-writer and speaking coach, says what's most often lacking in today's speeches and presentations is what he calls the "kinesthetic connection." Many good speakers connect aurally with their audiences, telling dramatic stories and effectively pacing their speeches to hold people's attention. Others connect visually, with a vivid film clip or a killer slide. Some people do both, but not many also connect kinesthetically. Morgan says the kinesthetic speaker feeds an audience's primal hunger to experience a presentation on a physical, as well as an intellectual, level. Through awareness of their own physical presence--gestures, posture, movements--and through the effective use of the space in which they present, kinesthetic speakers can create potent nonverbal messages that reinforce their verbal ones. In this article, Morgan describes techniques for harnessing kinesthetic power and creating a sense of intimacy with an audience--a closeness that is more widely expected from speakers since the advent of television. For instance, kinesthetic speakers should make use of audience proxies--individuals in the crowd who serve as representatives for the others. Ultimately, the author says, a speech or presentation offers something of great value to business executives: it's the best vehicle for winning trust from large groups of people--be they employees, colleagues, or share-holders.

  15. SEMI-AUTOMATIC SPEAKER VERIFICATION SYSTEM

    Directory of Open Access Journals (Sweden)

    E. V. Bulgakova

    2016-03-01

    Full Text Available Subject of Research. The paper presents a semi-automatic speaker verification system based on comparing of formant values, statistics of phone lengths and melodic characteristics as well. Due to the development of speech technology, there is an increased interest now in searching for expert speaker verification systems, which have high reliability and low labour intensiveness because of the automation of data processing for the expert analysis. System Description. We present a description of a novel system analyzing similarity or distinction of speaker voices based on comparing statistics of phone lengths, formant features and melodic characteristics. The characteristic feature of the proposed system based on fusion of methods is a weak correlation between the analyzed features that leads to a decrease in the error rate of speaker recognition. The system advantage is the possibility to carry out rapid analysis of recordings since the processes of data preprocessing and making decision are automated. We describe the functioning methods as well as fusion of methods to combine their decisions. Main Results. We have tested the system on the speech database of 1190 target trials and 10450 non-target trials, including the Russian speech of the male and female speakers. The recognition accuracy of the system is 98.59% on the database containing records of the male speech, and 96.17% on the database containing records of the female speech. It was also experimentally established that the formant method is the most reliable of all used methods. Practical Significance. Experimental results have shown that proposed system is applicable for the speaker recognition task in the course of phonoscopic examination.

  16. Joint Single-Channel Speech Separation and Speaker Identification

    DEFF Research Database (Denmark)

    Mowlaee, Pejman; Saeidi, Rahim; Tan, Zheng-Hua

    2010-01-01

    In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identiſcation block. The improvement is accomplished by incorporat......In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identiſcation block. The improvement is accomplished...... by incorporating the speaker identities found by the speaker identiſcation block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system...

  17. Pratap Raychaudhuri | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    ... from the studies carried out in the speaker's laboratory, the kind of problems that can be addressed using this technique will be highlighted. That it is much more economic and infinitely more productive to build your own scanning tunneling microscope than to buy commercial products commonly available in the market will ...

  18. Souvik Mukherjee | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    Past studies did not consider Asian/Indian individuals, which is important since these populations have diverse population histories. The speakers team has characterised the healthy facial skin microbiome in an Indian population of Dravidian descent and derived evidence that inter-individual variability in microbiome profile ...

  19. Raghavan Krishnan | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    Using a state-of-the-art global climate model with high-resolution zooming over South Asia, it was demonstrated by the speaker's group that the weakening trend of the monsoon is attributable largely to anthropogenic aerosol forcing, together with contributions from land-use land-cover changes and rapid warming of the ...

  20. Pradyut Ghosh | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    How they succeeded in linking molecules together to design molecular machines from a tiny lift to microscopic motors and muscles will first be discussed. Then the speaker's group's work on the development of threaded systems, such as pseudorotaxanes and rotaxanes having multiple functionalities, will also be discussed ...

  1. Schizophrenia among Sesotho speakers in South Africa

    African Journals Online (AJOL)

    memory coincides with the findings by Ensink et al.11 who reported a similar trend among Xhosa-speaking patients with schizophrenia and the culture-bound syndrome known as amafufunyana. Table II: Primary symptoms expressed by Sesotho speakers. Symptoms. (%). Symptoms. (%). Auditory hallucinations. 90,0.

  2. Amitabh Joshi | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    Joshi's research interests include the evolution of life-histories; evolution of competitive ability and its consequences for population dynamics; small population and ... a few examples from work in the speaker's laboratory over the past twenty years will be used to illustrate the power of this approach, and underscore the point ...

  3. Speaker recognition through NLP and CWT modeling.

    Energy Technology Data Exchange (ETDEWEB)

    Brown-VanHoozer, A.; Kercel, S. W.; Tucker, R. W.

    1999-06-23

    The objective of this research is to develop a system capable of identifying speakers on wiretaps from a large database (>500 speakers) with a short search time duration (<30 seconds), and with better than 90% accuracy. Much previous research in speaker recognition has led to algorithms that produced encouraging preliminary results, but were overwhelmed when applied to populations of more than a dozen or so different speakers. The authors are investigating a solution to the ''huge population'' problem by seeking two completely different kinds of characterizing features. These features are extracted using the techniques of Neuro-Linguistic Programming (NLP) and the continuous wavelet transform (CWT). NLP extracts precise neurological, verbal and non-verbal information, and assimilates the information into useful patterns. These patterns are based on specific cues demonstrated by each individual, and provide ways of determining congruency between verbal and non-verbal cues. The primary NLP modalities are characterized through word spotting (or verbal predicates cues, e.g., see, sound, feel, etc.) while the secondary modalities would be characterized through the speech transcription used by the individual. This has the practical effect of reducing the size of the search space, and greatly speeding up the process of identifying an unknown speaker. The wavelet-based line of investigation concentrates on using vowel phonemes and non-verbal cues, such as tempo. The rationale for concentrating on vowels is there are a limited number of vowels phonemes, and at least one of them usually appears in even the shortest of speech segments. Using the fast, CWT algorithm, the details of both the formant frequency and the glottal excitation characteristics can be easily extracted from voice waveforms. The differences in the glottal excitation waveforms as well as the formant frequency are evident in the CWT output. More significantly, the CWT reveals significant

  4. Speaker Recognition Through NLP and CWT Modeling

    Energy Technology Data Exchange (ETDEWEB)

    Brown-VanHoozer, S.A.; Kercel, S.W.; Tucker, R.W.

    1999-06-16

    The objective of this research is to develop a system capable of identifying speakers on wiretaps from a large database (>500 speakers) with a short search time duration (<30 seconds), and with better than 90% accuracy. Much previous research in speaker recognition has led to algorithms that produced encouraging preliminary results, but were overwhelmed when applied to populations of more than a dozen or so different speakers. The authors are investigating a solution to the "large population" problem by seeking two completely different kinds of characterizing features. These features are he techniques of Neuro-Linguistic Programming (NLP) and the continuous wavelet transform (CWT). NLP extracts precise neurological, verbal and non-verbal information, and assimilates the information into useful patterns. These patterns are based on specific cues demonstrated by each individual, and provide ways of determining congruency between verbal and non-verbal cues. The primary NLP modalities are characterized through word spotting (or verbal predicates cues, e.g., see, sound, feel, etc.) while the secondary modalities would be characterized through the speech transcription used by the individual. This has the practical effect of reducing the size of the search space, and greatly speeding up the process of identifying an unknown speaker. The wavelet-based line of investigation concentrates on using vowel phonemes and non-verbal cues, such as tempo. The rationale for concentrating on vowels is there are a limited number of vowels phonemes, and at least one of them usually appears in even the shortest of speech segments. Using the fast, CWT algorithm, the details of both the formant frequency and the glottal excitation characteristics can be easily extracted from voice waveforms. The differences in the glottal excitation waveforms as well as the formant frequency are evident in the CWT output. More significantly, the CWT reveals significant detail of the glottal excitation

  5. When speaker identity is unavoidable: Neural processing of speaker identity cues in natural speech.

    Science.gov (United States)

    Tuninetti, Alba; Chládková, Kateřina; Peter, Varghese; Schiller, Niels O; Escudero, Paola

    2017-11-01

    Speech sound acoustic properties vary largely across speakers and accents. When perceiving speech, adult listeners normally disregard non-linguistic variation caused by speaker or accent differences, in order to comprehend the linguistic message, e.g. to correctly identify a speech sound or a word. Here we tested whether the process of normalizing speaker and accent differences, facilitating the recognition of linguistic information, is found at the level of neural processing, and whether it is modulated by the listeners' native language. In a multi-deviant oddball paradigm, native and nonnative speakers of Dutch were exposed to naturally-produced Dutch vowels varying in speaker, sex, accent, and phoneme identity. Unexpectedly, the analysis of mismatch negativity (MMN) amplitudes elicited by each type of change shows a large degree of early perceptual sensitivity to non-linguistic cues. This finding on perception of naturally-produced stimuli contrasts with previous studies examining the perception of synthetic stimuli wherein adult listeners automatically disregard acoustic cues to speaker identity. The present finding bears relevance to speech normalization theories, suggesting that at an unattended level of processing, listeners are indeed sensitive to changes in fundamental frequency in natural speech tokens. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition

    OpenAIRE

    BenZeghiba, Mohamed Faouzi; Bourlard, Hervé

    2004-01-01

    This paper investigates a new approach to perform simultaneous speech and speaker recognition. The likelihood estimated by a speaker identification system is combined with the posterior probability estimated by the speech recognizer. So, the joint posterior probability of the pronounced word and the speaker identity is maximized. A comparison study with other standard techniques is carried out in three different applications, (1) closed set speech and speaker identification, (2) open set spee...

  7. Young Children's Sensitivity to Speaker Gender When Learning from Others

    Science.gov (United States)

    Ma, Lili; Woolley, Jacqueline D.

    2013-01-01

    This research explores whether young children are sensitive to speaker gender when learning novel information from others. Four- and 6-year-olds ("N" = 144) chose between conflicting statements from a male versus a female speaker (Studies 1 and 3) or decided which speaker (male or female) they would ask (Study 2) when learning about the functions…

  8. Speech Breathing in Speakers Who Use an Electrolarynx

    Science.gov (United States)

    Bohnenkamp, Todd A.; Stowell, Talena; Hesse, Joy; Wright, Simon

    2010-01-01

    Speakers who use an electrolarynx following a total laryngectomy no longer require pulmonary support for speech. Subsequently, chest wall movements may be affected; however, chest wall movements in these speakers are not well defined. The purpose of this investigation was to evaluate speech breathing in speakers who use an electrolarynx during…

  9. Improving Monaural Speaker Identification by Double-Talk Detection

    DEFF Research Database (Denmark)

    Saeidi, Rahim; Mowlaee, Pejman; Kinnunen, Tomi

    2010-01-01

    This paper describes a novel approach to improve monoaural speaker identification where two speakers are present in a single-microphone recording. The goal is to identify both of the underlying speakers in the given mixture. The proposed approach is composed of a double-talk detector (DTD) as a p...

  10. A Tutorial on Text-Independent Speaker Verification

    Directory of Open Access Journals (Sweden)

    Frédéric Bimbot

    2004-04-01

    Full Text Available This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step to deal with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications relative to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.

  11. Processing Speaker Variability in Repetition and Semantic/Associative Priming

    Science.gov (United States)

    Lee, Chao-Yang; Zhang, Yu

    2015-01-01

    The effect of speaker variability on accessing the form and meaning of spoken words was evaluated in two short-term priming experiments. In the repetition priming experiment, participants listened to repeated or unrelated prime-target pairs, in which the prime and target were produced by the same speaker or different speakers. The results showed…

  12. An experimental comparison of modelling techniques for speaker ...

    Indian Academy of Sciences (India)

    Feature extraction involves extracting speaker-specific features from the speech signal at reduced data rate. The extracted features are further combined using modelling techniques to generate speaker models. The speaker models are then tested using the features extracted from the test speech signal. The improvement in ...

  13. Accent Attribution in Speakers with Foreign Accent Syndrome

    Science.gov (United States)

    Verhoeven, Jo; De Pauw, Guy; Pettinato, Michele; Hirson, Allen; Van Borsel, John; Marien, Peter

    2013-01-01

    Purpose: The main aim of this experiment was to investigate the perception of Foreign Accent Syndrome in comparison to speakers with an authentic foreign accent. Method: Three groups of listeners attributed accents to conversational speech samples of 5 FAS speakers which were embedded amongst those of 5 speakers with a real foreign accent and 5…

  14. Infants' preferences for native speakers are associated with an expectation of information.

    Science.gov (United States)

    Begus, Katarina; Gliga, Teodora; Southgate, Victoria

    2016-11-01

    Humans' preference for others who share our group membership is well documented, and this heightened valuation of in-group members seems to be rooted in early development. Before 12 mo of age, infants already show behavioral preferences for others who evidence cues to same-group membership such as race or native language, yet the function of this selectivity remains unclear. We examine one of these social biases, the preference for native speakers, and propose that this preference may result from infants' motivation to obtain information and the expectation that interactions with native speakers will provide better opportunities for learning. To investigate this hypothesis, we measured EEG theta activity, a neural rhythm shown to index active and selective preparation for encoding information in adults. In study 1, we established that 11-mo-old infants exhibit an increase in theta activation in situations when they can expect to receive information. We then used this neural measure of anticipatory theta activity to explore the expectations of 11-mo-olds when facing social partners who either speak the infants' native language or a foreign tongue (study 2). A larger increase in theta oscillations was observed when infants could expect to receive information from the native speaker, indicating that infants were preparing to learn information from the native speaker to a greater extent than from the foreign speaker. While previous research has demonstrated that infants prefer to interact with knowledgeable others, the current experiments provide evidence that such an information-seeking motive may also underpin infants' demonstrated preference for native speakers.

  15. Speaker models and the English classroom: the impact of the intercultural-speaker teacher model in Norway

    OpenAIRE

    Hansen, Thomas

    2012-01-01

    In the present thesis, I examine the extent to which the intercultural-speaker model is acknowledged by teachers of English in Norway. As opposed to the native-speaker model, the model of the intercultural speaker focuses on the context in which intercultural communication takes place and how interlocutors can use their own background to master the interaction between them. My point of departure is that the teaching tradition in Norway is based on the native-speaker model,...

  16. Vocal caricatures reveal signatures of speaker identity

    Science.gov (United States)

    López, Sabrina; Riera, Pablo; Assaneo, María Florencia; Eguía, Manuel; Sigman, Mariano; Trevisan, Marcos A.

    2013-12-01

    What are the features that impersonators select to elicit a speaker's identity? We built a voice database of public figures (targets) and imitations produced by professional impersonators. They produced one imitation based on their memory of the target (caricature) and another one after listening to the target audio (replica). A set of naive participants then judged identity and similarity of pairs of voices. Identity was better evoked by the caricatures and replicas were perceived to be closer to the targets in terms of voice similarity. We used this data to map relevant acoustic dimensions for each task. Our results indicate that speaker identity is mainly associated with vocal tract features, while perception of voice similarity is related to vocal folds parameters. We therefore show the way in which acoustic caricatures emphasize identity features at the cost of loosing similarity, which allows drawing an analogy with caricatures in the visual space.

  17. Grammatical Planning Units during Real-Time Sentence Production in Speakers with Agrammatic Aphasia and Healthy Speakers

    Science.gov (United States)

    Lee, Jiyeon; Yoshida, Masaya; Thompson, Cynthia K.

    2015-01-01

    Purpose: Grammatical encoding (GE) is impaired in agrammatic aphasia; however, the nature of such deficits remains unclear. We examined grammatical planning units during real-time sentence production in speakers with agrammatic aphasia and control speakers, testing two competing models of GE. We queried whether speakers with agrammatic aphasia…

  18. English Speakers Attend More Strongly than Spanish Speakers to Manner of Motion when Classifying Novel Objects and Events

    Science.gov (United States)

    Kersten, Alan W.; Meissner, Christian A.; Lechuga, Julia; Schwartz, Bennett L.; Albrechtsen, Justin S.; Iglesias, Adam

    2010-01-01

    Three experiments provide evidence that the conceptualization of moving objects and events is influenced by one's native language, consistent with linguistic relativity theory. Monolingual English speakers and bilingual Spanish/English speakers tested in an English-speaking context performed better than monolingual Spanish speakers and bilingual…

  19. Methods of Speakers\\' Effects on the Audience

    Directory of Open Access Journals (Sweden)

    فریبا حسینی

    2010-09-01

    Full Text Available Methods of Speakers' Effects on the Audience    Nasrollah Shameli *   Fariba Hosayni **     Abstract   This article is focused on four issues. The first issue is related to the speaker's external appearance including the beauty of face, the power of his voice, moves and signals by hand, the stick and eyebrow as well as the height. Such characteristics could have an important effect on the audience. The second issue is related to internal features of the speaker. These include the ethics of the preacher , his/her piety and intention on the speakers based on their personalities, habits and emotions, knowledge and culture, and speed of learning. The third issue is concerned with the appearance of the lecture. Words should be clear enough as well as being mixed with Quranic verses, poetry and proverbs. The final issue is related to the content. It is argued that the subject of the talk should be in accordance with the level of understanding of listeners as well as being new and interesting for them.   3 - A phenomenon rhetoric: It was noted in this section How to give words and phrases so that these words and phrases are clear, correct, mixed in parables, governance and Quranic verses, and appropriate their meaning.   4 - the content of Oratory : It was noted in this section to the topic of Oratory and say that the Oratory should be the theme commensurate with the minds of audiences and also should mean that agree with the case may be, then I say: that the rhetoric if the theme was innovative and new is affecting more and more on the audience.     Key words : Oratory , Preacher , Audience, Influence of speech     * Associate Professor, Department of Arabic Language and Literature, University of Isfahan E-mail: Dr-Nasrolla Shameli@Yahoo.com   * * M.A. in Arabic Language and Literature from Isfahan University E-mail: faribahosayni@yahoo.com

  20. Using timing information in speaker verification

    CSIR Research Space (South Africa)

    Van Heerden, CJ

    2005-11-01

    Full Text Available ) linear predic- tor coefficients and mel frequency cepstral coefficients. However, it is likely that speakers also differ significantly in the duration assigned to segments they speak. As far as could be determined, such temporal information has..., with mel-frequency cepstral coefficients (MFCCs) as input features. HMMs were generated for all triphones, with Figure 1: Dark areas show acceptable areas where normal variance in temporal information will cause recurring in- stances of the same phoneme...

  1. Do We All Apologize the Same?--An Empirical Study on the Act of Apologizing by Spanish Speakers Learning English.

    Science.gov (United States)

    Mir, Montserrat

    1992-01-01

    A study examined the production of English apology strategies by Spanish speakers learning English, by analyzing the remedial move in native and non-native social interactions. To restore harmony when an offensive act has been committed, remedial exchanges are performed according to the rules of speaking and the social norms of the speech…

  2. A system of automatic speaker recognition on a minicomputer

    International Nuclear Information System (INIS)

    El Chafei, Cherif

    1978-01-01

    This study describes a system of automatic speaker recognition using the pitch of the voice. The pre-treatment consists in the extraction of the speakers' discriminating characteristics taken from the pitch. The programme of recognition gives, firstly, a preselection and then calculates the distance between the speaker's characteristics to be recognized and those of the speakers already recorded. An experience of recognition has been realized. It has been undertaken with 15 speakers and included 566 tests spread over an intermittent period of four months. The discriminating characteristics used offer several interesting qualities. The algorithms concerning the measure of the characteristics on one hand, the speakers' classification on the other hand, are simple. The results obtained in real time with a minicomputer are satisfactory. Furthermore they probably could be improved if we considered other speaker's discriminating characteristics but this was unfortunately not in our possibilities. (author) [fr

  3. Sequence Complexity Effects on Speech Production in Healthy Speakers and Speakers with Hypokinetic or Ataxic Dysarthria

    Science.gov (United States)

    Reilly, Kevin J.; Spencer, Kristie A.

    2013-01-01

    The present study investigated the effects of sequence complexity, defined in terms of phonemic similarity and phonotoactic probability, on the timing and accuracy of serial ordering for speech production in healthy speakers and speakers with either hypokinetic or ataxic dysarthria. Sequences were comprised of strings of consonant-vowel (CV) syllables with each syllable containing the same vowel, /a/, paired with a different consonant. High complexity sequences contained phonemically similar consonants, and sounds and syllables that had low phonotactic probabilities; low complexity sequences contained phonemically dissimilar consonants and high probability sounds and syllables. Sequence complexity effects were evaluated by analyzing speech error rates and within-syllable vowel and pause durations. This analysis revealed that speech error rates were significantly higher and speech duration measures were significantly longer during production of high complexity sequences than during production of low complexity sequences. Although speakers with dysarthria produced longer overall speech durations than healthy speakers, the effects of sequence complexity on error rates and speech durations were comparable across all groups. These findings indicate that the duration and accuracy of processes for selecting items in a speech sequence is influenced by their phonemic similarity and/or phonotactic probability. Moreover, this robust complexity effect is present even in speakers with damage to subcortical circuits involved in serial control for speech. PMID:24146997

  4. Sequence complexity effects on speech production in healthy speakers and speakers with hypokinetic or ataxic dysarthria.

    Directory of Open Access Journals (Sweden)

    Kevin J Reilly

    Full Text Available The present study investigated the effects of sequence complexity, defined in terms of phonemic similarity and phonotoactic probability, on the timing and accuracy of serial ordering for speech production in healthy speakers and speakers with either hypokinetic or ataxic dysarthria. Sequences were comprised of strings of consonant-vowel (CV syllables with each syllable containing the same vowel, /a/, paired with a different consonant. High complexity sequences contained phonemically similar consonants, and sounds and syllables that had low phonotactic probabilities; low complexity sequences contained phonemically dissimilar consonants and high probability sounds and syllables. Sequence complexity effects were evaluated by analyzing speech error rates and within-syllable vowel and pause durations. This analysis revealed that speech error rates were significantly higher and speech duration measures were significantly longer during production of high complexity sequences than during production of low complexity sequences. Although speakers with dysarthria produced longer overall speech durations than healthy speakers, the effects of sequence complexity on error rates and speech durations were comparable across all groups. These findings indicate that the duration and accuracy of processes for selecting items in a speech sequence is influenced by their phonemic similarity and/or phonotactic probability. Moreover, this robust complexity effect is present even in speakers with damage to subcortical circuits involved in serial control for speech.

  5. Bilingual children weigh speaker's referential cues and word-learning heuristics differently in different language contexts when interpreting a speaker's intent.

    Science.gov (United States)

    Hung, Wan-Yu; Patrycia, Ferninda; Yow, W Q

    2015-01-01

    Past research has investigated how children use different sources of information such as social cues and word-learning heuristics to infer referential intents. The present research explored how children weigh and use some of these cues to make referential inferences. Specifically, we examined how switching between languages known (familiar) or unknown (unfamiliar) to a child would influence his or her choice of cue to interpret a novel label in a challenging disambiguation task, where a pointing cue was pitted against the mutual exclusivity (ME) principle. Forty-eight 3-and 4-years-old English-Mandarin bilingual children listened to a story told either in English only (No-Switch), English and Mandarin (Familiar-Switch), English and Japanese (Unfamiliar-Switch), or English and English-sounding nonsense sentences (Nonsense-Switch). They were then asked to select an object (from a pair of familiar and novel objects) after hearing a novel label paired with the speaker's point at the familiar object, e.g., "Can you give me the blicket?" Results showed that children in the Familiar-Switch condition were more willing to relax ME to follow the speaker's point to pick the familiar object than those in the Unfamiliar-Switch condition, who were more likely to pick the novel object. No significant differences were found between the other conditions. Further analyses revealed that children in the Unfamiliar-Switch condition looked at the speaker longer than children in the other conditions when the switch happened. Our findings suggest that children weigh speakers' referential cues and word-learning heuristics differently in different language contexts while taking into account their communicative history with the speaker. There are important implications for general education and other learning efforts, such as designing learning games so that the history of credibility with the user is maintained and how learning may be best scaffolded in a helpful and trusting environment.

  6. Insight into the Attitudes of Speakers of Urban Meccan Hijazi Arabic towards their Dialect

    Directory of Open Access Journals (Sweden)

    Sameeha D. Alahmadi

    2016-04-01

    Full Text Available The current study mainly aims to examine the attitudes of speakers of Urban Meccan Hijazi Arabic (UMHA towards their dialect, which is spoken in Mecca, Saudi Arabia. It also investigates whether the participants’ age, sex and educational level have any impact on their perception of their dialect. To this end, I designed a 5-point-Likert-scale questionnaire, requiring participants to rate their attitudes towards their dialect. I asked 80 participants, whose first language is UMHA, to fill out the questionnaire. On the basis of the three independent variables, namely, age, sex and educational level, the participants were divided into three groups: old and young speakers, male and female speakers and educated and uneducated speakers. The results reveal that in general, all the groups (young and old, male and female, and educated and uneducated participants have a sense of responsibility towards their dialect, making their attitudes towards their dialect positive. However, differences exist between the three groups. For instance, old speakers tend to express their pride of their dialect more than young speakers. The same pattern is observed in male and female groups. The results show that females may feel embarrassed to provide answers that may imply that they are not proud of their own dialect, since the majority of women in the Arab world, in general, are under more pressure to conform to the overt norms of the society than males. Therefore, I argue that most Arab women may not have the same freedom to express their opinions and feelings about various issues. Based on the results, the study concludes with some recommendations for further research.  Keywords: sociolinguistics, language attitudes, dialectology, social variables, Urban Meccan Hijazi Arabic

  7. Tracking Intermittently Speaking Multiple Speakers Using a Particle Filter

    Directory of Open Access Journals (Sweden)

    Angela Quinlan

    2009-01-01

    Full Text Available The problem of tracking multiple intermittently speaking speakers is difficult as some distinct problems must be addressed. The number of active speakers must be estimated, these active speakers must be identified, and the locations of all speakers including inactive speakers must be tracked. In this paper we propose a method for tracking intermittently speaking multiple speakers using a particle filter. In the proposed algorithm the number of active speakers is firstly estimated based on the Exponential Fitting Test (EFT, a source number estimation technique which we have proposed. The locations of the speakers are then tracked using a particle filtering framework within which the decomposed likelihood is used in order to decouple the observed audio signal and associate each element of the decomposed signal with an active speaker. The tracking accuracy is then further improved by the inclusion of a silence region detection step and estimation of the noise-only covariance matrix. The method was evaluated using live recordings of 3 speakers and the results show that the method produces highly accurate tracking results.

  8. Speaker diarization system using HXLPS and deep neural network

    Directory of Open Access Journals (Sweden)

    V. Subba Ramaiah

    2018-03-01

    Full Text Available In general, speaker diarization is defined as the process of segmenting the input speech signal and grouped the homogenous regions with regard to the speaker identity. The main idea behind this system is that it is able to discriminate the speaker signal by assigning the label of the each speaker signal. Due to rapid growth of broadcasting and meeting, the speaker diarization is burdensome to enhance the readability of the speech transcription. In order to solve this issue, Holoentropy with the eXtended Linear Prediction using autocorrelation Snapshot (HXLPS and deep neural network (DNN is proposed for the speaker diarization system. The HXLPS extraction method is newly developed by incorporating the Holoentropy with the XLPS. Once we attain the features, the speech and non-speech signals are detected by the Voice Activity Detection (VAD method. Then, i-vector representation of every segmented signal is obtained using Universal Background Model (UBM model. Consequently, DNN is utilized to assign the label for the speaker signal which is then clustered according to the speaker label. The performance is analysed using the evaluation metrics, such as tracking distance, false alarm rate and diarization error rate. The outcome of the proposed method ensures the better diarization performance by achieving the lower DER of 1.36% based on lambda value and DER of 2.23% depends on the frame length. Keywords: Speaker diarization, HXLPS feature extraction, Voice activity detection, Deep neural network, Speaker clustering, Diarization Error Rate (DER

  9. Coffee Can Speakers: Amazing Energy Transformers--Fifth-Grade Students Learn the Science behind Speakers

    Science.gov (United States)

    Wise, Kevin; Haake, Monica

    2007-01-01

    In this article, the authors describe steps on how to develop a high-impact activity in which students build, test, and improve their own "coffee can" speakers to observe firsthand how loudspeakers work to convert electrical energy to sound. The activity is appropriate for students in grades three to six and lends itself best to students…

  10. Speech, speaker and speaker's gender identification in automatically processed broadcast stream

    Czech Academy of Sciences Publication Activity Database

    Silovský, J.; Nouza, Jan

    2006-01-01

    Roč. 15, č. 3 (2006), s. 42-48 ISSN 1210-2512 R&D Projects: GA AV ČR(CZ) 1QS108040569 Institutional research plan: CEZ:AV0Z20670512 Keywords : speech processing * speaker recognition Subject RIV: JD - Computer Applications, Robotics

  11. Guest speakers and internationalization in higher education: A critical reflection of guest speakers in tourism programmes

    Directory of Open Access Journals (Sweden)

    Abdallah Ali

    2016-01-01

    Full Text Available At its general term, this paper discusses the role of guest lecturers in higher education. More precisely, the paper considers the role of guest lecturers in tourism Management programmes taught to student in Universities. The paper examines theory and literature relating to tourism studies and higher education in general. It discusses the relevance of guest speakers in universities and examines their contributions towards tourism programmes. The paper discusses lecturers in tourism degrees suggesting that they might lack industry experiences but are still required to deliver theories to students and are seen as classroom managers and student mentors. The paper further investigates how lecturers and guest speakers are perceived as two different entities by students where lecturers are seen as classroom managers and guest speakers are seen as the industry knowledge providers. As internationalization of education becomes the focus of higher education institutions that aim to deliver education in a more international context, guest speakers that address local practical occurrences become less appealing. With the rapid spread of distant learning and online education, the paper concludes that guest lecturers are not the solution for perfect academic experiences in higher education in general and specifically in tourism programmes.

  12. Speaker-dependent Dictionary-based Speech Enhancement for Text-Dependent Speaker Verification

    DEFF Research Database (Denmark)

    Thomsen, Nicolai Bæk; Thomsen, Dennis Alexander Lehmann; Tan, Zheng-Hua

    2016-01-01

    The problem of text-dependent speaker verification under noisy conditions is becoming ever more relevant, due to increased usage for authentication in real-world applications. Classical methods for noise reduction such as spectral subtraction and Wiener filtering introduce distortion and do not p...... on dictionary-based noise reduction and compare it to the baseline methods....

  13. Dialect Accent Features for Establishing Speaker Identity A Case Study

    CERN Document Server

    Kulshreshtha, Manisha

    2012-01-01

    Dialect Accent Features for Establishing Speaker Identity: A Case Study discusses the subject of forensic voice identification and speaker profiling. Specifically focusing on speaker profiling and using dialects of the Hindi language, widely used in India, the authors have contributed to the body of research on speaker identification by using accent feature as the discriminating factor. This case study contributes to the understanding of the speaker identification process in a situation where unknown speech samples are in different language/dialect than the recording of a suspect. The authors' data establishes that vowel quality, quantity, intonation and tone of a speaker as compared to Khariboli (standard Hindi) could be the potential features for identification of dialect accent.

  14. The Unforeseen Consequences of Interacting With Non-Native Speakers.

    Science.gov (United States)

    Lev-Ari, Shiri; Ho, Emily; Keysar, Boaz

    2018-02-07

    Sociolinguistic research shows that listeners' expectations of speakers influence their interpretation of the speech, yet this is often ignored in cognitive models of language comprehension. Here, we focus on the case of interactions between native and non-native speakers. Previous literature shows that listeners process the language of non-native speakers in less detail, because they expect them to have lower linguistic competence. We show that processing the language of non-native speakers increases lexical competition and access in general, not only of the non-native speaker's speech, and that this leads to poorer memory of one's own speech during the interaction. We further find that the degree to which people adjust their processing to non-native speakers is related to the degree to which they adjust their speech to them. We discuss implications for cognitive models of language processing and sociolinguistic research on attitudes. Copyright © 2018 Cognitive Science Society, Inc.

  15. Real Time Recognition Of Speakers From Internet Audio Stream

    Directory of Open Access Journals (Sweden)

    Weychan Radoslaw

    2015-09-01

    Full Text Available In this paper we present an automatic speaker recognition technique with the use of the Internet radio lossy (encoded speech signal streams. We show an influence of the audio encoder (e.g., bitrate on the speaker model quality. The model of each speaker was calculated with the use of the Gaussian mixture model (GMM approach. Both the speaker recognition and the further analysis were realized with the use of short utterances to facilitate real time processing. The neighborhoods of the speaker models were analyzed with the use of the ISOMAP algorithm. The experiments were based on four 1-hour public debates with 7–8 speakers (including the moderator, acquired from the Polish radio Internet services. The presented software was developed with the MATLAB environment.

  16. Perceptual prothesis in native Spanish speakers

    Science.gov (United States)

    Theodore, Rachel M.; Schmidt, Anna M.

    2003-04-01

    Previous research suggests a perceptual bias exists for native phonotactics [D. Massaro and M. Cohen, Percept. Psychophys. 34, 338-348 (1983)] such that listeners report nonexistent segments when listening to stimuli that violate native phonotactics [E. Dupoux, K. Kakehi, Y. Hirose, C. Pallier, and J. Mehler, J. Exp. Psychol.: Human Percept. Perform. 25, 1568-1578 (1999)]. This study investigated how native-language experience affects second language processing, focusing on how native Spanish speakers perceive the English clusters /st/, /sp/, and /sk/, which represent phonotactically illegal forms in Spanish. To preserve native phonotactics, Spanish speakers often produce prothetic vowels before English words beginning with /s/ clusters. Is the influence of native phonotactics also present in the perception of illegal clusters? A stimuli continuum ranging from no vowel (e.g., ``sku'') to a full vowel (e.g., ``esku'') before the cluster was used. Four final vowel contexts were used for each cluster, resulting in 12 sCV and 12 VsCV nonword endpoints. English and Spanish listeners were asked to discriminate between pairs differing in vowel duration and to identify the presence or absence of a vowel before the cluster. Results will be discussed in terms of implications for theories of second language speech perception.

  17. Living with acquired dysarthria: the speaker's perspective.

    Science.gov (United States)

    Walshe, Margaret; Miller, Nick

    2011-01-01

    To explore the speaker's experience of living with acquired chronic dysarthria. Ten people with dysarthria and progressive neurological illness and one person with dysarthria following stroke were interviewed in depth about their experience of living with dysarthria. They covered a range of ages, time post-onset and dysarthria severity levels. Interviews were transcribed and analysed using Framework Method of Analysis. Acquired dysarthria can negatively impact on speakers' lives. Findings here suggest that the experience of living with dysarthria is highly individual. There were some common perspectives. Six key themes emerged from interviews: 'dysarthria as only part of the picture', 'communication has changed', 'people treat me differently', 'dysarthria resulting in negative emotions', 'barriers to communication' and 'life is different now. The impact of co-existing physical disability and the need to consider dysarthria in context was emphasised by all participants. Findings re-emphasise the need to consider the individual experience in clinical practice. The findings provide direction for assessment and intervention in the area.

  18. Visual speaker gender affects vowel identification in Danish

    DEFF Research Database (Denmark)

    Larsen, Charlotte; Tøndering, John

    2013-01-01

    The experiment examined the effect of visual speaker gender on the vowel perception of 20 native Danish-speaking subjects. Auditory stimuli consisting of a continuum between /muːlə/ ‘muzzle’ and /moːlə/ ‘pier’ generated using TANDEM-STRAIGHT matched with video clips of a female and a male speaker...... were used to determine whether visual speaker gender affected Danish listeners similarly to American English-speaking listeners tested in a similar way....

  19. Learning speaker-specific characteristics with a deep neural architecture.

    Science.gov (United States)

    Chen, Ke; Salman, Ahmad

    2011-11-01

    Speech signals convey various yet mixed information ranging from linguistic to speaker-specific information. However, most of acoustic representations characterize all different kinds of information as whole, which could hinder either a speech or a speaker recognition (SR) system from producing a better performance. In this paper, we propose a novel deep neural architecture (DNA) especially for learning speaker-specific characteristics from mel-frequency cepstral coefficients, an acoustic representation commonly used in both speech recognition and SR, which results in a speaker-specific overcomplete representation. In order to learn intrinsic speaker-specific characteristics, we come up with an objective function consisting of contrastive losses in terms of speaker similarity/dissimilarity and data reconstruction losses used as regularization to normalize the interference of non-speaker-related information. Moreover, we employ a hybrid learning strategy for learning parameters of the deep neural networks: i.e., local yet greedy layerwise unsupervised pretraining for initialization and global supervised learning for the ultimate discriminative goal. With four Linguistic Data Consortium (LDC) benchmarks and two non-English corpora, we demonstrate that our overcomplete representation is robust in characterizing various speakers, no matter whether their utterances have been used in training our DNA, and highly insensitive to text and languages spoken. Extensive comparative studies suggest that our approach yields favorite results in speaker verification and segmentation. Finally, we discuss several issues concerning our proposed approach.

  20. Affective processing in bilingual speakers: disembodied cognition?

    Science.gov (United States)

    Pavlenko, Aneta

    2012-01-01

    A recent study by Keysar, Hayakawa, and An (2012) suggests that "thinking in a foreign language" may reduce decision biases because a foreign language provides a greater emotional distance than a native tongue. The possibility of such "disembodied" cognition is of great interest for theories of affect and cognition and for many other areas of psychological theory and practice, from clinical and forensic psychology to marketing, but first this claim needs to be properly evaluated. The purpose of this review is to examine the findings of clinical, introspective, cognitive, psychophysiological, and neuroimaging studies of affective processing in bilingual speakers in order to identify converging patterns of results, to evaluate the claim about "disembodied cognition," and to outline directions for future inquiry. The findings to date reveal two interrelated processing effects. First-language (L1) advantage refers to increased automaticity of affective processing in the L1 and heightened electrodermal reactivity to L1 emotion-laden words. Second-language (L2) advantage refers to decreased automaticity of affective processing in the L2, which reduces interference effects and lowers electrodermal reactivity to negative emotional stimuli. The differences in L1 and L2 affective processing suggest that in some bilingual speakers, in particular late bilinguals and foreign language users, respective languages may be differentially embodied, with the later learned language processed semantically but not affectively. This difference accounts for the reduction of framing biases in L2 processing in the study by Keysar et al. (2012). The follow-up discussion identifies the limits of the findings to date in terms of participant populations, levels of processing, and types of stimuli, puts forth alternative explanations of the documented effects, and articulates predictions to be tested in future research.

  1. Who spoke when? Audio-based speaker location estimation for diarization

    NARCIS (Netherlands)

    Dadvar, M.

    2011-01-01

    Speaker diarization is the process which detects active speakers and groups those speech signals which has been uttered by the same speaker. Generally we can find two main applications for speaker diarization. Automatic Speech Recognition systems make use of the speaker homogeneous clusters to adapt

  2. Teaching First Language Speakers to Communicate across Linguistic Difference: Addressing Attitudes, Comprehension, and Strategies

    Science.gov (United States)

    Subtirelu, Nicholas Close; Lindemann, Stephanie

    2016-01-01

    While most research in applied linguistics has focused on second language (L2) speakers and their language capabilities, the success of interaction between such speakers and first language (L1) speakers also relies on the positive attitudes and communication skills of the L1 speakers. However, some research has suggested that many L1 speakers lack…

  3. An experimental comparison of modelling techniques for speaker ...

    Indian Academy of Sciences (India)

    Most of the existing modelling techniques for the speaker recognition task make an implicit assumption of sufficient data for speaker modelling and hence may lead to poor modelling under limited data condition. The present work gives an experimental evaluation of the modelling techniques like Crisp Vector Quantization ...

  4. Profiles of an Acquisition Generation: Nontraditional Heritage Speakers of Spanish

    Science.gov (United States)

    DeFeo, Dayna Jean

    2018-01-01

    Though definitions vary, the literature on heritage speakers of Spanish identifies two primary attributes: a linguistic and cultural connection to the language. This article profiles four Anglo college students who grew up in bilingual or Spanish-dominant communities in the Southwest who self-identified as Spanish heritage speakers, citing…

  5. Optimization of multilayer neural network parameters for speaker recognition

    Science.gov (United States)

    Tovarek, Jaromir; Partila, Pavol; Rozhon, Jan; Voznak, Miroslav; Skapa, Jan; Uhrin, Dominik; Chmelikova, Zdenka

    2016-05-01

    This article discusses the impact of multilayer neural network parameters for speaker identification. The main task of speaker identification is to find a specific person in the known set of speakers. It means that the voice of an unknown speaker (wanted person) belongs to a group of reference speakers from the voice database. One of the requests was to develop the text-independent system, which means to classify wanted person regardless of content and language. Multilayer neural network has been used for speaker identification in this research. Artificial neural network (ANN) needs to set parameters like activation function of neurons, steepness of activation functions, learning rate, the maximum number of iterations and a number of neurons in the hidden and output layers. ANN accuracy and validation time are directly influenced by the parameter settings. Different roles require different settings. Identification accuracy and ANN validation time were evaluated with the same input data but different parameter settings. The goal was to find parameters for the neural network with the highest precision and shortest validation time. Input data of neural networks are a Mel-frequency cepstral coefficients (MFCC). These parameters describe the properties of the vocal tract. Audio samples were recorded for all speakers in a laboratory environment. Training, testing and validation data set were split into 70, 15 and 15 %. The result of the research described in this article is different parameter setting for the multilayer neural network for four speakers.

  6. Schizophrenia among Sesotho speakers in South Africa | Mosotho ...

    African Journals Online (AJOL)

    Objective: The aim of this study is to investigate the clinical presentation of schizophrenia among Sesotho speakers. Method: A sample of 100 participants diagnosed with schizophrenia was evaluated using the Psychiatric Interview Questionnaire. Results: Core symptoms of schizophrenia among Sesotho speakers do not ...

  7. Deixis in Spontaneous Speech of Jordanian Urban Arabic Native Speakers

    Science.gov (United States)

    Sa'aida, Zainab

    2017-01-01

    This study aims at describing types and usages of deixis in the speech of Jordanian Urban Arabic native speakers. The present study was conducted in different settings which researcher's family members, friends, colleagues, and acquaintances took part in. Data of the study were collected through observing spontaneous speech of native speakers of…

  8. Request Strategies in Everyday Interactions of Persian and English Speakers

    Directory of Open Access Journals (Sweden)

    Shiler Yazdanfar

    2016-12-01

    Full Text Available Cross-cultural studies of speech acts in different linguistic contexts might have interesting implications for language researchers and practitioners. Drawing on the Speech Act Theory, the present study aimed at conducting a comparative study of request speech act in Persian and English. Specifically, the study endeavored to explore the request strategies used in daily interactions of Persian and English speakers based on directness level and supportive moves. To this end, English and Persian TV series were observed and requestive utterances were transcribed. The utterances were then categorized based on Blum-Kulka and Olshtain’s Cross-Cultural Study of Speech Act Realization Pattern (CCSARP for directness level and internal and external mitigation devises. According to the results, although speakers of both languages opted for the direct level as their most frequently used strategy in their daily interactions, the English speakers used more conventionally indirect strategies than the Persian speakers did, and the Persian speakers used more non-conventionally indirect strategies than the English speakers did. Furthermore, the analyzed data revealed the fact that American English speakers use more mitigation devices in their daily interactions with friends and family members than Persian speakers.

  9. A neural mechanism for recognizing speech spoken by different speakers.

    Science.gov (United States)

    Kreitewolf, Jens; Gaudrain, Etienne; von Kriegstein, Katharina

    2014-05-01

    Understanding speech from different speakers is a sophisticated process, particularly because the same acoustic parameters convey important information about both the speech message and the person speaking. How the human brain accomplishes speech recognition under such conditions is unknown. One view is that speaker information is discarded at early processing stages and not used for understanding the speech message. An alternative view is that speaker information is exploited to improve speech recognition. Consistent with the latter view, previous research identified functional interactions between the left- and the right-hemispheric superior temporal sulcus/gyrus, which process speech- and speaker-specific vocal tract parameters, respectively. Vocal tract parameters are one of the two major acoustic features that determine both speaker identity and speech message (phonemes). Here, using functional magnetic resonance imaging (fMRI), we show that a similar interaction exists for glottal fold parameters between the left and right Heschl's gyri. Glottal fold parameters are the other main acoustic feature that determines speaker identity and speech message (linguistic prosody). The findings suggest that interactions between left- and right-hemispheric areas are specific to the processing of different acoustic features of speech and speaker, and that they represent a general neural mechanism when understanding speech from different speakers. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. Teaching Portuguese to Spanish Speakers: A Case for Trilingualism

    Science.gov (United States)

    Carvalho, Ana M.; Freire, Juliana Luna; da Silva, Antonio J. B.

    2010-01-01

    Portuguese is the sixth-most-spoken native language in the world, with approximately 240,000,000 speakers. Within the United States, there is a growing demand for K-12 language programs to engage the community of Portuguese heritage speakers. According to the 2000 U.S. census, 85,000 school-age children speak Portuguese at home. As a result, more…

  11. An experimental comparison of modelling techniques for speaker ...

    Indian Academy of Sciences (India)

    Abstract. Most of the existing modelling techniques for the speaker recog- nition task make an implicit assumption of sufficient data for speaker modelling and hence may lead to poor modelling under limited data condition. The present work gives an experimental evaluation of the modelling techniques like Crisp.

  12. Mismatch: Globalization and Native Speaker Models of Linguistic Competence

    Science.gov (United States)

    Hodgson, Kevin Michael

    2014-01-01

    Although the paradigm shift towards English as an International Language (EIL) has been generally accepted within the academic community, a valorization of native speaker norms continues to be prevalent among many non-native speakers (NNSs). Through data drawn from a qualitative questionnaire and proficiency assessment results (TOEIC), this mixed…

  13. Intonation and Gesture as Bootstrapping Devices in Speaker Uncertainty

    Science.gov (United States)

    Hübscher, Iris; Esteve-Gibert, Núria; Igualada, Alfonso; Prieto, Pilar

    2017-01-01

    This study investigates 3- to 5-year-old children's sensitivity to lexical, intonational and gestural information in the comprehension of speaker uncertainty. Most previous studies on children's understanding of speaker certainty and uncertainty across languages have focused on the comprehension of lexical markers, and little is known about the…

  14. Initial Teacher Training Courses and Non-Native Speaker Teachers

    Science.gov (United States)

    Anderson, Jason

    2016-01-01

    This article reports on a study contrasting 41 native speakers (NSs) and 38 non-native speakers (NNSs) of English from two short initial teacher training courses, the Cambridge Certificate in English Language Teaching to Adults and the Trinity College London CertTESOL. After a brief history and literature review, I present findings on teachers'…

  15. (En)countering native-speakerism global perspectives

    CERN Document Server

    Holliday, Adrian; Swan, Anne

    2015-01-01

    The book addresses the issue of native-speakerism, an ideology based on the assumption that 'native speakers' of English have a special claim to the language itself, through critical qualitative studies of the lived experiences of practising teachers and students in a range of scenarios.

  16. Internal request modification by first and second language speakers ...

    African Journals Online (AJOL)

    This study focuses on the question of whether Luganda English speakers would negatively transfer into their English speech the use of syntactic and lexical down graders resulting in pragmatic failure. Data were collected from Luganda and Luganda English speakers by means of a Discourse Completion Test (DCT) ...

  17. Dysprosody and Stimulus Effects in Cantonese Speakers with Parkinson's Disease

    Science.gov (United States)

    Ma, Joan K.-Y.; Whitehill, Tara; Cheung, Katherine S.-K.

    2010-01-01

    Background: Dysprosody is a common feature in speakers with hypokinetic dysarthria. However, speech prosody varies across different types of speech materials. This raises the question of what is the most appropriate speech material for the evaluation of dysprosody. Aims: To characterize the prosodic impairment in Cantonese speakers with…

  18. Guest Speakers in School-Based Sexuality Education

    Science.gov (United States)

    McRee, Annie-Laurie; Madsen, Nikki; Eisenberg, Marla E.

    2014-01-01

    This study, using data from a statewide survey (n = 332), examined teachers' practices regarding the inclusion of guest speakers to cover sexuality content. More than half of teachers (58%) included guest speakers. In multivariate analyses, teachers who taught high school, had professional preparation in health education, or who received…

  19. The Moderating Effect of Valenced Contact: Slovak Language Media Use, Acculturation and L2 Confidence among Young Hungarian Speakers

    Science.gov (United States)

    Vincze, Laszlo; Gasiorek, Jessica

    2016-01-01

    Integrating the tenets of the social context model of L2 acquisition with insights from the parasocial contact hypothesis, the present paper addresses the role of mass media in L2 acquisition and acculturation among young Hungarian speakers in Slovakia. Questionnaire data were collected among Hungarian-speaking secondary school students (N = 310).…

  20. Data-Model Relationship in Text-Independent Speaker Recognition

    Directory of Open Access Journals (Sweden)

    Stapert Robert

    2005-01-01

    Full Text Available Text-independent speaker recognition systems such as those based on Gaussian mixture models (GMMs do not include time sequence information (TSI within the model itself. The level of importance of TSI in speaker recognition is an interesting question and one addressed in this paper. Recent works has shown that the utilisation of higher-level information such as idiolect, pronunciation, and prosodics can be useful in reducing speaker recognition error rates. In accordance with these developments, the aim of this paper is to show that as more data becomes available, the basic GMM can be enhanced by utilising TSI, even in a text-independent mode. This paper presents experimental work incorporating TSI into the conventional GMM. The resulting system, known as the segmental mixture model (SMM, embeds dynamic time warping (DTW into a GMM framework. Results are presented on the 2000-speaker SpeechDat Welsh database which show improved speaker recognition performance with the SMM.

  1. The effect of listening to others remember on subsequent memory: The roles of expertise and trust in socially shared retrieval-induced forgetting and social contagion

    DEFF Research Database (Denmark)

    Koppel, Jonathan Mark; Wohl, Dana; Meksin, Robert

    2014-01-01

    Speakers reshape listeners’ memories through at least two discrete means: (1) social contagion and (2) socially shared retrieval-induced forgetting (SS-RIF). Three experiments explored how social relationships between speaker and listener moderate these conversational effects, focusing specifically...... on two speaker characteristics, expertise and trustworthiness. We examined their effect on SS-RIF and contrasted, within-subjects, their effects on both SS-RIF and the previously studied social contagion. Experiments 1 and 2 explored the effects of perceived expertise; Experiment 3 explored trust. We...... found (1) that speakers who were perceived as experts induced greater levels of social contagion and lower levels of SS-RIF than non-expert speakers, and (2) that, likewise, trust in the speaker had similar mnemonic consequences, in that neutral speakers induced more social contagion and less SS...

  2. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST Speaker Recognition Evaluation 2006

    NARCIS (Netherlands)

    Brümmer, N.; Burget, L.; Černocký, J.H.; Glembek, O.; Grézl, F.; Karafiát, M.; Leeuwen, D.A. van; Matějka, P.; Schwarz, P.; Strasheim, A.

    2007-01-01

    This paper describes and discusses the "STBU" speaker recognition system, which performed well in the NIST Speaker Recognition Evaluation 2006 (SRE). STBU is a consortium of four partners: Spescom DataVoice (Stellenbosch, South Africa), TNO (Soesterberg, The Netherlands), BUT (Brno, Czech Republic),

  3. Revisiting the Issue of Native Speakerism: "I Don't Want to Speak Like a Native Speaker of English"

    Science.gov (United States)

    Choi, Lee Jin

    2016-01-01

    This qualitative study of English Korean bilinguals explores the ways in which they legitimize themselves as "good" bilinguals in relation to the discourse of native-speakerism. I first survey the essentialist discourse of native speakerism still prevalent in the field of English language teaching and learning despite the growing…

  4. Speech serial control in healthy speakers and speakers with hypokinetic or ataxic dysarthria: Effects of sequence length and practice

    Directory of Open Access Journals (Sweden)

    Kevin J Reilly

    2013-10-01

    Full Text Available The current study investigated the processes responsible for selection of sounds and syllables during production of speech sequences in 10 adults with hypokinetic dysarthria from Parkinson’s disease, 5 adults with ataxic dysarthria, and 14 healthy control speakers. Speech production data from a choice reaction time task were analyzed to evaluate the effects of sequence length and practice on speech sound sequencing. Speakers produced sequences that were between one and five syllables in length over five experimental runs of 60 trials each. In contrast to the healthy speakers, speakers with hypokinetic dysarthria demonstrated exaggerated sequence length effects for both inter-syllable intervals (ISIs and speech error rates. Conversely, speakers with ataxic dysarthria failed to demonstrate a sequence length effect on ISIs and were also the only group that did not exhibit practice-related changes in ISIs and speech error rates over the five experimental runs. The exaggerated sequence length effects in the hypokinetic speakers with Parkinson’s disease are consistent with an impairment of action selection during speech sequence production. The absent length effects observed in the speakers with ataxic dysarthria is consistent with previous findings that indicate a limited capacity to buffer speech sequences in advance of their execution. In addition, the lack of practice effects in these speakers suggests that learning-related improvements in the production rate and accuracy of speech sequences involves processing by structures of the cerebellum. Together, the current findings inform models of serial control for speech in healthy speakers and support the notion that sequencing deficits contribute to speech symptoms in speakers with hypokinetic or ataxic dysarthria. In addition, these findings indicate that speech sequencing is differentially impaired in hypokinetic and ataxic dysarthria.

  5. Speech serial control in healthy speakers and speakers with hypokinetic or ataxic dysarthria: effects of sequence length and practice

    Science.gov (United States)

    Reilly, Kevin J.; Spencer, Kristie A.

    2013-01-01

    The current study investigated the processes responsible for selection of sounds and syllables during production of speech sequences in 10 adults with hypokinetic dysarthria from Parkinson’s disease, five adults with ataxic dysarthria, and 14 healthy control speakers. Speech production data from a choice reaction time task were analyzed to evaluate the effects of sequence length and practice on speech sound sequencing. Speakers produced sequences that were between one and five syllables in length over five experimental runs of 60 trials each. In contrast to the healthy speakers, speakers with hypokinetic dysarthria demonstrated exaggerated sequence length effects for both inter-syllable intervals (ISIs) and speech error rates. Conversely, speakers with ataxic dysarthria failed to demonstrate a sequence length effect on ISIs and were also the only group that did not exhibit practice-related changes in ISIs and speech error rates over the five experimental runs. The exaggerated sequence length effects in the hypokinetic speakers with Parkinson’s disease are consistent with an impairment of action selection during speech sequence production. The absent length effects observed in the speakers with ataxic dysarthria is consistent with previous findings that indicate a limited capacity to buffer speech sequences in advance of their execution. In addition, the lack of practice effects in these speakers suggests that learning-related improvements in the production rate and accuracy of speech sequences involves processing by structures of the cerebellum. Together, the current findings inform models of serial control for speech in healthy speakers and support the notion that sequencing deficits contribute to speech symptoms in speakers with hypokinetic or ataxic dysarthria. In addition, these findings indicate that speech sequencing is differentially impaired in hypokinetic and ataxic dysarthria. PMID:24137121

  6. NES++: number system for encryption based privacy preserving speaker verification

    Science.gov (United States)

    Xu, Lei; Feng, Tao; Zhao, Xi; Shi, Weidong

    2014-05-01

    As speech based operation becomes a main hand-free interaction solution between human and mobile devices (i.e., smartphones, Google Glass), privacy preserving speaker verification receives much attention nowadays. Privacy preserving speaker verification can be achieved through many different ways, such as fuzzy vault and encryption. Encryption based solutions are promising as cryptography is based on solid mathematic foundations and the security properties can be easily analyzed in a well established framework. Most current asymmetric encryption schemes work on finite algebraic structures, such as finite group and finite fields. However, the encryption scheme for privacy preserving speaker verification must handle floating point numbers. This gap must be filled to make the overall scheme practical. In this paper, we propose a number system that meets the requirements of both speaker verification and the encryption scheme used in the process. It also supports addition homomorphic property of Pailliers encryption, which is crucial for privacy preserving speaker verification. As asymmetric encryption is expensive, we propose a method of packing several numbers into one plain-text and the computation overhead is greatly reduced. To evaluate the performance of this method, we implement Pailliers encryption scheme over proposed number system and the packing technique. Our findings show that the proposed solution can fulfill the gap between speaker verification and encryption scheme very well, and the packing technique improves the overall performance. Furthermore, our solution is a building block of encryption based privacy preserving speaker verification, the privacy protection and accuracy rate are not affected.

  7. Presentation skills for the reluctant speaker.

    Science.gov (United States)

    Garon, J E

    1999-01-01

    Presentation skills are vital to clinical systems managers. This article covers four steps to successful presentations: 1) tailoring for an audience, 2) organizing a presentation, 3) mastering presentation techniques, and 4) creating effective visual aids. Tailoring for the audience entails learning about the audience and matching the presentation to their knowledge, educational level, and interests. Techniques to curry favor with an audience include: establishing common ground, relating through universal experiences, and pushing "hot buttons." Tasks involved in organizing the presentation for maximum audience interest begin with arranging the key points in a transparent organizational scheme. Audience attention is sustained using "hooks," such as graphics, anecdotes, humor, and quotations. Basic presentation techniques include appropriate rehearsal, effective eye contact with an audience, and anxiety-reducing strategies. Visual aids include flip charts, slides, transparencies, and computer presentations. Criteria for selecting the type of visual aids are delineated based on audience size and type of presentation, along with respective advantages and disadvantages. The golden rule for presentations is "Never show a slide for which you have to apologize." Rules to maximize visibility and effectiveness, including use of standard templates, sans serif fonts, dark backgrounds with light letters, mixed cases, and effective graphics, ensure that slides or projected computer images are clear and professional. Taken together, these strategies will enhance the delivery of the presentation and decrease the speaker's anxiety.

  8. English vowel learning by speakers of Mandarin

    Science.gov (United States)

    Thomson, Ron I.

    2005-04-01

    One of the most influential models of second language (L2) speech perception and production [Flege, Speech Perception and Linguistic Experience (York, Baltimore, 1995) pp. 233-277] argues that during initial stages of L2 acquisition, perceptual categories sharing the same or nearly the same acoustic space as first language (L1) categories will be processed as members of that L1 category. Previous research has generally been limited to testing these claims on binary L2 contrasts, rather than larger portions of the perceptual space. This study examines the development of 10 English vowel categories by 20 Mandarin L1 learners of English. Imitation of English vowel stimuli by these learners, at 6 data collection points over the course of one year, were recorded. Using a statistical pattern recognition model, these productions were then assessed against native speaker norms. The degree to which the learners' perception/production shifted toward the target English vowels and the degree to which they matched L1 categories in ways predicted by theoretical models are discussed. The results of this experiment suggest that previous claims about perceptual assimilation of L2 categories to L1 categories may be too strong.

  9. Human and automatic speaker recognition over telecommunication channels

    CERN Document Server

    Fernández Gallardo, Laura

    2016-01-01

    This work addresses the evaluation of the human and the automatic speaker recognition performances under different channel distortions caused by bandwidth limitation, codecs, and electro-acoustic user interfaces, among other impairments. Its main contribution is the demonstration of the benefits of communication channels of extended bandwidth, together with an insight into how speaker-specific characteristics of speech are preserved through different transmissions. It provides sufficient motivation for considering speaker recognition as a criterion for the migration from narrowband to enhanced bandwidths, such as wideband and super-wideband.

  10. Speaker gaze increases information coupling between infant and adult brains.

    Science.gov (United States)

    Leong, Victoria; Byrne, Elizabeth; Clackson, Kaili; Georgieva, Stanimira; Lam, Sarah; Wass, Sam

    2017-12-12

    When infants and adults communicate, they exchange social signals of availability and communicative intention such as eye gaze. Previous research indicates that when communication is successful, close temporal dependencies arise between adult speakers' and listeners' neural activity. However, it is not known whether similar neural contingencies exist within adult-infant dyads. Here, we used dual-electroencephalography to assess whether direct gaze increases neural coupling between adults and infants during screen-based and live interactions. In experiment 1 ( n = 17), infants viewed videos of an adult who was singing nursery rhymes with ( i ) direct gaze (looking forward), ( ii ) indirect gaze (head and eyes averted by 20°), or ( iii ) direct-oblique gaze (head averted but eyes orientated forward). In experiment 2 ( n = 19), infants viewed the same adult in a live context, singing with direct or indirect gaze. Gaze-related changes in adult-infant neural network connectivity were measured using partial directed coherence. Across both experiments, the adult had a significant (Granger) causal influence on infants' neural activity, which was stronger during direct and direct-oblique gaze relative to indirect gaze. During live interactions, infants also influenced the adult more during direct than indirect gaze. Further, infants vocalized more frequently during live direct gaze, and individual infants who vocalized longer also elicited stronger synchronization from the adult. These results demonstrate that direct gaze strengthens bidirectional adult-infant neural connectivity during communication. Thus, ostensive social signals could act to bring brains into mutual temporal alignment, creating a joint-networked state that is structured to facilitate information transfer during early communication and learning. Copyright © 2017 the Author(s). Published by PNAS.

  11. Forensic Speaker Recognition Law Enforcement and Counter-Terrorism

    CERN Document Server

    Patil, Hemant

    2012-01-01

    Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of 35 speaker recognition experts from around the world. The volume provides a multidimensional view of the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples, collected by law enforcement and counter-terrorism agencies, that are associated with the commission of a terrorist act or other crimes. While addressing such topics as the challenges of forensic case work, handling speech signal degradation, analyzing features of speaker recognition to optimize voice verification system performance, and designing voice applications that meet the practical needs of law enforcement and counter-terrorism agencies, this material all sounds a common theme: how the rigors of forensic utility are demanding new levels of excellence in all aspects of speaker recognition. The contributors are among the most eminent scientists in speech engineering and signal process...

  12. Multistyle Training and Fusion for Speaker Identification of Disguised Voice

    DEFF Research Database (Denmark)

    Prasad, Swati; Tan, Zheng-Hua; Prasad, Ramjee

    2013-01-01

    Speaker identification research faces challenges due to mismatched training and test conditions, arising out of several factors. Non-electronic voice disguise is one of such factor and is commonly seen in crimes. This paper presents a study of the effect of three different types of voice disguises......, taken from the CHAINS speech corpus for the speaker identification accuracy. Out of the three voice disguises, two are variants of imitative style, namely, synchronous and repetitive synchronous imitation, and one is the fast speaking style. Different variants of multistyle training to increase...... the speaker identification accuracy are investigated in this paper. The manner in which the different speaking style’s speech examples are used for multistyle training plays an important role in the speaker identification accuracy. Further, a fusion of two multistyle training at the decision level is proposed...

  13. A neural mechanism for recognizing speech spoken by different speakers

    NARCIS (Netherlands)

    Kreitewolf, Jens; Gaudrain, Etienne; von Kriegstein, Katharina

    2014-01-01

    Understanding speech from different speakers is a sophisticated process, particularly because the same acoustic parameters convey important information about both the speech message and the person speaking. How the human brain accomplishes speech recognition under such conditions is unknown. One

  14. Cost-Sensitive Learning for Emotion Robust Speaker Recognition

    Science.gov (United States)

    Li, Dongdong; Yang, Yingchun

    2014-01-01

    In the field of information security, voice is one of the most important parts in biometrics. Especially, with the development of voice communication through the Internet or telephone system, huge voice data resources are accessed. In speaker recognition, voiceprint can be applied as the unique password for the user to prove his/her identity. However, speech with various emotions can cause an unacceptably high error rate and aggravate the performance of speaker recognition system. This paper deals with this problem by introducing a cost-sensitive learning technology to reweight the probability of test affective utterances in the pitch envelop level, which can enhance the robustness in emotion-dependent speaker recognition effectively. Based on that technology, a new architecture of recognition system as well as its components is proposed in this paper. The experiment conducted on the Mandarin Affective Speech Corpus shows that an improvement of 8% identification rate over the traditional speaker recognition is achieved. PMID:24999492

  15. Multilingual speaker age recognition: regression analyses on the Lwazi corpus

    CSIR Research Space (South Africa)

    Feld, M

    2009-12-01

    Full Text Available towards improved understanding of multilingual speech processing, the current contribution investigates how an important para-linguistic aspect of speech, namely speaker age, depends on the language spoken. In particular, the authors study how certain...

  16. Workshop on IVHM and Aviation Safety: Invited Speakers

    Data.gov (United States)

    National Aeronautics and Space Administration — Confirmed Speakers from the United States Ashok N. Srivastava, NASA Ames Research Center Robert Mah, NASA Ames Research Center Gary Hunter, NASA Glenn Research...

  17. Voice patterns in adult English speakers with Autism Spectrum Disorder

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Lambrechts, Anna; Yarrow, Kielan

    in a supervised machine-learning process to determine whether acoustic features predict diagnostic status and severity of the symptoms. In addition we were interested to evaluate how valid the model built based on Danish data would be on English data, i.e. which parameters were dependent and which were....... Using Recurrence Quantification analysis of acoustic features, Fusaroli et al. (2013 ; IMFAR 2014) demonstrated a high efficacy of identifying voice patterns characteristic of adult Danish speakers with Asperger’s syndrome and trained machine learning algorithms to accurately (80-86%) discriminate...... autistic from non-autistic speakers in both adult and children Danish speakers with ASD. Objectives: Our first aim was to replicate the results obtained by Fusaroli et al. (2013, 2014) in a sample of English speakers, i.e. (1) characterise the speech patterns of adults with ASD and (2) employ the results...

  18. Speaker Linking and Applications using Non-Parametric Hashing Methods

    Science.gov (United States)

    2016-09-08

    Speaker Linking and Applications using Non-Parametric Hashing Methods† Douglas Sturim and William M. Campbell MIT Lincoln Laboratory, Lexington, MA...to prove this by extending the proof of [8] to non-parametric hashing. 6. References [1] W. M. Campbell, D. E. Sturim, and D. A. Reynolds , “Support vec...EPFL-CONF-192414, 2012. [13] S. H. Shum, W. M. Campbell, and D. A. Reynolds , “Large-scale community detection on speaker content graphs,” in

  19. Nonnative Speaker-Initiated Repair in A Sequential Complex Context

    DEFF Research Database (Denmark)

    Yufu, Mamiko

    Repair has been one of the main subjects of conversation analytical studies and the focus is often put on achieving mutual understanding. However, there are also some phenomena unique to a contact situation, which may be due to restricted linguistic knowledge of nonnative speakers, difference...... to such factors as how Germans see Japanese, the interference of Japanese conversational styles, etc. Through the analyses of nonnative speaker-initiated repair, the context-sensitive complexities are demonstrated in this paper....

  20. Intonation and gender perception: applications for transgender speakers.

    Science.gov (United States)

    Hancock, Adrienne; Colton, Lindsey; Douglas, Fiacre

    2014-03-01

    Intonation is commonly addressed in voice and communication feminization therapy, yet empirical evidence of gender differences for intonation is scarce and rarely do studies examine how it relates to gender perception of transgender speakers. This study examined intonation of 12 males, 12 females, six female-to-male, and 14 male-to-female transgender speakers describing a Norman Rockwell image. Several intonation measures were compared between biological gender groups, between perceived gender groups, and between male-to-female (MTF) speakers who were perceived as male, female, or ambiguous gender. Speakers with a larger percentage of utterances with upward intonation and a larger utterance semitone range were perceived as female by listeners, despite no significant differences between the actual intonation of the four gender groups. MTF speakers who do not pass as female appear to use less upward and more downward intonations than female and passing MTF speakers. Intonation has potential for use in transgender communication therapy because it can influence perception to some degree. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  1. FPGA Implementation for GMM-Based Speaker Identification

    Directory of Open Access Journals (Sweden)

    Phaklen EhKan

    2011-01-01

    Full Text Available In today's society, highly accurate personal identification systems are required. Passwords or pin numbers can be forgotten or forged and are no longer considered to offer a high level of security. The use of biological features, biometrics, is becoming widely accepted as the next level for security systems. Biometric-based speaker identification is a method of identifying persons from their voice. Speaker-specific characteristics exist in speech signals due to different speakers having different resonances of the vocal tract. These differences can be exploited by extracting feature vectors such as Mel-Frequency Cepstral Coefficients (MFCCs from the speech signal. A well-known statistical modelling process, the Gaussian Mixture Model (GMM, then models the distribution of each speaker's MFCCs in a multidimensional acoustic space. The GMM-based speaker identification system has features that make it promising for hardware acceleration. This paper describes the hardware implementation for classification of a text-independent GMM-based speaker identification system. The aim was to produce a system that can perform simultaneous identification of large numbers of voice streams in real time. This has important potential applications in security and in automated call centre applications. A speedup factor of ninety was achieved compared to a software implementation on a standard PC.

  2. Free-classification of perceptually similar speakers with dysarthria.

    Science.gov (United States)

    Lansford, Kaitlin L; Liss, Julie M; Norton, Rebecca E

    2014-12-01

    In this investigation, the construct of perceptual similarity was explored in the dysarthrias. Specifically, we employed an auditory free-classification task to determine whether listeners could cluster speakers by perceptual similarity, whether the clusters mapped to acoustic metrics, and whether the clusters were constrained by dysarthria subtype diagnosis. Twenty-three listeners blinded to speakers' medical and dysarthria subtype diagnoses participated. The task was to group together (drag and drop) the icons corresponding to 33 speakers with dysarthria on the basis of how similar they sounded. Cluster analysis and multidimensional scaling (MDS) modeled the perceptual dimensions underlying similarity. Acoustic metrics and perceptual judgments were used in correlation analyses to facilitate interpretation of the derived dimensions. Six clusters of similar-sounding speakers and 3 perceptual dimensions underlying similarity were revealed. The clusters of similar-sounding speakers were not constrained by dysarthria subtype diagnosis. The 3 perceptual dimensions revealed by MDS were correlated with metrics for articulation rate, intelligibility, and vocal quality, respectively. This study shows (a) feasibility of a free-classification approach for studying perceptual similarity in dysarthria, (b) correspondence between acoustic and perceptual metrics to clusters of similar-sounding speakers, and (c) similarity judgments transcended dysarthria subtype diagnosis.

  3. Effects of Speaker Variability on Learning Foreign-Accented English for EFL Learners

    Science.gov (United States)

    Gao, Yuan; Low, Renae; Jin, Putai; Sweller, John

    2013-01-01

    Using a cognitive load theory approach, we investigated the effects of speaker variability when individuals are learning to understand English as a foreign language (EFL) spoken by foreign-accented speakers. The use of multiple, Indian-accented speakers was compared to that of a single speaker for Chinese EFL learners with a higher or lower…

  4. Literacy Skill Differences between Adult Native English and Native Spanish Speakers

    Science.gov (United States)

    Herman, Julia; Cote, Nicole Gilbert; Reilly, Lenore; Binder, Katherine S.

    2013-01-01

    The goal of this study was to compare the literacy skills of adult native English and native Spanish ABE speakers. Participants were 169 native English speakers and 124 native Spanish speakers recruited from five prior research projects. The results showed that the native Spanish speakers were less skilled on morphology and passage comprehension…

  5. Politics of Participation in Benoît Maubrey’s Speaker Sculptures

    DEFF Research Database (Denmark)

    Keylin, Vadim

    Speaker Sculptures is a series of works by Benoît Maubrey, created in 1983-2015. All of them are large-scale architecture-like constructions (often modelled after existing historical buildings or building types) built of recycled loudspeakers. The public could connect to the work by calling...... Sculptures connect the public space to the electronic media, subverting their antagonism and creating a single space of social interactions. This offers a possibility of political presence in public space to those, who are unable to do so in person due to physical or mental disabilities, or other personal...

  6. Speaker-Adaptation for Hybrid HMM-ANN Continuous Speech Recognition System

    OpenAIRE

    Neto, Joao; Almeida, Luis; Hochberg, Mike; Martins, Ciro; Nunes, Luis; Renals, Steve; Robinson, Tony

    1995-01-01

    It is well known that recognition performance degrades significantly when moving from a speaker-dependent to a speaker-independent system. Traditional hidden Markov model (HMM) systems have successfully applied speaker-adaptation approaches to reduce this degradation. In this paper we present and evaluate some techniques for speaker-adaptation of a hybrid HMM-artificial neural network (ANN) continuous speech recognition system. These techniques are applied to a well trained, speaker-independe...

  7. Noise Reduction with Microphone Arrays for Speaker Identification

    Energy Technology Data Exchange (ETDEWEB)

    Cohen, Z

    2011-12-22

    Reducing acoustic noise in audio recordings is an ongoing problem that plagues many applications. This noise is hard to reduce because of interfering sources and non-stationary behavior of the overall background noise. Many single channel noise reduction algorithms exist but are limited in that the more the noise is reduced; the more the signal of interest is distorted due to the fact that the signal and noise overlap in frequency. Specifically acoustic background noise causes problems in the area of speaker identification. Recording a speaker in the presence of acoustic noise ultimately limits the performance and confidence of speaker identification algorithms. In situations where it is impossible to control the environment where the speech sample is taken, noise reduction filtering algorithms need to be developed to clean the recorded speech of background noise. Because single channel noise reduction algorithms would distort the speech signal, the overall challenge of this project was to see if spatial information provided by microphone arrays could be exploited to aid in speaker identification. The goals are: (1) Test the feasibility of using microphone arrays to reduce background noise in speech recordings; (2) Characterize and compare different multichannel noise reduction algorithms; (3) Provide recommendations for using these multichannel algorithms; and (4) Ultimately answer the question - Can the use of microphone arrays aid in speaker identification?

  8. Fluency profile: comparison between Brazilian and European Portuguese speakers.

    Science.gov (United States)

    Castro, Blenda Stephanie Alves e; Martins-Reis, Vanessa de Oliveira; Baptista, Ana Catarina; Celeste, Letícia Correa

    2014-01-01

    The purpose of the study was to compare the speech fluency of Brazilian Portuguese speakers with that of European Portuguese speakers. The study participants were 76 individuals of any ethnicity or skin color aged 18-29 years. Of the participants, 38 lived in Brazil and 38 in Portugal. Speech samples from all participants were obtained and analyzed according to the variables of typology and frequency of speech disruptions and speech rate. Descriptive and inferential statistical analyses were performed to assess the association between the fluency profile and linguistic variant variables. We found that the speech rate of European Portuguese speakers was higher than the speech rate of Brazilian Portuguese speakers in words per minute (p=0.004). The qualitative distribution of the typology of common dysfluencies (pPortuguese speakers is not available, speech therapists in Portugal can use the same speech fluency assessment as has been used in Brazil to establish a diagnosis of stuttering, especially in regard to typical and stuttering dysfluencies, with care taken when evaluating the speech rate.

  9. Direct Speaker Gaze Promotes Trust in Truth-Ambiguous Statements.

    Directory of Open Access Journals (Sweden)

    Helene Kreysa

    Full Text Available A speaker's gaze behaviour can provide perceivers with a multitude of cues which are relevant for communication, thus constituting an important non-verbal interaction channel. The present study investigated whether direct eye gaze of a speaker affects the likelihood of listeners believing truth-ambiguous statements. Participants were presented with videos in which a speaker produced such statements with either direct or averted gaze. The statements were selected through a rating study to ensure that participants were unlikely to know a-priori whether they were true or not (e.g., "sniffer dogs cannot smell the difference between identical twins". Participants indicated in a forced-choice task whether or not they believed each statement. We found that participants were more likely to believe statements by a speaker looking at them directly, compared to a speaker with averted gaze. Moreover, when participants disagreed with a statement, they were slower to do so when the statement was uttered with direct (compared to averted gaze, suggesting that the process of rejecting a statement as untrue may be inhibited when that statement is accompanied by direct gaze.

  10. Sociolinguistic competence in the complimenting act of native Chinese and American English speakers: a mirror of cultural value.

    Science.gov (United States)

    Yu, Ming-Chung

    2005-01-01

    The present study examines sociolinguistic features of a particular speech act, paying compliments, by comparing and contrasting native Chinese and native American speakers' performances. By focusing on a relatively understudied speaker group such as the Chinese, typically regarded as having rules of speaking and social norms very different from those of Westerners, this paper aims at illuminating the fact that, in cross-cultural communication, foreign language speakers have to pay close attention to sociolinguistic rules of the target language in addition to structure and discourse rules to meet the needs of linguistic accuracy and fluency. This is due to the fact that such rules play an indispensable role in appropriating the proper use of linguistic forms. The data for this study were collected using ethnographic observation pioneered in this field by Wolfson and Manes (1980). The analysis will first explore both the features of distribution of paying compliments, and the functions they may serve in spoken exchanges for native Chinese and American English speakers. To present a fuller picture of the socio-cultural features this speech act may represent in Chinese and American societies, the analysis will further focus on the issues of topics, the addresser-addressee relationship, and culture-specificity versus universality.

  11. Accent modulates access to word meaning: Evidence for a speaker-model account of spoken word recognition.

    Science.gov (United States)

    Cai, Zhenguang G; Gilbert, Rebecca A; Davis, Matthew H; Gaskell, M Gareth; Farrar, Lauren; Adler, Sarah; Rodd, Jennifer M

    2017-11-01

    Speech carries accent information relevant to determining the speaker's linguistic and social background. A series of web-based experiments demonstrate that accent cues can modulate access to word meaning. In Experiments 1-3, British participants were more likely to retrieve the American dominant meaning (e.g., hat meaning of "bonnet") in a word association task if they heard the words in an American than a British accent. In addition, results from a speeded semantic decision task (Experiment 4) and sentence comprehension task (Experiment 5) confirm that accent modulates on-line meaning retrieval such that comprehension of ambiguous words is easier when the relevant word meaning is dominant in the speaker's dialect. Critically, neutral-accent speech items, created by morphing British- and American-accented recordings, were interpreted in a similar way to accented words when embedded in a context of accented words (Experiment 2). This finding indicates that listeners do not use accent to guide meaning retrieval on a word-by-word basis; instead they use accent information to determine the dialectic identity of a speaker and then use their experience of that dialect to guide meaning access for all words spoken by that person. These results motivate a speaker-model account of spoken word recognition in which comprehenders determine key characteristics of their interlocutor and use this knowledge to guide word meaning access. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  12. Communication Boot Camp: Discover the Speaker in You!

    Directory of Open Access Journals (Sweden)

    Zuraidah Binti Ali

    2013-07-01

    Full Text Available Learning can take place almost anywhere, and this is especially true for our undergraduates who wish to become public speakers. Besides university course and public speaking workshops on campus grounds, undergraduates are now looking for a different learning environment – communication boot camps!! This study presents a compilation of learners’ experience, fun-filled activities, insightful feedback and memorable boot camp moments as captured in camp photos and feedback surveys. It involves a total of thirty seven undergraduates who enrolled in a Communication Boot Camp at Janda Baik, Pahang. Results show that Communication Boot Camp is a successful strategy to groom public speakers with a positive correlation between camp success and camp objectives, particularly in reducing shyness, motivating participants to become public speakers and discovering their talent and skills. In short, the study adds to the promise of zest and delight in public speaking.

  13. Quantile Acoustic Vectors vs. MFCC Applied to Speaker Verification

    Directory of Open Access Journals (Sweden)

    Mayorga-Ortiz Pedro

    2014-02-01

    Full Text Available In this paper we describe speaker and command recognition related experiments, through quantile vectors and Gaussian Mixture Modelling (GMM. Over the past several years GMM and MFCC have become two of the dominant approaches for modelling speaker and speech recognition applications. However, memory and computational costs are important drawbacks, because autonomous systems suffer processing and power consumption constraints; thus, having a good trade-off between accuracy and computational requirements is mandatory. We decided to explore another approach (quantile vectors in several tasks and a comparison with MFCC was made. Quantile acoustic vectors are proposed for speaker verification and command recognition tasks and the results showed very good recognition efficiency. This method offered a good trade-off between computation times, characteristics vector complexity and overall achieved efficiency.

  14. Increase in voice level and speaker comfort in lecture rooms

    DEFF Research Database (Denmark)

    Brunskog, Jonas; Gade, Anders Christian; Bellester, Gaspar Payá

    2009-01-01

    Teachers often suffer from health problems related to their voice. These problems are related to their working environment, including the acoustics of the lecture rooms. However, there is a lack of studies linking the room acoustic parameters to the voice produced by the speaker. In this pilot......, and other physical attributes, the sound power level produced by six speakers was measured. Objective room acoustic parameters were measured in the same rooms, including reverberation time and room gain, and questionnaires were handed out to people who had experience talking in the rooms. It is found...... study, the main goals are to investigate whether objectively measurable parameters of the rooms can be related to an increase in the voice sound power produced by speakers and to the speakers’ subjective judgments about the rooms. In six different rooms with different sizes, reverberation times...

  15. SPEAKERS' IDENTIFICATION METHOD BASED ON COMPARISON OF PHONEME LENGTHS STATISTICS

    Directory of Open Access Journals (Sweden)

    E. V. Bulgakova

    2015-01-01

    Full Text Available Subject of research. The paper presents a semi-automatic method of speaker identification based on prosodic features comparison - statistics of phone lengths. Due to the development of speech technologies in recent times, there is an increased interest in searching of expert methods for speaker's voice identification, which supplement existing methods to increase identification reliability and also have low labour intensity. An efficient solution for this problem is necessary for making the reliable decision whether the voices of the speakers in the audio recordings are identical or different. Method description. We present a novel algorithm for calculating the difference of speakers’ voices based on comparing of statistics for phone and allophone lengths. Characteristic feature of the proposed method is the possibility of its application along with the other semi-automatic methods (acoustic, auditive and linguistic due to the lack of a strong correlation between analyzed features. The advantage of the method is the possibility to carry out rapid analysis of long-duration recordings because of preprocessing automation for data being analyzed. We describe the operation principles of an automatic speech segmentation module used for statistics calculation of sound lengths by acoustic-phonetic labeling. The software has been developed as an instrument of speech data preprocessing for expert analysis. Method approbation. This method was approved on the speech database of 130 speech records, including the Russian speech of the male speakers and female speakers, and showed reliability equal to 71.7% on the database containing female speech records, and 78.4% on the database containing male speech records. Also it was experimentally established that the most informative of all used features are statistics of phone lengths of vowels and sonorant sounds. Practical relevance. Experimental results have shown applicability of the proposed method for the

  16. Social Deixis in Sinhalese: The Pronoun System.

    Science.gov (United States)

    Tilakaratne, Sunanda

    The pronoun system in Sinhalese, which is spoken in Sri Lanka, is examined based on a Sinhalese speaker's intuition and consultation with other native speakers. Spoken Sinhalese differs from the written language in having an entirely different pronoun system. Spoken Sinhalese provides a good example of social deixis because it encodes social…

  17. The Effects of Students' Perception of a Speaker's Role on Their Recall of Drug Facts and Their Opinions and Attitudes About Drugs

    Science.gov (United States)

    McCleaf, James E.; Colby, Margaret A.

    1975-01-01

    The ascribed social role of a speaker and his ascribed experience with drugs contributed to significant differences in student recall of drug information. Sex of the respondent was found to have a significant effect on students' scores on the Drug Opinion and Attitudes Test. (RC)

  18. Gesturing by Speakers with Aphasia: How Does It Compare?

    Science.gov (United States)

    Mol, Lisette; Krahmer, Emiel; van de Sandt-Koenderman, Mieke

    2013-01-01

    Purpose: To study the independence of gesture and verbal language production. The authors assessed whether gesture can be semantically compensatory in cases of verbal language impairment and whether speakers with aphasia and control participants use similar depiction techniques in gesture. Method: The informativeness of gesture was assessed in 3…

  19. Intergenerational Differences in Acculturation Orientations of Turkish Speakers in Australia

    NARCIS (Netherlands)

    Yagmur, K.

    2014-01-01

    In this paper, acculturation and language orientations among Turkish speakers in Australia (n = 283) are discussed. Compared to West European countries, the Turkish community in Australia is much smaller. Given the prevalent pluralism ideology in Australia, a high level of sociocultural adjustment

  20. Timing control accuracy in normal speakers and stutterers.

    Science.gov (United States)

    Cooper, M H; Allen, G D

    1977-03-01

    This study attempted to assess the abilities of 10 normal speakers, five stutterers in therapy, and five stutterers no longer in therapy, to control the time program of repeated utterances. The speech sample comprised repeated sentences, paragraphs, and nursery rhymes, and a finger-tapping task was included as a control. Temporal accuracy was measured. Results suggest that (1) there is a wide range of timing abilities, even among the normal speakers, with considerable overlap between the different groups of speakers; (2) on most of the experimental tasks, normal speakers are more accurate timers than are stutterers; (3) stutterers released from therapy are more accurate timers than are stutterers still in therapy, whenever these groups differ; and (4) subjects' speech timing scores correlate moderately with their tapping scores. These results are discussed in terms of (1) theoretical timing control processes, such as a neural clock for controlling speech segment durations, and a speech motor output buffer, whose capacity may be limited in stutterers, and (2) known effects of rhythmic constraints and respiratory irregularity on fluency.

  1. The Role of Surrogate Speakers in the 1980 Presidential Campaign.

    Science.gov (United States)

    Kessler, Martha Stout

    1981-01-01

    Describes the employment of surrogate speakers and examines their role in the 1980 presidential contest. Considers the advantages and disadvantages of their use and concludes that surrogates served the important needs of three groups--the candidates for whom they spoke, the audiences they addressed, and the surrogates themselves. (PD)

  2. Generating natural language descriptions using speaker-dependent information

    NARCIS (Netherlands)

    Castro Ferreira, Thiago; Paraboni, Ivandré

    2017-01-01

    This paper discusses the issue of human variation in natural language referring expression generation. We introduce a model of content selection that takes speaker-dependent information into account to produce descriptions that closely resemble those produced by each individual, as seen in a number

  3. The Space-Time Topography of English Speakers

    Science.gov (United States)

    Duman, Steve

    2016-01-01

    English speakers talk and think about Time in terms of physical space. The past is behind us, and the future is in front of us. In this way, we "map" space onto Time. This dissertation addresses the specificity of this physical space, or its topography. Inspired by languages like Yupno (Nunez, et al., 2012) and Bamileke-Dschang (Hyman,…

  4. Distinguished guests, notable speakers, and almost 4,000 ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    2016-06-21

    Jun 21, 2016 ... This year's notable speakers included The Honourable Stéphane Dion, Minister of Foreign Affairs; The Honourable Chrystia Freeland, Minister of International Trade; The Honourable Jane Philpott, Minister of Health; conference co-chair Irina Bokova, Director General of UNESCO; conference co-chair ...

  5. Evaluating acoustic speaker normalization algorithms: evidence from longitudinal child data.

    Science.gov (United States)

    Kohn, Mary Elizabeth; Farrington, Charlie

    2012-03-01

    Speaker vowel formant normalization, a technique that controls for variation introduced by physical differences between speakers, is necessary in variationist studies to compare speakers of different ages, genders, and physiological makeup in order to understand non-physiological variation patterns within populations. Many algorithms have been established to reduce variation introduced into vocalic data from physiological sources. The lack of real-time studies tracking the effectiveness of these normalization algorithms from childhood through adolescence inhibits exploration of child participation in vowel shifts. This analysis compares normalization techniques applied to data collected from ten African American children across five time points. Linear regressions compare the reduction in variation attributable to age and gender for each speaker for the vowels BEET, BAT, BOT, BUT, and BOAR. A normalization technique is successful if it maintains variation attributable to a reference sociolinguistic variable, while reducing variation attributable to age. Results indicate that normalization techniques which rely on both a measure of central tendency and range of the vowel space perform best at reducing variation attributable to age, although some variation attributable to age persists after normalization for some sections of the vowel space. © 2012 Acoustical Society of America

  6. Perception of English palatal codas by Korean speakers of English

    Science.gov (United States)

    Yeon, Sang-Hee

    2003-04-01

    This study aimed at looking at perception of English palatal codas by Korean speakers of English to determine if perception problems are the source of production problems. In particular, first, this study looked at the possible first language effect on the perception of English palatal codas. Second, a possible perceptual source of vowel epenthesis after English palatal codas was investigated. In addition, individual factors, such as length of residence, TOEFL score, gender and academic status, were compared to determine if those affected the varying degree of the perception accuracy. Eleven adult Korean speakers of English as well as three native speakers of English participated in the study. Three sets of a perception test including identification of minimally different English pseudo- or real words were carried out. The results showed that, first, the Korean speakers perceived the English codas significantly worse than the Americans. Second, the study supported the idea that Koreans perceived an extra /i/ after the final affricates due to final release. Finally, none of the individual factors explained the varying degree of the perceptional accuracy. In particular, TOEFL scores and the perception test scores did not have any statistically significant association.

  7. Why reference to the past is difficult for agrammatic speakers

    NARCIS (Netherlands)

    Bastiaanse, Roelien

    Many studies have shown that verb inflections are difficult to produce for agrammatic aphasic speakers: they are frequently omitted and substituted. The present article gives an overview of our search to understanding why this is the case. The hypothesis is that grammatical morphology referring to

  8. Pre-emigration reflections: Afrikaans speakers moving to New Zealand

    African Journals Online (AJOL)

    This article reports on the pre-emigration reflections of 15 Afrikaans speakers, all of whom were in the final stages of preparing to emigrate to New Zealand. The study explores the linguistic histories of the participants, their attitudes to their mother tongue (Afrikaans) and to English, and their views on South Africa\\'s language ...

  9. Native Speakers' Perception of Non-Native English Speech

    Science.gov (United States)

    Jaber, Maysa; Hussein, Riyad F.

    2011-01-01

    This study is aimed at investigating the rating and intelligibility of different non-native varieties of English, namely French English, Japanese English and Jordanian English by native English speakers and their attitudes towards these foreign accents. To achieve the goals of this study, the researchers used a web-based questionnaire which…

  10. Openings and Closings in Telephone Conversations between Native Spanish Speakers.

    Science.gov (United States)

    Coronel-Molina, Serafin M.

    1998-01-01

    A study analyzed the opening and closing sequences of 11 dyads of native Spanish-speakers in natural telephone conversations conducted in Spanish. The objective was to determine how closely Hispanic cultural patterns of conduct for telephone conversations follow the sequences outlined in previous research. It is concluded that Spanish…

  11. Early testimonial learning: monitoring speech acts and speakers.

    Science.gov (United States)

    Stephens, Elizabeth; Suarez, Sarah; Koenig, Melissa

    2015-01-01

    Testimony provides children with a rich source of knowledge about the world and the people in it. However, testimony is not guaranteed to be veridical, and speakers vary greatly in both knowledge and intent. In this chapter, we argue that children encounter two primary types of conflicts when learning from speakers: conflicts of knowledge and conflicts of interest. We review recent research on children's selective trust in testimony and propose two distinct mechanisms supporting early epistemic vigilance in response to the conflicts associated with speakers. The first section of the chapter focuses on the mechanism of coherence checking, which occurs during the process of message comprehension and facilitates children's comparison of information communicated through testimony to their prior knowledge, alerting them to inaccurate, inconsistent, irrational, and implausible messages. The second section focuses on source-monitoring processes. When children lack relevant prior knowledge with which to evaluate testimonial messages, they monitor speakers themselves for evidence of competence and morality, attending to cues such as confidence, consensus, access to information, prosocial and antisocial behavior, and group membership. © 2015 Elsevier Inc. All rights reserved.

  12. Negotiation and Intercultural Learning in Italian Native Speaker Chat Rooms

    Science.gov (United States)

    Tudini, Vincenza

    2007-01-01

    Foreign language learners purportedly demonstrate intercultural communicative competence in native speaker (NS) chat rooms through self-initiated negotiation sequences, including those triggered by pragmatic issues and cultural content. This study identified and classified one-to-one NS-learner negotiations between intermediate learners and NS of…

  13. AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS

    Directory of Open Access Journals (Sweden)

    Murat CANER

    2006-02-01

    Full Text Available In this study an artificial neural network (ANN is implemented, which has been used frequently as an implementation model in recent years, to recognize speaker identification. Generally, recognition is consist of three stages that, processing of signal, obtaining attributes and comparing them. Speech samples are transformed into digital data according to voice card of PC. In the analysis of voice stage, recurrent periods and white noise of voice data are trimmed by hamming window method and voice attribute part of the digital data is obtained. For obtaining attribute of voice data LPC (linear predictive coding and DFT (discrete fourier transform methods are used. Of those 28 coefficents, that is used for speaker recognition, 16 were obtained by the analysis of DFT and 12 were obtained by the analysis of LPC. The parameters that represent speaker voice, is used for training and test of ANN. Multilayer perceptron model is used as an architecture of ANN and backpropagation algorithm is used for training method. Voices of "a" is taken from 7 different person and their attributes are found. ANN is trained with these features to find the speaker who is the owner of the sample voice. And then using the test data that is not used for training part, recognition achievement of ANN is tested. As a result, good results were obtained with low failure rate.

  14. Google Home: smart speaker as environmental control unit.

    Science.gov (United States)

    Noda, Kenichiro

    2017-08-23

    Environmental Control Units (ECU) are devices or a system that allows a person to control appliances in their home or work environment. Such system can be utilized by clients with physical and/or functional disability to enhance their ability to control their environment, to promote independence and improve their quality of life. Over the last several years, there have been an emergence of several inexpensive, commercially-available, voice activated smart speakers into the market such as Google Home and Amazon Echo. These smart speakers are equipped with far field microphone that supports voice recognition, and allows for complete hand-free operation for various purposes, including for playing music, for information retrieval, and most importantly, for environmental control. Clients with disability could utilize these features to turn the unit into a simple ECU that is completely voice activated and wirelessly connected to appliances. Smart speakers, with their ease of setup, low cost and versatility, may be a more affordable and accessible alternative to the traditional ECU. Implications for Rehabilitation Environmental Control Units (ECU) enable independence for physically and functionally disabled clients, and reduce burden and frequency of demands on carers. Traditional ECU can be costly and may require clients to learn specialized skills to use. Smart speakers have the potential to be used as a new-age ECU by overcoming these barriers, and can be used by a wider range of clients.

  15. Teaching the Native English Speaker How to Teach English

    Science.gov (United States)

    Odhuu, Kelli

    2014-01-01

    This article speaks to teachers who have been paired with native speakers (NSs) who have never taught before, and the feelings of frustration, discouragement, and nervousness on the teacher's behalf that can occur as a result. In order to effectively tackle this situation, teachers need to work together with the NSs. Teachers in this scenario…

  16. Umesh V Waghmare | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    Umesh V Waghmare. Theoretical Sciences Unit, Jawaharlal Nehru Centre for Advanced Scientific Research, Jakkur P.O., Bangalore 560 064, ... These ideas apply quite well to dynamical structure of a crystal, as described by the dispersion of its phonons or vibrational waves. The speakers group has shown an interesting ...

  17. Utterance Verification for Text-Dependent Speaker Recognition

    DEFF Research Database (Denmark)

    Kinnunen, Tomi; Sahidullah, Md; Kukanov, Ivan

    2016-01-01

    in the literature, a treatment of simultaneous speaker and utterance verification with a modern, standard database is so far lacking. This is despite the burgeoning demand for voice biometrics in a plethora of practical security applications. With the goal of improving overall verification performance, this paper...

  18. Speaker Recognition from Emotional Speech Using I-vector Approach

    Directory of Open Access Journals (Sweden)

    MACKOVÁ Lenka

    2014-05-01

    Full Text Available In recent years the concept of i-vectors become very popular and successful in the field of the speaker verification. The basic principle of i-vectors is that each utterance is represented by fixed-length feature vector of low-dimension. In the literature for purpose of speaker verification various recordings obtained from telephones or microphones were used. The aim of this experiment was to perform speaker verification using speaker model trained with emotional recordings on i-vector basis. The Mel Frequency Cepstral Coefficients (MFCC, log energy, their deltas and acceleration coefficients were used in process of features extraction. As the classification methods of the verification system Mahalanobis distance metric in combination with Eigen Factor Radial normalization was used and in the second approach Cosine Distance Scoring (CSS metric with Within-class Covariance Normalization as a channel compensation was employed. This verification system used emotional recordings of male subjects from freely available German emotional database (Emo-DB.

  19. Articulatory settings of French-English bilingual speakers

    Science.gov (United States)

    Wilson, Ian

    2005-04-01

    The idea of a language-specific articulatory setting (AS), an underlying posture of the articulators during speech, has existed for centuries [Laver, Historiogr. Ling. 5 (1978)], but until recently it had eluded direct measurement. In an analysis of x-ray movies of French and English monolingual speakers, Gick et al. [Phonetica (in press)] link AS to inter-speech posture, allowing measurement of AS without interference from segmental targets during speech, and they give quantitative evidence showing AS to be language-specific. In the present study, ultrasound and Optotrak are used to investigate whether bilingual English-French speakers have two ASs, and whether this varies depending on the mode (monolingual or bilingual) these speakers are in. Specifically, for inter-speech posture of the lips, lip aperture and protrusion are measured using Optotrak. For inter-speech posture of the tongue, tongue root retraction, tongue body and tongue tip height are measured using optically-corrected ultrasound. Segmental context is balanced across the two languages ensuring that the sets of sounds before and after an inter-speech posture are consistent across languages. By testing bilingual speakers, vocal tract morphology across languages is controlled for. Results have implications for L2 acquisition, specifically the teaching and acquisition of pronunciation.

  20. Psycholinguistic Approaches to Language Processing in Heritage Speakers

    Science.gov (United States)

    Bolger, Patrick A.; Zapata, Gabriela C.

    2011-01-01

    This paper focuses on the dearth of language-processing research addressing Spanish heritage speakers in assimilationist communities. First, we review key offline work on this population, and we then summarize the few psycholinguistic (online) studies that have already been carried out. In an attempt to encourage more such research, in the next…

  1. On the Use of Complementary Spectral Features for Speaker Recognition

    Directory of Open Access Journals (Sweden)

    Sridhar Krishnan

    2007-12-01

    Full Text Available The most popular features for speaker recognition are Mel frequency cepstral coefficients (MFCCs and linear prediction cepstral coefficients (LPCCs. These features are used extensively because they characterize the vocal tract configuration which is known to be highly speaker-dependent. In this work, several features are introduced that can characterize the vocal system in order to complement the traditional features and produce better speaker recognition models. The spectral centroid (SC, spectral bandwidth (SBW, spectral band energy (SBE, spectral crest factor (SCF, spectral flatness measure (SFM, Shannon entropy (SE, and Renyi entropy (RE were utilized for this purpose. This work demonstrates that these features are robust in noisy conditions by simulating some common distortions that are found in the speakers' environment and a typical telephone channel. Babble noise, additive white Gaussian noise (AWGN, and a bandpass channel with 1 dB of ripple were used to simulate these noisy conditions. The results show significant improvements in classification performance for all noise conditions when these features were used to complement the MFCC and ΔMFCC features. In particular, the SC and SCF improved performance in almost all noise conditions within the examined SNR range (10–40 dB. For example, in cases where there was only one source of distortion, classification improvements of up to 8% and 10% were achieved under babble noise and AWGN, respectively, using the SCF feature.

  2. Bilingual Speakers in the Lab: Psychophysiological Measures of Emotional Reactivity

    Science.gov (United States)

    Harris, Catherine L.

    2004-01-01

    Bilingual speakers report experiencing stronger emotions when speaking and hearing their first language compared to their second. Does this occur even when a second language is learned early and becomes the dominant language? Spanish-English bilinguals who had grown up in the USA (early learners) or those who were first exposed to English during…

  3. Application of Native Speaker Models for Identifying Deviations in Rhetorical Moves in Non-Native Speaker Manuscripts

    Directory of Open Access Journals (Sweden)

    Assef Khalili

    2016-06-01

    Full Text Available Introduction: Explicit teaching of generic conventions of a text genre, usually extracted from native-speaker (NS manuscripts, has long been emphasized in the teaching of Academic Writing inEnglish for Specific Purposes (henceforthESP classes, both in theory and practice. While consciousness-raising about rhetorical structure can be instrumental to non-native speakers(NNS, it has to be admitted that most works done in the field of ESP have tended to focus almost exclusively on native-speaker (NS productions, giving scant attention to non-native speaker (NNS manuscripts. That is, having outlined established norms for good writing on the basis of NS productions, few have been inclined to provide a descriptive account of NNS attempts at trying to produce a research article (RA in English. That is what we have tried to do in the present research. Methods: We randomly selected 20 RAs in dentistry and used two well-established models for results and discussion sections to try to describe the move structure of these articles and show the points of divergence from the established norms. Results: The results pointed to significant divergences that could seriously compromise the quality of an RA. Conclusion: It is believed that the insights gained on the deviations in NNS manuscripts could prove very useful in designing syllabi for ESP classes.

  4. Respiratory Control in Stuttering Speakers: Evidence from Respiratory High-Frequency Oscillations.

    Science.gov (United States)

    Denny, Margaret; Smith, Anne

    2000-01-01

    This study examined whether stuttering speakers (N=10) differed from fluent speakers in relations between the neural control systems for speech and life support. It concluded that in some stuttering speakers the relations between respiratory controllers are atypical, but that high participation by the high frequency oscillation-producing circuitry…

  5. Defining "Native Speaker" in Multilingual Settings: English as a Native Language in Asia

    Science.gov (United States)

    Hansen Edwards, Jette G.

    2017-01-01

    The current study examines how and why speakers of English from multilingual contexts in Asia are identifying as native speakers of English. Eighteen participants from different contexts in Asia, including Singapore, Malaysia, India, Taiwan, and The Philippines, who self-identified as native speakers of English participated in hour-long interviews…

  6. Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

    DEFF Research Database (Denmark)

    Saeidi, Rahim; Mowlaee, Pejman; Kinnunen, Tomi

    2010-01-01

    In this paper, we consider speaker identification for the co-channel scenario in which speech mixture from speakers is recorded by one microphone only. The goal is to identify both of the speakers from their mixed signal. High recognition accuracies have already been reported when an accurately...

  7. Early Language Experience Facilitates the Processing of Gender Agreement in Spanish Heritage Speakers

    Science.gov (United States)

    Montrul, Silvina; Davidson, Justin; De La Fuente, Israel; Foote, Rebecca

    2014-01-01

    We examined how age of acquisition in Spanish heritage speakers and L2 learners interacts with implicitness vs. explicitness of tasks in gender processing of canonical and non-canonical ending nouns. Twenty-three Spanish native speakers, 29 heritage speakers, and 33 proficiency-matched L2 learners completed three on-line spoken word recognition…

  8. An acoustic analysis of English vowels produced by speakers of seven different native-language backgrounds

    NARCIS (Netherlands)

    Heuven, van V.J.J.P.; Gooskens, C.

    2017-01-01

    We measured F1, F2 and duration of ten English monophthongs produced by American native speakers and by Danish, Norwegian, Swedish, Dutch, Hungarian and Chinese L2 speakers. We hypothesized that (i) L2 speakers would approximate the English vowels more closely as the phonological distance between

  9. 78 FR 65511 - Death of Thomas S. Foley Former Speaker of the House of Representatives

    Science.gov (United States)

    2013-10-31

    ... Thomas S. Foley Former Speaker of the House of Representatives #0; #0; #0; Presidential Documents #0; #0... Thomas S. Foley Former Speaker of the House of Representatives By the President of the United States of America A Proclamation As a mark of respect for the memory of Thomas S. Foley, former Speaker of the House...

  10. Is a shared interlanguage beneficial? Mutual intelligibility of American, Dutch and Mandarin speakers of English

    NARCIS (Netherlands)

    Wang, H.; Heuven, van V.J.J.P.; Doel, van den R.; Rupp, L.

    2014-01-01

    We determined the mutual intelligibility Mandarin Chinese, Dutch and American speakers of English in all nine logically possible combinations of speaker and listener native language backgrounds. Designated speakers (one male, one female per language group) were selected from larger sets of 20

  11. Modern Greek Language: Acquisition of Morphology and Syntax by Non-Native Speakers

    Science.gov (United States)

    Andreou, Georgia; Karapetsas, Anargyros; Galantomos, Ioannis

    2008-01-01

    This study investigated the performance of native and non native speakers of Modern Greek language on morphology and syntax tasks. Non-native speakers of Greek whose native language was English, which is a language with strict word order and simple morphology, made more errors and answered more slowly than native speakers on morphology but not…

  12. Race in Conflict with Heritage: "Black" Heritage Language Speaker of Japanese

    Science.gov (United States)

    Doerr, Neriko Musha; Kumagai, Yuri

    2014-01-01

    "Heritage language speaker" is a relatively new term to denote minority language speakers who grew up in a household where the language was used or those who have a family, ancestral, or racial connection to the minority language. In research on heritage language speakers, overlap between these 2 definitions is often assumed--that is,…

  13. Conversational Entrainment of Vocal Fry in Young Adult Female American English Speakers.

    Science.gov (United States)

    Borrie, Stephanie A; Delfino, Christine R

    2017-07-01

    Conversational entrainment, the natural tendency for people to modify their behaviors to more closely match their communication partner, is examined as one possible mechanism modulating the prevalence of vocal fry in the speech of young American women engaged in spoken dialogue. Twenty young adult female American English speakers engaged in two spoken dialogue tasks-one with a young adult female American English conversational partner who exhibited substantial vocal fry and one with a young adult female American English conversational partner who exhibited quantifiably less vocal fry. Dialogues were analyzed for proportion of vocal fry, by speaker, and two measures of communicative success (efficiency and enjoyment). Participants employed significantly more vocal fry when conversing with the partner who exhibited substantial vocal fry than when conversing with the partner who exhibited quantifiably less vocal fry. Further, greater similarity between communication partners in their use of vocal fry tracked with higher scores of communicative efficiency and communicative enjoyment. Conversational entrainment offers a mechanistic framework that may be used to explain, to some degree, the frequency with which vocal fry is employed by young American women engaged in spoken dialogue. Further, young American women who modulated their vocal patterns during dialogue to match those of their conversational partner gained more efficiency and enjoyment from their interactions, demonstrating the cognitive and social benefits of entrainment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  14. The influence of language behavior in social preferences and selective trust of monolingual and bilingual children.

    Science.gov (United States)

    Yow, W Quin; Li, Xiaoqian

    2018-02-01

    Experiences living in a community where people share more than one language may affect children's strategies to selective learning. Language mixing may be one type of speakers' characteristics that bilingual children, but not monolingual children, use to evaluate speakers. A total of 120 English-speaking monolingual (n = 40) and English-Mandarin bilingual (n = 80) 4- and 5-year-olds heard a pair of speakers each tell a story either with or without language mixing and indicated their preferences for either speaker in friendship, explicit judgment, and novel label endorsement. Bilingual children, but not their monolingual counterparts, preferred the single-language speaker to the language-mix speaker across different test questions. Our findings suggest that social relevance in the larger communicative context may contribute to the development of children's social preferences and selective learning based on certain characteristics of the speakers. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Variation among heritage speakers: Sequential vs. simultaneous bilinguals

    Directory of Open Access Journals (Sweden)

    Teresa Lee

    2013-08-01

    Full Text Available This study examines the differences in the grammatical knowledge of two types of heritage speakers of Korean. Early simultaneous bilinguals are exposed to both English and the heritage language from birth, whereas early sequential bilinguals are exposed to the heritage language first and then to English upon schooling. A listening comprehension task involving relative clauses was conducted with 51 beginning-level Korean heritage speakers. The results showed that the early sequential bilinguals exhibited much more accurate knowledge than the early simultaneous bilinguals, who lacked rudimentary knowledge of Korean relative clauses. Drawing on the findings of adult and child Korean L1 data on the acquisition of relative clauses, the performance of each group is discussed with respect to attrition and incomplete acquisition of the heritage language.

  16. English exposed common mistakes made by Chinese speakers

    CERN Document Server

    Hart, Steve

    2017-01-01

    Having analysed the most common English errors made in over 600 academic papers written by Chinese undergraduates, postgraduates, and researchers, Steve Hart has written an essential, practical guide specifically for the native Chinese speaker on how to write good academic English. English Exposed: Common Mistakes Made by Chinese Speakers is divided into three main sections. The first section examines errors made with verbs, nouns, prepositions, and other grammatical classes of words. The second section focuses on problems of word choice. In addition to helping the reader find the right word, it provides instruction for selecting the right style too. The third section covers a variety of other areas essential for the academic writer, such as using punctuation, adding appropriate references, referring to tables and figures, and selecting among various English date and time phrases. Using English Exposed will allow a writer to produce material where content and ideas-not language mistakes-speak the loudest.

  17. Phoneme discrimination and mismatch negativity in English and Japanese speakers

    Science.gov (United States)

    Bomba, Marie D.; Choly, David; Pang, Elizabeth W.

    2012-01-01

    Neural templates for phonemes in one’s native language are formed early in life; these can be modified but are difficult to form de novo. These can be examined with mismatch negativity. Three phonemic contrasts were presented to adult native English compared to Japanese speakers who acquired English later: vowels native to both languages (/i//iy/); consonant-vowel contrasts (/da//wa/) phonemic in both languages; and consonant-vowel contrasts phonemic in English but not in Japanese (/ra//la/). For vowels, no MMN differences were found. For /da//wa/, MMN amplitude was significantly reduced in Japanese speakers. For /ra//la/, only 50% of the Japanese group showed an identifiable MMN. This suggests that phonemic templates are formed early in life and non-native consonant contrasts are difficult to learn later. PMID:21610545

  18. Willing Learners yet Unwilling Speakers in ESL Classrooms

    Directory of Open Access Journals (Sweden)

    Zuraidah Ali

    2007-12-01

    Full Text Available To some of us, speech production in ESL has become so natural and integral that we seem to take it for granted. We often do not even remember how we struggled through the initial process of mastering English. Unfortunately, to students who are still learning English, they seem to face myriad problems that make them appear unwilling or reluctant ESL speakers. This study will investigate this phenomenon which is very common in the ESL classroom. Setting its background on related research findings on this matter, a qualitative study was conducted among foreign students enrolled in the Intensive English Programme (IEP at Institute of Liberal Studies (IKAL, University Tenaga Nasional (UNITEN. The results will show and discuss an extent of truth behind this perplexing phenomenon: willing learners, yet unwilling speakers of ESL, in our effort to provide supportive learning cultures in second language acquisition (SLA to this group of students.

  19. Teaching English to speakers of other languages an introduction

    CERN Document Server

    Nunan, David

    2015-01-01

    David Nunan's dynamic learner-centered teaching style has informed and inspired countless TESOL educators around the world. In this fresh, straightforward introduction to teaching English to speakers of other languages he presents teaching techniques and procedures along with the underlying theory and principles. Complex theories and research studies are explained in a clear and comprehensible, yet non-trivial, manner without trivializing them. Practical examples of how to develop teaching materials and tasks from sound principles provide rich illustrations of theoretical constructs.

  20. Speech rate normalization used to improve speaker verification

    CSIR Research Space (South Africa)

    Van Heerden, CJ

    2006-11-01

    Full Text Available the normalized durations is then compared with the EER using unnormalized durations, and also with the EER when duration information is not employed. 2. Proposed phoneme duration modeling 2.1. Choosing parametric models Since the duration of a phoneme... the known transcription and the speaker-specific acoustic model described above. Only one pronunciation per word was allowed, thus resulting in 49 triphones. To decide which parametric model to use for the duration density func- tions of the triphones...

  1. Neural bases of congenital amusia in tonal language speakers.

    Science.gov (United States)

    Zhang, Caicai; Peng, Gang; Shao, Jing; Wang, William S-Y

    2017-03-01

    Congenital amusia is a lifelong neurodevelopmental disorder of fine-grained pitch processing. In this fMRI study, we examined the neural bases of congenial amusia in speakers of a tonal language - Cantonese. Previous studies on non-tonal language speakers suggest that the neural deficits of congenital amusia lie in the music-selective neural circuitry in the right inferior frontal gyrus (IFG). However, it is unclear whether this finding can generalize to congenital amusics in tonal languages. Tonal language experience has been reported to shape the neural processing of pitch, which raises the question of how tonal language experience affects the neural bases of congenital amusia. To investigate this question, we examined the neural circuitries sub-serving the processing of relative pitch interval in pitch-matched Cantonese level tone and musical stimuli in 11 Cantonese-speaking amusics and 11 musically intact controls. Cantonese-speaking amusics exhibited abnormal brain activities in a widely distributed neural network during the processing of lexical tone and musical stimuli. Whereas the controls exhibited significant activation in the right superior temporal gyrus (STG) in the lexical tone condition and in the cerebellum regardless of the lexical tone and music conditions, no activation was found in the amusics in those regions, which likely reflects a dysfunctional neural mechanism of relative pitch processing in the amusics. Furthermore, the amusics showed abnormally strong activation of the right middle frontal gyrus and precuneus when the pitch stimuli were repeated, which presumably reflect deficits of attending to repeated pitch stimuli or encoding them into working memory. No significant group difference was found in the right IFG in either the whole-brain analysis or region-of-interest analysis. These findings imply that the neural deficits in tonal language speakers might differ from those in non-tonal language speakers, and overlap partly with the

  2. Identifying the Attended Speaker Using Electrocorticographic (ECoG) Signals

    Science.gov (United States)

    Dijkstra, K.; Brunner, P.; Gunduz, A.; Coon, W.; Ritaccio, A.L.; Farquhar, J.; Schalk, G.

    2015-01-01

    People affected by severe neuro-degenerative diseases (e.g., late-stage amyotrophic lateral sclerosis (ALS) or locked-in syndrome) eventually lose all muscular control. Thus, they cannot use traditional assistive communication devices that depend on muscle control, or brain-computer interfaces (BCIs) that depend on the ability to control gaze. While auditory and tactile BCIs can provide communication to such individuals, their use typically entails an artificial mapping between the stimulus and the communication intent. This makes these BCIs difficult to learn and use. In this study, we investigated the use of selective auditory attention to natural speech as an avenue for BCI communication. In this approach, the user communicates by directing his/her attention to one of two simultaneously presented speakers. We used electrocorticographic (ECoG) signals in the gamma band (70–170 Hz) to infer the identity of attended speaker, thereby removing the need to learn such an artificial mapping. Our results from twelve human subjects show that a single cortical location over superior temporal gyrus or pre-motor cortex is typically sufficient to identify the attended speaker within 10 s and with 77% accuracy (50% accuracy due to chance). These results lay the groundwork for future studies that may determine the real-time performance of BCIs based on selective auditory attention to speech. PMID:26949710

  3. Processing advantage for emotional words in bilingual speakers.

    Science.gov (United States)

    Ponari, Marta; Rodríguez-Cuadrado, Sara; Vinson, David; Fox, Neil; Costa, Albert; Vigliocco, Gabriella

    2015-10-01

    Effects of emotion on word processing are well established in monolingual speakers. However, studies that have assessed whether affective features of words undergo the same processing in a native and nonnative language have provided mixed results: Studies that have found differences between native language (L1) and second language (L2) processing attributed the difference to the fact that L2 learned late in life would not be processed affectively, because affective associations are established during childhood. Other studies suggest that adult learners show similar effects of emotional features in L1 and L2. Differences in affective processing of L2 words can be linked to age and context of learning, proficiency, language dominance, and degree of similarity between L2 and L1. Here, in a lexical decision task on tightly matched negative, positive, and neutral words, highly proficient English speakers from typologically different L1s showed the same facilitation in processing emotionally valenced words as native English speakers, regardless of their L1, the age of English acquisition, or the frequency and context of English use. (c) 2015 APA, all rights reserved).

  4. Consistency between verbal and non-verbal affective cues: a clue to speaker credibility.

    Science.gov (United States)

    Gillis, Randall L; Nilsen, Elizabeth S

    2017-06-01

    Listeners are exposed to inconsistencies in communication; for example, when speakers' words (i.e. verbal) are discrepant with their demonstrated emotions (i.e. non-verbal). Such inconsistencies introduce ambiguity, which may render a speaker to be a less credible source of information. Two experiments examined whether children make credibility discriminations based on the consistency of speakers' affect cues. In Experiment 1, school-age children (7- to 8-year-olds) preferred to solicit information from consistent speakers (e.g. those who provided a negative statement with negative affect), over novel speakers, to a greater extent than they preferred to solicit information from inconsistent speakers (e.g. those who provided a negative statement with positive affect) over novel speakers. Preschoolers (4- to 5-year-olds) did not demonstrate this preference. Experiment 2 showed that school-age children's ratings of speakers were influenced by speakers' affect consistency when the attribute being judged was related to information acquisition (speakers' believability, "weird" speech), but not general characteristics (speakers' friendliness, likeability). Together, findings suggest that school-age children are sensitive to, and use, the congruency of affect cues to determine whether individuals are credible sources of information.

  5. Compensations to auditory feedback perturbations in congenitally blind and sighted speakers: Acoustic and articulatory data.

    Science.gov (United States)

    Trudeau-Fisette, Pamela; Tiede, Mark; Ménard, Lucie

    2017-01-01

    This study investigated the effects of visual deprivation on the relationship between speech perception and production by examining compensatory responses to real-time perturbations in auditory feedback. Specifically, acoustic and articulatory data were recorded while sighted and congenitally blind French speakers produced several repetitions of the vowel /ø/. At the acoustic level, blind speakers produced larger compensatory responses to altered vowels than their sighted peers. At the articulatory level, blind speakers also produced larger displacements of the upper lip, the tongue tip, and the tongue dorsum in compensatory responses. These findings suggest that blind speakers tolerate less discrepancy between actual and expected auditory feedback than sighted speakers. The study also suggests that sighted speakers have acquired more constrained somatosensory goals through the influence of visual cues perceived in face-to-face conversation, leading them to tolerate less discrepancy between expected and altered articulatory positions compared to blind speakers and thus resulting in smaller observed compensatory responses.

  6. –ED ALLOMORPHS AND LINGUISTIC KNOWLEDGE OF MALAY SPEAKERS OF ENGLISH: A DESCRIPTIVE AND CORRELATIONAL STUDY

    Directory of Open Access Journals (Sweden)

    Maskanah Mohammad Lotfie

    2017-09-01

    Full Text Available Malay is a language from the Austronesian family and unlike the Indo-European-originated English, it does not generally have inflectional temporal markers. Investigating this from a cross-linguistics - influence perspective, differences between the languages could mean difficulties for Malay speakers to acquire features of English. The objectives of this study are to investigate Malay speakers’ pronunciation of the English language –ed allomorphs – [d], [t] and [ɪd]/[əd] – and the relationship between the morphophonological forms and two types of linguistic knowledge, one of which is implicit while the other is explicit. Data were collated from fifty participants who are social science undergraduates and English majors who speak English as a second language. Four instruments were used to gauge the respondents’ verbal use of –ed allomorphs as well as their implicit and explicit knowledge of the allomorphs. Results indicate that the students’ verbal usage of the target items either lacks approximation to Standard English pronunciation or is largely dropped altogether. Results also suggest a moderate relationship between implicit and explicit knowledge of the allomorphs and their verbal production by Malay speakers of English. The finding illuminates acquisition problem of English language speakers whose mother tongue does not share similar inflectional markers. Pedagogical solutions can help learners of the English language to approximate Standard English and in the long run, enhance effective communication and increase chances of employability.

  7. Eye'm talking to you: speakers' gaze direction modulates co-speech gesture processing in the right MTG.

    Science.gov (United States)

    Holler, Judith; Kokal, Idil; Toni, Ivan; Hagoort, Peter; Kelly, Spencer D; Özyürek, Aslı

    2015-02-01

    Recipients process information from speech and co-speech gestures, but it is currently unknown how this processing is influenced by the presence of other important social cues, especially gaze direction, a marker of communicative intent. Such cues may modulate neural activity in regions associated either with the processing of ostensive cues, such as eye gaze, or with the processing of semantic information, provided by speech and gesture. Participants were scanned (fMRI) while taking part in triadic communication involving two recipients and a speaker. The speaker uttered sentences that were and were not accompanied by complementary iconic gestures. Crucially, the speaker alternated her gaze direction, thus creating two recipient roles: addressed (direct gaze) vs unaddressed (averted gaze) recipient. The comprehension of Speech&Gesture relative to SpeechOnly utterances recruited middle occipital, middle temporal and inferior frontal gyri, bilaterally. The calcarine sulcus and posterior cingulate cortex were sensitive to differences between direct and averted gaze. Most importantly, Speech&Gesture utterances, but not SpeechOnly utterances, produced additional activity in the right middle temporal gyrus when participants were addressed. Marking communicative intent with gaze direction modulates the processing of speech-gesture utterances in cerebral areas typically associated with the semantic processing of multi-modal communicative acts. © The Author (2014). Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  8. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization.

    Science.gov (United States)

    Kriengwatana, Buddhamas; Escudero, Paola; Ten Cate, Carel

    2014-01-01

    The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals - topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  9. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization

    Directory of Open Access Journals (Sweden)

    Buddhamas eKriengwatana

    2015-01-01

    Full Text Available The extent to which human speech perception evolved by taking advantage of predispositions and pre-existing features of vertebrate auditory and cognitive systems remains a central question in the evolution of speech. This paper reviews asymmetries in vowel perception, speaker voice recognition, and speaker normalization in non-human animals – topics that have not been thoroughly discussed in relation to the abilities of non-human animals, but are nonetheless important aspects of vocal perception. Throughout this paper we demonstrate that addressing these issues in non-human animals is relevant and worthwhile because many non-human animals must deal with similar issues in their natural environment. That is, they must also discriminate between similar-sounding vocalizations, determine signaler identity from vocalizations, and resolve signaler-dependent variation in vocalizations from conspecifics. Overall, we find that, although plausible, the current evidence is insufficiently strong to conclude that directional asymmetries in vowel perception are specific to humans, or that non-human animals can use voice characteristics to recognize human individuals. However, we do find some indication that non-human animals can normalize speaker differences. Accordingly, we identify avenues for future research that would greatly improve and advance our understanding of these topics.

  10. An analysis of topics and vocabulary in Chinese oral narratives by normal speakers and speakers with fluent aphasia.

    Science.gov (United States)

    Law, Sam-Po; Kong, Anthony Pak-Hin; Lai, Christy

    2018-01-01

    This study analysed the topic and vocabulary of Chinese speakers based on language samples of personal recounts in a large spoken Chinese database recently made available in the public domain, i.e. Cantonese AphasiaBank ( http://www.speech.hku.hk/caphbank/search/ ). The goal of the analysis is to offer clinicians a rich source for selecting ecologically valid training materials for rehabilitating Chinese-speaking people with aphasia (PWA) in the design and planning of culturally and linguistically appropriate treatments. Discourse production of 65 Chinese-speaking PWA of fluent types (henceforth, PWFA) and their non-aphasic controls narrating an important event in their life were extracted from Cantonese AphasiaBank. Analyses of topics and vocabularies in terms of part-of-speech, word frequency, lexical semantics, and diversity were conducted. There was significant overlap in topics between the two groups. While the vocabulary was larger for controls than that of PWFA as expected, they were similar in distribution across parts-of-speech, frequency of occurrence, and the ratio of concrete to abstract items in major open word classes. Moreover, proportionately more different verbs than nouns were employed at the individual level for both speaker groups. The findings provide important implications for guiding directions of aphasia rehabilitation not only of fluent but also non-fluent Chinese aphasic speakers.

  11. Making Math Real: Effective Qualities of Guest Speaker Presentations and the Impact of Speakers on Student Attitude and Achievement in the Algebra Classroom

    Science.gov (United States)

    McKain, Danielle R.

    2012-01-01

    The term real world is often used in mathematics education, yet the definition of real-world problems and how to incorporate them in the classroom remains ambiguous. One way real-world connections can be made is through guest speakers. Guest speakers can offer different perspectives and share knowledge about various subject areas, yet the impact…

  12. More on Dutch English ... please? : a study of request performance by Dutch native speakers, English native speakers and Dutch learners of English

    NARCIS (Netherlands)

    Hendriks, B.C.

    2002-01-01

    The purpose of the study is to investigate similarities and differences in the use of politeness strategies in formulating requests between Dutch learners of English, native speakers of Dutch and native speakers of English. A second objective was to gain more insights into the influence of

  13. Legitimacy and Social Class in Catalan Language Education for Adults

    Science.gov (United States)

    Frekko, Susan E.

    2013-01-01

    Adult students of Catalan are worthy of study because they reveal complexities underlying taken-for-granted assumptions about Catalan speakers and Castilian speakers. Far from fitting into neat bundles aligning language of origin, social class, and national orientation, the students in this study exemplify the breakdown of boundaries traditionally…

  14. "I May Be a Native Speaker but I'm Not Monolingual": Reimagining "All" Teachers' Linguistic Identities in TESOL

    Science.gov (United States)

    Ellis, Elizabeth M.

    2016-01-01

    Teacher linguistic identity has so far mainly been researched in terms of whether a teacher identifies (or is identified by others) as a native speaker (NEST) or nonnative speaker (NNEST) (Moussu & Llurda, 2008; Reis, 2011). Native speakers are presumed to be monolingual, and nonnative speakers, although by definition bilingual, tend to be…

  15. Shhh… I Need Quiet! Children's Understanding of American, British, and Japanese-accented English Speakers.

    Science.gov (United States)

    Bent, Tessa; Holt, Rachael Frush

    2018-02-01

    Children's ability to understand speakers with a wide range of dialects and accents is essential for efficient language development and communication in a global society. Here, the impact of regional dialect and foreign-accent variability on children's speech understanding was evaluated in both quiet and noisy conditions. Five- to seven-year-old children ( n = 90) and adults ( n = 96) repeated sentences produced by three speakers with different accents-American English, British English, and Japanese-accented English-in quiet or noisy conditions. Adults had no difficulty understanding any speaker in quiet conditions. Their performance declined for the nonnative speaker with a moderate amount of noise; their performance only substantially declined for the British English speaker (i.e., below 93% correct) when their understanding of the American English speaker was also impeded. In contrast, although children showed accurate word recognition for the American and British English speakers in quiet conditions, they had difficulty understanding the nonnative speaker even under ideal listening conditions. With a moderate amount of noise, their perception of British English speech declined substantially and their ability to understand the nonnative speaker was particularly poor. These results suggest that although school-aged children can understand unfamiliar native dialects under ideal listening conditions, their ability to recognize words in these dialects may be highly susceptible to the influence of environmental degradation. Fully adult-like word identification for speakers with unfamiliar accents and dialects may exhibit a protracted developmental trajectory.

  16. Social Computing

    CERN Multimedia

    CERN. Geneva

    2011-01-01

    The past decade has witnessed a momentous transformation in the way people interact with each other. Content is now co-produced, shared, classified, and rated by millions of people, while attention has become the ephemeral and valuable resource that everyone seeks to acquire. This talk will describe how social attention determines the production and consumption of content within both the scientific community and social media, how its dynamics can be used to predict the future and the role that social media plays in setting the public agenda. About the speaker Bernardo Huberman is a Senior HP Fellow and Director of the Social Computing Lab at Hewlett Packard Laboratories. He received his Ph.D. in Physics from the University of Pennsylvania, and is currently a Consulting Professor in the Department of Applied Physics at Stanford University. He originally worked in condensed matter physics, ranging from superionic conductors to two-dimensional superfluids, and made contributions to the theory of critical p...

  17. Automatic Classification of Marine Mammals with Speaker Classification Methods.

    Science.gov (United States)

    Kreimeyer, Roman; Ludwig, Stefan

    2016-01-01

    We present an automatic acoustic classifier for marine mammals based on human speaker classification methods as an element of a passive acoustic monitoring (PAM) tool. This work is part of the Protection of Marine Mammals (PoMM) project under the framework of the European Defense Agency (EDA) and joined by the Research Department for Underwater Acoustics and Geophysics (FWG), Bundeswehr Technical Centre (WTD 71) and Kiel University. The automatic classification should support sonar operators in the risk mitigation process before and during sonar exercises with a reliable automatic classification result.

  18. Developing a Speaker Identification System for the DARPA RATS Project

    DEFF Research Database (Denmark)

    Plchot, O; Matsoukas, S; Matejka, P

    2013-01-01

    present results using multiple SID systems differing mainly in the algorithm used for voice activity detection (VAD) and feature extraction. We show that (a) unsupervised VAD performs as well supervised methods in terms of downstream SID performance, (b) noise-robust feature extraction methods......This paper describes the speaker identification (SID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state of the art detection capabilities on audio from highly degraded communication channels. We...

  19. Utilising Tree-Based Ensemble Learning for Speaker Segmentation

    DEFF Research Database (Denmark)

    Abou-Zleikha, Mohamed; Tan, Zheng-Hua; Christensen, Mads Græsbøll

    2014-01-01

    for a certain condition, the model becomes biased to the data used for training limiting the model’s generalisation ability. In this paper, we propose a BIC-based tuning-free approach for speaker segmentation through the use of ensemble-based learning. A forest of segmentation trees is constructed in which each...... points. The proposed approach is tested on artificially created conversations from the TIMIT database. The approach proposed show very accurate results comparable to those achieved by the-state-of-the-art methods with a 9% (absolute) higher F 1 compared with the standard ΔBIC with optimally tuned penalty...

  20. Text-Independent Speaker Identification Using the Histogram Transform Model

    DEFF Research Database (Denmark)

    Ma, Zhanyu; Yu, Hong; Tan, Zheng-Hua

    2016-01-01

    In this paper, we propose a novel probabilistic method for the task of text-independent speaker identification (SI). In order to capture the dynamic information during SI, we design a super-MFCCs features by cascading three neighboring Mel-frequency Cepstral coefficients (MFCCs) frames together....... These super-MFCC vectors are utilized for probabilistic model training such that the speaker’s characteristics can be sufficiently captured. The probability density function (PDF) of the aforementioned super-MFCCs features is estimated by the recently proposed histogram transform (HT) method. To recedes...

  1. Gender Identification of the Speaker Using VQ Method

    Directory of Open Access Journals (Sweden)

    Vasif V. Nabiyev

    2009-11-01

    Full Text Available Speaking is the easiest and natural form of communication between people. Intensive studies are made in order to provide this communication via computers between people. The systems using voice biometric technology are attracting attention especially in the angle of cost and usage. When compared with the other biometic systems the application is much more practical. For example by using a microphone placed in the environment voice record can be obtained even without notifying the user and the system can be applied. Moreover the remote access facility is one of the other advantages of voice biometry. In this study, it is aimed to automatically determine the gender of the speaker through the speech waves which include personal information. If the speaker gender can be determined while composing models according to the gender information, the success of voice recognition systems can be increased in an important degree. Generally all the speaker recognition systems are composed of two parts which are feature extraction and matching. Feature extraction is the procedure in which the least information presenting the speech and the speaker is determined through voice signal. There are different features used in voice applications such as LPC, MFCC and PLP. In this study as a feature vector MFCC is used. Feature mathcing is the procedure in which the features derived from unknown speakers and known speaker group are compared. According to the text used in comparison the system is devided to two parts that are text dependent and text independent. While the same text is used in text dependent systems, different texts are used in indepentent text systems. Nowadays, DTW and HMM are text dependent, VQ and GMM are text indepentent matching methods. In this study due to the high success ratio and simple application features VQ approach is used.In this study a system which determines the speaker gender automatically and text independent is proposed. The proposed

  2. LEARNING VECTOR QUANTIZATION FOR ADAPTED GAUSSIAN MIXTURE MODELS IN AUTOMATIC SPEAKER IDENTIFICATION

    Directory of Open Access Journals (Sweden)

    IMEN TRABELSI

    2017-05-01

    Full Text Available Speaker Identification (SI aims at automatically identifying an individual by extracting and processing information from his/her voice. Speaker voice is a robust a biometric modality that has a strong impact in several application areas. In this study, a new combination learning scheme has been proposed based on Gaussian mixture model-universal background model (GMM-UBM and Learning vector quantization (LVQ for automatic text-independent speaker identification. Features vectors, constituted by the Mel Frequency Cepstral Coefficients (MFCC extracted from the speech signal are used to train the New England subset of the TIMIT database. The best results obtained (90% for gender- independent speaker identification, 97 % for male speakers and 93% for female speakers for test data using 36 MFCC features.

  3. Interpreter-mediated neuropsychological testing of monolingual Spanish speakers.

    Science.gov (United States)

    Casas, Rachel; Guzmán-Vélez, Edmarie; Cardona-Rodriguez, Javier; Rodriguez, Nayra; Quiñones, Gabriela; Izaguirre, Borja; Tranel, Daniel

    2012-01-01

    The primary objective of this study was to investigate empirically whether using an interpreter to conduct neuropsychological testing of monolingual Spanish speakers affects test scores. Participants included 40 neurologically normal Spanish speakers with limited English proficiency, aged 18-65 years (M = 39.7, SD = 13.9), who completed the Vocabulary, Similarities, Block Design, and Matrix Reasoning subtests of the Wechsler Adult Intelligence Scale-III in two counterbalanced conditions: with and without an interpreter. Results indicated that interpreter use significantly increased scores on Vocabulary and Similarities. However, scores on Block Design and Matrix Reasoning did not differ depending on whether or not an interpreter was used. In addition the findings suggested a trend toward higher variability in scores when an interpreter was used to administer Vocabulary and Similarities; this trend did not show up for Block Design or Matrix Reasoning. Together the results indicate that interpreter use may significantly affect scores for some tests commonly used in neuropsychological practice, with this influence being greater for verbally mediated tests. Additional research is needed to identify the types of tests that may be most affected as well as the factors that contribute to the effects. In the meantime neuropsychologists are encouraged to avoid interpreter use whenever practically possible, particularly for tests with high demands on interpreter abilities and skills, with tests that have not been appropriately adapted and translated into the patient's target language, and with interpreters who are not trained professionals.

  4. The Halo surrounding native English speaker teachers in Indonesia

    Directory of Open Access Journals (Sweden)

    Angga Kramadibrata

    2016-01-01

    Full Text Available The Native Speaker Fallacy, a commonly held belief that Native English Speaker Teachers (NESTs are inherently better than Non-NESTs, has long been questioned by ELT researchers. However, this belief still stands strong in the general public. This research looks to understand how much a teacher’s nativeness affects a student’s attitude towards them, as well as the underlying reasons for their attitudes. Sixty seven respondents in two groups were asked to watch an animated teaching video, after which they completed a questionnaire that used Likert-scales to assess comprehensibility, clarity of explanation, engagement, and preference. The videos for both groups were identical apart from the narrator; one spoke in British English, while the other, Indian English. In addition, they were also visually identified as Caucasian and Asian, respectively. The video was controlled for speed of delivery. The quantitative data were then triangulated using qualitative data collected through open questions in the questionnaire as well as from a semi-structured interview conducted with 10 respondents. The data show that there is a significant implicit preference for NEST teachers in the video, as well as in respondent’s actual classes. However, when asked explicitly, respondents didn’t rank nativeness as a very important quality in English teachers. This discrepancy between implicit and explicit attitudes might be due to a subconscious cognitive bias, namely the Halo Effect, in which humans tend to make unjustified presumptions about a person based on known but irrelevant information.

  5. Adaptive dictionary based approach for background noise and speaker classification and subsequent source separation

    OpenAIRE

    Girish, K V Vijay; Ramakrishnan, A G; Ananthapadmanabha, T V

    2016-01-01

    A judicious combination of dictionary learning methods, block sparsity and source recovery algorithm are used in a hierarchical manner to identify the noises and the speakers from a noisy conversation between two people. Conversations are simulated using speech from two speakers, each with a different background noise, with varied SNR values, down to -10 dB. Ten each of randomly chosen male and female speakers from the TIMIT database and all the noise sources from the NOISEX database are used...

  6. The effect of speakers' sex on voice onset time in Mandarin stops

    OpenAIRE

    Li, Fangfang

    2013-01-01

    The goal of the present study is to examine the effect of speakers' gender on voice onset time in Mandarin speakers' stop productions. Word-initial lingual stops were elicited from 10 male and 10 female Mandarin speakers using a word-repetition task. The results revealed differentiated voice onset time (VOT) patterns between the two genders for all four lingual stops on raw VOT values. After factoring out speech rate variation, gender-related differences remained for voiced stops ...

  7. Family language policy, first language Irish speaker attitudes and community-based response to language shift

    OpenAIRE

    Ó hIfearnáin, Tadhg

    2013-01-01

    peer-reviewed This paper describes the complex and sometimes ambiguous attitudes of Gaeltacht Irish speakers towards the intergenerational transmission of Irish. It focuses on first language speaker data that was gathered as part of a larger field-based project among fluent, habitual speakers of Irish in the Múscrai Gaeltacht region in County Cork, Ireland, and compares the findings to Riagain's study of the more strongly Irish-speaking Corca Dhuibhne Gaeltacht some 20 years beforehand. It...

  8. Don’t look blank, happy, or sad: patterns of facial expressions of speakers in banks’ YouTube videos predict video’s popularity over time

    OpenAIRE

    Lewinski, P.

    2015-01-01

    There has been little focus on nonverbal communication in social media advertising campaigns. We propose that specific patterns of facial expressions predict the popularity of YouTube videos among users of social media. To test that proposition, we used a neuromarketing tool—FaceReader—to code facial videos of professional speakers who participated in the YouTube social media campaigns of 2 large commercial banks. We analyzed more than 25,000 video frames of 16 speakers’ 6 basic facial expres...

  9. Effects of Language Background on Gaze Behavior: A Crosslinguistic Comparison Between Korean and German Speakers.

    Science.gov (United States)

    Goller, Florian; Lee, Donghoon; Ansorge, Ulrich; Choi, Soonja

    2017-01-01

    Languages differ in how they categorize spatial relations: While German differentiates between containment ( in ) and support ( auf ) with distinct spatial words-(a) den Kuli IN die Kappe stecken ("put pen in cap"); (b) die Kappe AUF den Kuli stecken ("put cap on pen")-Korean uses a single spatial word ( kkita ) collapsing (a) and (b) into one semantic category, particularly when the spatial enclosure is tight-fit. Korean uses a different word (i.e., netha ) for loose-fits (e.g., apple in bowl). We tested whether these differences influence the attention of the speaker. In a crosslinguistic study, we compared native German speakers with native Korean speakers. Participants rated the similarity of two successive video clips of several scenes where two objects were joined or nested (either in a tight or loose manner). The rating data show that Korean speakers base their rating of similarity more on tight- versus loose-fit, whereas German speakers base their rating more on containment versus support ( in vs. auf ). Throughout the experiment, we also measured the participants' eye movements. Korean speakers looked equally long at the moving Figure object and at the stationary Ground object, whereas German speakers were more biased to look at the Ground object. Additionally, Korean speakers also looked more at the region where the two objects touched than did German speakers. We discuss our data in the light of crosslinguistic semantics and the extent of their influence on spatial cognition and perception.

  10. Sensitivity to phonological context in L2 spelling: evidence from Russian ESL speakers

    DEFF Research Database (Denmark)

    Dich, Nadya

    2010-01-01

    The study attempts to investigate factors underlying the development of spellers’ sensitivity to phonological context in English. Native English speakers and Russian speakers of English as a second language (ESL) were tested on their ability to use information about the coda to predict the spelling...... on the information about the coda when spelling vowels in nonwords. In both native and non-native speakers, context sensitivity was predicted by English word spelling; in Russian ESL speakers this relationship was mediated by English proficiency. L1 spelling proficiency did not facilitate L2 context sensitivity...

  11. Aerodynamic Characteristics of Syllable and Sentence Productions in Normal Speakers.

    Science.gov (United States)

    Thiel, Cedric; Yang, Jin; Crawley, Brianna; Krishna, Priya; Murry, Thomas

    2018-01-08

    Aerodynamic measures of subglottic air pressure (Ps) and airflow rate (AFR) are used to select behavioral voice therapy versus surgical treatment for voice disorders. However, these measures are usually taken during a series of syllables, which differs from conversational speech. Repeated syllables do not share the variation found in even simple sentences, and patients may use their best rather than typical voice unless specifically instructed otherwise. This study examined the potential differences in estimated Ps and AFR in syllable and sentence production and their effects on a measure of vocal efficiency in normal speakers. Prospective study. Measures of estimated Ps, AFR, and aerodynamic vocal efficiency (AVE) were obtained from 19 female and four male speakers ages 22-44 years with no history of voice disorders. Subjects repeated a series of /pa/ syllables and a sentence at comfortable effort level into a face mask with a pressure-sensing tube between the lips. AVE varies as a function of the speech material in normal subjects. Ps measures were significantly higher for the sentence-production samples than for the syllable-production samples. AFR was higher during sentence production than syllable production, but the difference was not statistically significant. AVE values were significantly higher for syllable versus sentence productions. The results suggest that subjects increase Ps and AFR in sentence compared with syllable production. Speaking task is a critical factor when considering measures of AVE, and this preliminary study provides a basis for further aerodynamic studies of patient populations. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  12. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.

    Science.gov (United States)

    Gebru, Israel D; Ba, Sileye; Li, Xiaofei; Horaud, Radu

    2018-05-01

    Speaker diarization consists of assigning speech signals to people engaged in a dialogue. An audio-visual spatiotemporal diarization model is proposed. The model is well suited for challenging scenarios that consist of several participants engaged in multi-party interaction while they move around and turn their heads towards the other participants rather than facing the cameras and the microphones. Multiple-person visual tracking is combined with multiple speech-source localization in order to tackle the speech-to-person association problem. The latter is solved within a novel audio-visual fusion method on the following grounds: binaural spectral features are first extracted from a microphone pair, then a supervised audio-visual alignment technique maps these features onto an image, and finally a semi-supervised clustering method assigns binaural spectral features to visible persons. The main advantage of this method over previous work is that it processes in a principled way speech signals uttered simultaneously by multiple persons. The diarization itself is cast into a latent-variable temporal graphical model that infers speaker identities and speech turns, based on the output of an audio-visual association process, executed at each time slice, and on the dynamics of the diarization variable itself. The proposed formulation yields an efficient exact inference procedure. A novel dataset, that contains audio-visual training data as well as a number of scenarios involving several participants engaged in formal and informal dialogue, is introduced. The proposed method is thoroughly tested and benchmarked with respect to several state-of-the art diarization algorithms.

  13. "More Than Words" : Social Validation in Close Relationships

    NARCIS (Netherlands)

    Koudenburg, Namkje; Gordijn, Ernestine H.; Postmes, Tom

    2014-01-01

    Conversations are susceptible to many disturbances: A speaker's hesitations, distractions, or, when communicating online, technical hiccups that may cause brief delays. Research among previously unacquainted individuals revealed that brief disruptions in conversational flow can have profound social

  14. Omission of definite and indefinite articles in the spontaneous speech of agrammatic speakers with Broca's aphasia

    NARCIS (Netherlands)

    Havik, E.; Bastiaanse, Y.R.M.

    2004-01-01

    Background: Cross-linguistic investigation of agrammatic speech in speakers of different languages allows us to tests theoretical accounts of the nature of agrammatism. A significant feature of the speech of many agrammatic speakers is a problem with article production. Mansson and Ahlsen (2001)

  15. The use of native speaker norms in critical period hypothesis research

    NARCIS (Netherlands)

    Andringa, S.

    2014-01-01

    In critical period hypothesis (CPH) research, native speaker (NS) norm groups have often been used to determine whether nonnative speakers (NNSs) were able to score within the NS range of scores. One goal of this article is to investigate what NS samples were used in previous CPH research. The

  16. Studies on inter-speaker variability in speech and its application in ...

    Indian Academy of Sciences (India)

    independent. ASR systems. ... In this review article, we study the variability in the acoustic signal among different speakers enunciating the ... review paper, we will describe two standard procedures that are used to reduce inter-speaker vari- ations to ...

  17. Studies on inter-speaker variability in speech and its application in ...

    Indian Academy of Sciences (India)

    We then describe the universal-warping approach to speaker normalization which unifies many of the vowel normalization approaches and also shows the relation between speech production, perception and auditory processing. We then address the problem of inter-speaker variability in automatic speech recognition ...

  18. A Study of Non-Native English Speakers' Academic Performance at Santa Ana College.

    Science.gov (United States)

    Slark, Julie; Bateman, Harold

    A study was conducted in 1980-81 at Santa Ana College (SAC) to collect data on the English communication skills of non-native English speakers and to determine if a relationship existed between these skills and student's educational success. A sample of 22 classes, with an enrollment of at least 50% non-native English speakers and representing a…

  19. Speaker Input Variability Does Not Explain Why Larger Populations Have Simpler Languages.

    Science.gov (United States)

    Atkinson, Mark; Kirby, Simon; Smith, Kenny

    2015-01-01

    A learner's linguistic input is more variable if it comes from a greater number of speakers. Higher speaker input variability has been shown to facilitate the acquisition of phonemic boundaries, since data drawn from multiple speakers provides more information about the distribution of phonemes in a speech community. It has also been proposed that speaker input variability may have a systematic influence on individual-level learning of morphology, which can in turn influence the group-level characteristics of a language. Languages spoken by larger groups of people have less complex morphology than those spoken in smaller communities. While a mechanism by which the number of speakers could have such an effect is yet to be convincingly identified, differences in speaker input variability, which is thought to be larger in larger groups, may provide an explanation. By hindering the acquisition, and hence faithful cross-generational transfer, of complex morphology, higher speaker input variability may result in structural simplification. We assess this claim in two experiments which investigate the effect of such variability on language learning, considering its influence on a learner's ability to segment a continuous speech stream and acquire a morphologically complex miniature language. We ultimately find no evidence to support the proposal that speaker input variability influences language learning and so cannot support the hypothesis that it explains how population size determines the structural properties of language.

  20. A general auditory bias for handling speaker variability in speech? Evidence in humans and songbirds

    NARCIS (Netherlands)

    Kriengwatana, B.; Escudero, P.; Kerkhoven, A.H.; ten Cate, C.

    2015-01-01

    Different speakers produce the same speech sound differently, yet listeners are still able to reliably identify the speech sound. How listeners can adjust their perception to compensate for speaker differences in speech, and whether these compensatory processes are unique only to humans, is still

  1. Hispanic Children's Recognition of Languages and Perceptions about Speakers of Spanish, English, and Chinese.

    Science.gov (United States)

    Stafford, Mary E.; Jenckes, Lalima B.; Santos, Sheryl L.

    1997-01-01

    Perceptions about speakers of Spanish, English, and Chinese were studied among 303 Spanish- and English-speaking Hispanic students in grades 3 and 8 in the southwestern United States. Students' prior experience with a language increased favorable attitudes toward the language, solidarity with its speakers, and perceptions of status associated with…

  2. 7 CFR 247.13 - Provisions for non-English or limited-English speakers.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 4 2010-01-01 2010-01-01 false Provisions for non-English or limited-English speakers... § 247.13 Provisions for non-English or limited-English speakers. (a) What must State and local agencies do to ensure that non-English or limited-English speaking persons are aware of their rights and...

  3. Action naming in anomic aphasic speakers : Effects of instrumentality and name relation

    NARCIS (Netherlands)

    Jonkers, R.; Bastiaanse, Y.R.M.

    Many studies reveal effects of verb type on verb retrieval, mainly in agrammatic aphasic speakers. In the current study, two factors that might play a role in action naming in anomic aphasic speakers were considered: the conceptual factor instrumentality and the lexical factor name relation to a

  4. Speaker Input Variability Does Not Explain Why Larger Populations Have Simpler Languages.

    Directory of Open Access Journals (Sweden)

    Mark Atkinson

    Full Text Available A learner's linguistic input is more variable if it comes from a greater number of speakers. Higher speaker input variability has been shown to facilitate the acquisition of phonemic boundaries, since data drawn from multiple speakers provides more information about the distribution of phonemes in a speech community. It has also been proposed that speaker input variability may have a systematic influence on individual-level learning of morphology, which can in turn influence the group-level characteristics of a language. Languages spoken by larger groups of people have less complex morphology than those spoken in smaller communities. While a mechanism by which the number of speakers could have such an effect is yet to be convincingly identified, differences in speaker input variability, which is thought to be larger in larger groups, may provide an explanation. By hindering the acquisition, and hence faithful cross-generational transfer, of complex morphology, higher speaker input variability may result in structural simplification. We assess this claim in two experiments which investigate the effect of such variability on language learning, considering its influence on a learner's ability to segment a continuous speech stream and acquire a morphologically complex miniature language. We ultimately find no evidence to support the proposal that speaker input variability influences language learning and so cannot support the hypothesis that it explains how population size determines the structural properties of language.

  5. Development of a speaker discrimination test for cochlear implant users based on the Oldenburg Logatome corpus.

    Science.gov (United States)

    Mühler, Roland; Ziese, Michael; Rostalski, Dorothea

    2009-01-01

    The purpose of the study was to develop a speaker discrimination test for cochlear implant (CI) users. The speech material was drawn from the Oldenburg Logatome (OLLO) corpus, which contains 150 different logatomes read by 40 German and 10 French native speakers. The prototype test battery included 120 logatome pairs spoken by 5 male and 5 female speakers with balanced representations of the conditions 'same speaker' and 'different speaker'. Ten adult normal-hearing listeners and 12 adult postlingually deafened CI users were included in a study to evaluate the suitability of the test. The mean speaker discrimination score for the CI users was 67.3% correct and for the normal-hearing listeners 92.2% correct. A significant influence of voice gender and fundamental frequency difference on the speaker discrimination score was found in CI users as well as in normal-hearing listeners. Since the test results of the CI users were significantly above chance level and no ceiling effect was observed, we conclude that subsets of the OLLO corpus are very well suited to speaker discrimination experiments in CI users. Copyright 2008 S. Karger AG, Basel.

  6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition

    Science.gov (United States)

    Kim, Myungjong; Kim, Younggwan; Yoo, Joohong; Wang, Jun; Kim, Hoirin

    2017-01-01

    This paper addresses the problem of recognizing the speech uttered by patients with dysarthria, which is a motor speech disorder impeding the physical production of speech. Patients with dysarthria have articulatory limitation, and therefore, they often have trouble in pronouncing certain sounds, resulting in undesirable phonetic variation. Modern automatic speech recognition systems designed for regular speakers are ineffective for dysarthric sufferers due to the phonetic variation. To capture the phonetic variation, Kullback-Leibler divergence based hidden Markov model (KL-HMM) is adopted, where the emission probability of state is parametrized by a categorical distribution using phoneme posterior probabilities obtained from a deep neural network-based acoustic model. To further reflect speaker-specific phonetic variation patterns, a speaker adaptation method based on a combination of L2 regularization and confusion-reducing regularization which can enhance discriminability between categorical distributions of KL-HMM states while preserving speaker-specific information is proposed. Evaluation of the proposed speaker adaptation method on a database of several hundred words for 30 speakers consisting of 12 mildly dysarthric, 8 moderately dysarthric, and 10 non-dysarthric control speakers showed that the proposed approach significantly outperformed the conventional deep neural network based speaker adapted system on dysarthric as well as non-dysarthric speech. PMID:28320669

  7. The Sound of Voice: Voice-Based Categorization of Speakers' Sexual Orientation within and across Languages.

    Directory of Open Access Journals (Sweden)

    Simone Sulpizio

    Full Text Available Empirical research had initially shown that English listeners are able to identify the speakers' sexual orientation based on voice cues alone. However, the accuracy of this voice-based categorization, as well as its generalizability to other languages (language-dependency and to non-native speakers (language-specificity, has been questioned recently. Consequently, we address these open issues in 5 experiments: First, we tested whether Italian and German listeners are able to correctly identify sexual orientation of same-language male speakers. Then, participants of both nationalities listened to voice samples and rated the sexual orientation of both Italian and German male speakers. We found that listeners were unable to identify the speakers' sexual orientation correctly. However, speakers were consistently categorized as either heterosexual or gay on the basis of how they sounded. Moreover, a similar pattern of results emerged when listeners judged the sexual orientation of speakers of their own and of the foreign language. Overall, this research suggests that voice-based categorization of sexual orientation reflects the listeners' expectations of how gay voices sound rather than being an accurate detector of the speakers' actual sexual identity. Results are discussed with regard to accuracy, acoustic features of voices, language dependency and language specificity.

  8. White Native English Speakers Needed: The Rhetorical Construction of Privilege in Online Teacher Recruitment Spaces

    Science.gov (United States)

    Ruecker, Todd; Ives, Lindsey

    2015-01-01

    Over the past few decades, scholars have paid increasing attention to the role of native speakerism in the field of TESOL. Several recent studies have exposed instances of native speakerism in TESOL recruitment discourses published through a variety of media, but none have focused specifically on professional websites advertising programs in…

  9. Processing Lexical and Speaker Information in Repetition and Semantic/Associative Priming

    Science.gov (United States)

    Lee, Chao-Yang; Zhang, Yu

    2018-01-01

    The purpose of this study is to investigate the interaction between processing lexical and speaker-specific information in spoken word recognition. The specific question is whether repetition and semantic/associative priming is reduced when the prime and target are produced by different speakers. In Experiment 1, the prime and target were repeated…

  10. Use of the BAT with a Cantonese-Putonghua Speaker with Aphasia

    Science.gov (United States)

    Kong, Anthony Pak-Hin; Weekes, Brendan Stuart

    2011-01-01

    The aim of this article is to illustrate the use of the Bilingual Aphasia Test (BAT) with a Cantonese-Putonghua speaker. We describe G, who is a relatively young Chinese bilingual speaker with aphasia. G's communication abilities in his L2, Putonghua, were impaired following brain damage. This impairment caused specific difficulties in…

  11. Grammatical versus Pragmatic Error: Employer Perceptions of Nonnative and Native English Speakers

    Science.gov (United States)

    Wolfe, Joanna; Shanmugaraj, Nisha; Sipe, Jaclyn

    2016-01-01

    Many communication instructors make allowances for grammatical error in nonnative English speakers' writing, but do businesspeople do the same? We asked 169 businesspeople to comment on three versions of an email with different types of errors. We found that businesspeople do make allowances for errors made by nonnative English speakers,…

  12. Pragmatic Instruction May Not Be Necessary among Heritage Speakers of Spanish: A Study on Requests

    Science.gov (United States)

    Barros García, María J.; Bachelor, Jeremy W.

    2018-01-01

    This paper studies the pragmatic competence of U.S. heritage speakers of Spanish in an attempt to determine (a) the degree of pragmatic transfer from English to Spanish experienced by heritage speakers when producing different types of requests in Spanish; and (b) how to best teach pragmatics to students of Spanish as a Heritage Language (SHL).…

  13. English Variety for the Public Domain in Kenya: Speakers' Attitudes and Views

    Science.gov (United States)

    Kioko, Angelina Nduku; Muthwii, Margaret Jepkirui

    2003-01-01

    The study sought to establish the attitudes of Kenyan speakers (n = 210) towards three varieties of English: (1) ethnically marked Kenyan English, (2) standard Kenyan English and (3) native speaker English (British, American, Australian, etc). Of the three varieties, the most preferred by both rural and urban respondents for use in the media and…

  14. Congenital Amusia in Speakers of a Tone Language: Association with Lexical Tone Agnosia

    Science.gov (United States)

    Nan, Yun; Sun, Yanan; Peretz, Isabelle

    2010-01-01

    Congenital amusia is a neurogenetic disorder that affects the processing of musical pitch in speakers of non-tonal languages like English and French. We assessed whether this musical disorder exists among speakers of Mandarin Chinese who use pitch to alter the meaning of words. Using the Montreal Battery of Evaluation of Amusia, we tested 117…

  15. Classifications of Vocalic Segments from Articulatory Kinematics: Healthy Controls and Speakers with Dysarthria

    Science.gov (United States)

    Yunusova, Yana; Weismer, Gary G.; Lindstrom, Mary J.

    2011-01-01

    Purpose: In this study, the authors classified vocalic segments produced by control speakers (C) and speakers with dysarthria due to amyotrophic lateral sclerosis (ALS) or Parkinson's disease (PD); classification was based on movement measures. The researchers asked the following questions: (a) Can vowels be classified on the basis of selected…

  16. Intelligibility of Standard German and Low German to Speakers of Dutch

    NARCIS (Netherlands)

    Gooskens, C.S.; Kürschner, Sebastian; van Bezooijen, R.

    2011-01-01

    This paper reports on the intelligibility of spoken Low German and Standard German for speakers of Dutch. Two aspects are considered. First, the relative potential for intelligibility of the Low German variety of Bremen and the High German variety of Modern Standard German for speakers of Dutch is

  17. How Similar Are Adult Second Language Learners and Spanish Heritage Speakers? Spanish Clitics and Word Order

    Science.gov (United States)

    Montrul, Silvina

    2010-01-01

    Recent studies of heritage speakers, many of whom possess incomplete knowledge of their family language, suggest that these speakers may be linguistically superior to second language (L2) learners only in phonology but not in morphosyntax. This study reexamines this claim by focusing on knowledge of clitic pronouns and word order in 24 L2 learners…

  18. A Joint Approach for Single-Channel Speaker Identification and Speech Separation

    DEFF Research Database (Denmark)

    Mowlaee, Pejman; Saeidi, Rahim; Christensen, Mads Græsbøll

    2012-01-01

    a sinusoidal model-based algorithm. The speech separation algorithm consists of a double-talk/single-talk detector followed by a minimum mean square error estimator of sinusoidal parameters for finding optimal codevectors from pre-trained speaker codebooks. In evaluating the proposed system, we start from......In this paper, we present a novel system for joint speaker identification and speech separation. For speaker identification a single-channel speaker identification algorithm is proposed which provides an estimate of signal-to-signal ratio (SSR) as a by-product. For speech separation, we propose...... a situation where we have prior information of codebook indices, speaker identities and SSR-level, and then, by relaxing these assumptions one by one, we demonstrate the efficiency of the proposed fully blind system. In contrast to previous studies that mostly focus on automatic speech recognition (ASR...

  19. Learning foreign labels from a foreign speaker: the role of (limited) exposure to a second language.

    Science.gov (United States)

    Akhtar, Nameera; Menjivar, Jennifer; Hoicka, Elena; Sabbagh, Mark A

    2012-11-01

    Three- and four-year-olds (N = 144) were introduced to novel labels by an English speaker and a foreign speaker (of Nordish, a made-up language), and were asked to endorse one of the speaker's labels. Monolingual English-speaking children were compared to bilingual children and English-speaking children who were regularly exposed to a language other than English. All children tended to endorse the English speaker's labels when asked 'What do you call this?', but when asked 'What do you call this in Nordish?', children with exposure to a second language were more likely to endorse the foreign label than monolingual and bilingual children. The findings suggest that, at this age, exposure to, but not necessarily immersion in, more than one language may promote the ability to learn foreign words from a foreign speaker.

  20. Speaker and Accent Variation Are Handled Differently: Evidence in Native and Non-Native Listeners.

    Science.gov (United States)

    Kriengwatana, Buddhamas; Terry, Josephine; Chládková, Kateřina; Escudero, Paola

    2016-01-01

    Listeners are able to cope with between-speaker variability in speech that stems from anatomical sources (i.e. individual and sex differences in vocal tract size) and sociolinguistic sources (i.e. accents). We hypothesized that listeners adapt to these two types of variation differently because prior work indicates that adapting to speaker/sex variability may occur pre-lexically while adapting to accent variability may require learning from attention to explicit cues (i.e. feedback). In Experiment 1, we tested our hypothesis by training native Dutch listeners and Australian-English (AusE) listeners without any experience with Dutch or Flemish to discriminate between the Dutch vowels /I/ and /ε/ from a single speaker. We then tested their ability to classify /I/ and /ε/ vowels of a novel Dutch speaker (i.e. speaker or sex change only), or vowels of a novel Flemish speaker (i.e. speaker or sex change plus accent change). We found that both Dutch and AusE listeners could successfully categorize vowels if the change involved a speaker/sex change, but not if the change involved an accent change. When AusE listeners were given feedback on their categorization responses to the novel speaker in Experiment 2, they were able to successfully categorize vowels involving an accent change. These results suggest that adapting to accents may be a two-step process, whereby the first step involves adapting to speaker differences at a pre-lexical level, and the second step involves adapting to accent differences at a contextual level, where listeners have access to word meaning or are given feedback that allows them to appropriately adjust their perceptual category boundaries.

  1. Language, meaning, and social cognition.

    Science.gov (United States)

    Holtgraves, Thomas M; Kashima, Yoshihisa

    2008-02-01

    Social cognition is meant to examine the process of meaningful social interaction. Despite the central involvement of language in this process, language has not received the focal attention that it deserves. Conceptualizing meaningful social interaction as the process of construction and exchange of meaning, the authors argue that language can be productively construed as a semiotic tool, a tool for meaning making and exchange, and that language use can produce unintended consequences in its users. First, the article shows a particular instance of language use to be a collaborative process that influences the representation of meaning in the speaker, the listener, and the collective that includes both the speaker and listener. It then argues that language use and social cognition may have reciprocal effects in the long run and may have significant implications for generating and maintaining cultural differences in social cognition.

  2. Efficient speaker verification using Gaussian mixture model component clustering.

    Energy Technology Data Exchange (ETDEWEB)

    De Leon, Phillip L. (New Mexico State University, Las Cruces, NM); McClanahan, Richard D.

    2012-04-01

    In speaker verification (SV) systems that employ a support vector machine (SVM) classifier to make decisions on a supervector derived from Gaussian mixture model (GMM) component mean vectors, a significant portion of the computational load is involved in the calculation of the a posteriori probability of the feature vectors of the speaker under test with respect to the individual component densities of the universal background model (UBM). Further, the calculation of the sufficient statistics for the weight, mean, and covariance parameters derived from these same feature vectors also contribute a substantial amount of processing load to the SV system. In this paper, we propose a method that utilizes clusters of GMM-UBM mixture component densities in order to reduce the computational load required. In the adaptation step we score the feature vectors against the clusters and calculate the a posteriori probabilities and update the statistics exclusively for mixture components belonging to appropriate clusters. Each cluster is a grouping of multivariate normal distributions and is modeled by a single multivariate distribution. As such, the set of multivariate normal distributions representing the different clusters also form a GMM. This GMM is referred to as a hash GMM which can be considered to a lower resolution representation of the GMM-UBM. The mapping that associates the components of the hash GMM with components of the original GMM-UBM is referred to as a shortlist. This research investigates various methods of clustering the components of the GMM-UBM and forming hash GMMs. Of five different methods that are presented one method, Gaussian mixture reduction as proposed by Runnall's, easily outperformed the other methods. This method of Gaussian reduction iteratively reduces the size of a GMM by successively merging pairs of component densities. Pairs are selected for merger by using a Kullback-Leibler based metric. Using Runnal's method of reduction, we

  3. The communicative style of a speaker can affect language comprehension? ERP evidence from the comprehension of irony.

    Science.gov (United States)

    Regel, Stefanie; Coulson, Seana; Gunter, Thomas C

    2010-01-22

    An important issue in irony comprehension concerns when and how listeners integrate extra-linguistic and linguistic information to compute the speaker's intended meaning. To assess whether knowledge about the speaker's communicative style impacts the brain response to irony, ERPs were recorded as participants read short passages that ended either with literal or ironic statements made by one of two speakers. The experiment was carried out in two sessions in which each speaker's use of irony was manipulated. In Session 1, 70% of ironic statements were made by the ironic speaker, while the non-ironic speaker expressed 30% of them. For irony by the non-ironic speaker, an increased P600 was observed relative to literal utterances. By contrast, both ironic and literal statements made by the ironic speaker elicited similar P600 amplitudes. In Session 2, conducted 1 day later, both speakers' use of irony was balanced (i.e. 50% ironic, 50% literal). ERPs for Session 2 showed an irony-related P600 for the ironic speaker but not for the non-ironic speaker. Moreover, P200 amplitude was larger for sentences congruent with each speaker's communicative style (i.e. for irony made by the ironic speaker, and for literal statements made by the non-ironic speaker). These findings indicate that pragmatic knowledge about speakers can affect language comprehension 200 ms after the onset of a critical word, as well as neurocognitive processes underlying the later stages of comprehension (500-900 ms post-onset). Thus perceived speakers' characteristics dynamically impact the construction of appropriate interpretations of ironic utterances. Copyright 2009 Elsevier B.V. All rights reserved.

  4. Looking at the evidence in visual world : eye-movements reveal how bilingual and monolingual Turkish speakers process grammatical evidentiality

    NARCIS (Netherlands)

    Arslan, Seckin; Bastiaanse, Roelien; Felser, Claudia

    2015-01-01

    This study presents pioneering data on how adult early bilinguals (heritage speakers) and late bilingual speakers of Turkish and German process grammatical evidentiality in a visual world setting in comparison to monolingual speakers of Turkish. Turkish marks evidentiality, the linguistic reference

  5. Switches to English during French Service Encounters: Relationships with L2 French Speakers' Willingness to Communicate and Motivation

    Science.gov (United States)

    McNaughton, Stephanie; McDonough, Kim

    2015-01-01

    This exploratory study investigated second language (L2) French speakers' service encounters in the multilingual setting of Montreal, specifically whether switches to English during French service encounters were related to L2 speakers' willingness to communicate or motivation. Over a two-week period, 17 French L2 speakers in Montreal submitted…

  6. Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization

    Directory of Open Access Journals (Sweden)

    Umit H. Yapanel

    2008-08-01

    Full Text Available A proven method for achieving effective automatic speech recognition (ASR due to speaker differences is to perform acoustic feature speaker normalization. More effective speaker normalization methods are needed which require limited computing resources for real-time performance. The most popular speaker normalization technique is vocal-tract length normalization (VTLN, despite the fact that it is computationally expensive. In this study, we propose a novel online VTLN algorithm entitled built-in speaker normalization (BISN, where normalization is performed on-the-fly within a newly proposed PMVDR acoustic front end. The novel algorithm aspect is that in conventional frontend processing with PMVDR and VTLN, two separating warping phases are needed; while in the proposed BISN method only one single speaker dependent warp is used to achieve both the PMVDR perceptual warp and VTLN warp simultaneously. This improved integration unifies the nonlinear warping performed in the front end and reduces simultaneously. This improved integration unifies the nonlinear warping performed in the front end and reduces computational requirements, thereby offering advantages for real-time ASR systems. Evaluations are performed for (i an in-car extended digit recognition task, where an on-the-fly BISN implementation reduces the relative word error rate (WER by 24%, and (ii for a diverse noisy speech task (SPINE 2, where the relative WER improvement was 9%, both relative to the baseline speaker normalization method.

  7. Laryngeal adjustments for devoicing of /h/: A within-speaker study

    Science.gov (United States)

    Koenig, Laura

    2005-09-01

    Past work has investigated cross-speaker and cross-gender differences in voicing of /h/ in English speakers. The purpose of this study was to see whether a phonetically sophisticated speaker could intentionally alter his /h/ voicing patterns, and, if so, how he would effect any changes. One adult male speaker of American English, a trained phonetician and dialectologist, produced approximately 500 repetitions of intervocalic /h/ in short carrier phrases, with differing vowel contexts and loudness levels. In the first block, the speaker produced the utterances normally (i.e., without specific instructions on /h/ production); in the second, he was explicitly asked to devoice his /h/'s. Results indicated that the incidence of devoiced /h/ increased from 2% in the first block to 69% in the second block. On average, the /h/'s in the second block were produced with higher baseline airflows, indicating more extreme laryngeal abduction. This alone did not account for the speaker's devoicing behavior, however, since the soft condition, which had the lowest peak airflows in the second block, had the most devoicing. Voice source measures will be compared between the two blocks to clarify how the speaker altered his laryngeal setting to achieve more devoicing. [Work supported by NIH.

  8. Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization

    Directory of Open Access Journals (Sweden)

    Yapanel UmitH

    2008-01-01

    Full Text Available A proven method for achieving effective automatic speech recognition (ASR due to speaker differences is to perform acoustic feature speaker normalization. More effective speaker normalization methods are needed which require limited computing resources for real-time performance. The most popular speaker normalization technique is vocal-tract length normalization (VTLN, despite the fact that it is computationally expensive. In this study, we propose a novel online VTLN algorithm entitled built-in speaker normalization (BISN, where normalization is performed on-the-fly within a newly proposed PMVDR acoustic front end. The novel algorithm aspect is that in conventional frontend processing with PMVDR and VTLN, two separating warping phases are needed; while in the proposed BISN method only one single speaker dependent warp is used to achieve both the PMVDR perceptual warp and VTLN warp simultaneously. This improved integration unifies the nonlinear warping performed in the front end and reduces simultaneously. This improved integration unifies the nonlinear warping performed in the front end and reduces computational requirements, thereby offering advantages for real-time ASR systems. Evaluations are performed for (i an in-car extended digit recognition task, where an on-the-fly BISN implementation reduces the relative word error rate (WER by 24%, and (ii for a diverse noisy speech task (SPINE 2, where the relative WER improvement was 9%, both relative to the baseline speaker normalization method.

  9. Selective cortical representation of attended speaker in multi-talker speech perception.

    Science.gov (United States)

    Mesgarani, Nima; Chang, Edward F

    2012-05-10

    Humans possess a remarkable ability to attend to a single speaker's voice in a multi-talker background. How the auditory system manages to extract intelligible speech under such acoustically complex and adverse listening conditions is not known, and, indeed, it is not clear how attended speech is internally represented. Here, using multi-electrode surface recordings from the cortex of subjects engaged in a listening task with two simultaneous speakers, we demonstrate that population responses in non-primary human auditory cortex encode critical features of attended speech: speech spectrograms reconstructed based on cortical responses to the mixture of speakers reveal the salient spectral and temporal features of the attended speaker, as if subjects were listening to that speaker alone. A simple classifier trained solely on examples of single speakers can decode both attended words and speaker identity. We find that task performance is well predicted by a rapid increase in attention-modulated neural selectivity across both single-electrode and population-level cortical responses. These findings demonstrate that the cortical representation of speech does not merely reflect the external acoustic environment, but instead gives rise to the perceptual aspects relevant for the listener's intended goal.

  10. A Method to Integrate GMM, SVM and DTW for Speaker Recognition

    Directory of Open Access Journals (Sweden)

    Ing-Jr Ding

    2014-01-01

    Full Text Available This paper develops an effective and efficient scheme to integrate Gaussian mixture model (GMM, support vector machine (SVM, and dynamic time wrapping (DTW for automatic speaker recognition. GMM and SVM are two popular classifiers for speaker recognition applications. DTW is a fast and simple template matching method, and it is frequently seen in applications of speech recognition. In this work, DTW does not play a role to perform speech recognition, and it will be employed to be a verifier for verification of valid speakers. The proposed combination scheme of GMM, SVM and DTW, called SVMGMM-DTW, for speaker recognition in this study is a two-phase verification process task including GMM-SVM verification of the first phase and DTW verification of the second phase. By providing a double check to verify the identity of a speaker, it will be difficult for imposters to try to pass the security protection; therefore, the safety degree of speaker recognition systems will be largely increased. A series of experiments designed on door access control applications demonstrated that the superiority of the developed SVMGMM-DTW on speaker recognition accuracy.

  11. "Real Language": Combining Intermediate Spanish Language Learners and ESOL/Native Speakers for Vernacular Language Acquisition

    Directory of Open Access Journals (Sweden)

    Joe LaValle

    2017-06-01

    Full Text Available This paper addresses the possibilities of combining Spanish language learners and English language learners in high school and post-secondary institutions for mutual benefit to learn authentic language. Academic or "classroom" Spanish is insufficient to empower students for today's workplace. The concept behind "Real Language" is illustrated by an example of an interdisciplinary activity to facilitate communicative interaction in genuine language and promote cultural understanding between intermediate Spanish students and ESOL/native speakers at the high school and post-secondary level. Students are asked to utilize their life skills in interactive, freestyle conversation without the intervention of an instructor. The learning space for language exchange is an out-of-class venue for a non-intimidating, more authentic setting. This simple qualitative study investigates the potential value of this sort of interdisciplinary activity. The intent is to evaluate attitudes of the participants in relation to confidence in their ability to use the target language, and their willingness to use it in social and professional environments and, in addition, to facilitate cultural understanding. The positive result of the project is validated by the voice of the student participants as they reflect on their experience in "Real Language". Could this concept facilitate evolving strategies for interdisciplinary contemporary foreign language learning?

  12. Speaker specificity in speech perception: the importance of what is and is not in the signal

    Science.gov (United States)

    Dahan, Delphine; Scarborough, Rebecca A.

    2005-09-01

    In some American English dialects, /ae/ before /g/ (but not before /k/) raises to a vowel approaching [E], in effect reducing phonetic overlap between (e.g.) ``bag'' and ``back.'' Here, participants saw four written words on a computer screen (e.g., ``bag,'' ``back,'' ``dog,'' ``dock'') and heard a spoken word. Their task was to indicate which word they heard. Participants' eye movements to the written words were recorded. Participants in the ``ae-raising'' group heard identity-spliced ``bag''-like words containing the raised vowel [E] participants in the ``control'' group heard cross-spliced ``bag''-like words containing standard [ae]. Acoustically identical ``back''-like words were subsequently presented to both groups. The ae-raising-group participants identified ``back''-like words faster and more accurately, and made fewer fixations to the competitor ``bag,'' than control-group participants did. Thus, exposure to ae-raised realizations of ``bag'' facilitated the identification of ``back'' because of the reduced fit between the input and the altered representation of the competing hypothesis ``bag.'' This demonstrates that listeners evaluate the spoken input with respect to what is, but also what is not, in the signal, and that this evaluation involves speaker-specific representations. [Work supported by NSF Human and Social Dynamics 0433567.

  13. Listening to a non-native speaker: Adaptation and generalization

    Science.gov (United States)

    Clarke, Constance M.

    2004-05-01

    Non-native speech can cause perceptual difficulty for the native listener, but experience can moderate this difficulty. This study explored the perceptual benefit of a brief (approximately 1 min) exposure to foreign-accented speech using a cross-modal word matching paradigm. Processing speed was tracked by recording reaction times (RTs) to visual probe words following English sentences produced by a Spanish-accented speaker. In experiment 1, RTs decreased significantly over 16 accented utterances and by the end were equal to RTs to a native voice. In experiment 2, adaptation to one Spanish-accented voice improved perceptual efficiency for a new Spanish-accented voice, indicating that abstract properties of accented speech are learned during adaptation. The control group in Experiment 2 also adapted to the accented voice during the test block, suggesting adaptation can occur within two to four sentences. The results emphasize the flexibility of the human speech processing system and the need for a mechanism to explain this adaptation in models of spoken word recognition. [Research supported by an NSF Graduate Research Fellowship and the University of Arizona Cognitive Science Program.] a)Currently at SUNY at Buffalo, Dept. of Psych., Park Hall, Buffalo, NY 14260, cclarke2@buffalo.edu

  14. A Text-Independent Speaker Authentication System for Mobile Devices

    Directory of Open Access Journals (Sweden)

    Florentin Thullier

    2017-09-01

    Full Text Available This paper presents a text independent speaker authentication method adapted to mobile devices. Special attention was placed on delivering a fully operational application, which admits a sufficient reliability level and an efficient functioning. To this end, we have excluded the need for any network communication. Hence, we opted for the completion of both the training and the identification processes directly on the mobile device through the extraction of linear prediction cepstral coefficients and the naive Bayes algorithm as the classifier. Furthermore, the authentication decision is enhanced to overcome misidentification through access privileges that the user should attribute to each application beforehand. To evaluate the proposed authentication system, eleven participants were involved in the experiment, conducted in quiet and noisy environments. Public speech corpora were also employed to compare this implementation to existing methods. Results were efficient regarding mobile resources’ consumption. The overall classification performance obtained was accurate with a small number of samples. Then, it appeared that our authentication system might be used as a first security layer, but also as part of a multilayer authentication, or as a fall-back mechanism.

  15. Speaker gender identification based on majority vote classifiers

    Science.gov (United States)

    Mezghani, Eya; Charfeddine, Maha; Nicolas, Henri; Ben Amar, Chokri

    2017-03-01

    Speaker gender identification is considered among the most important tools in several multimedia applications namely in automatic speech recognition, interactive voice response systems and audio browsing systems. Gender identification systems performance is closely linked to the selected feature set and the employed classification model. Typical techniques are based on selecting the best performing classification method or searching optimum tuning of one classifier parameters through experimentation. In this paper, we consider a relevant and rich set of features involving pitch, MFCCs as well as other temporal and frequency-domain descriptors. Five classification models including decision tree, discriminant analysis, nave Bayes, support vector machine and k-nearest neighbor was experimented. The three best perming classifiers among the five ones will contribute by majority voting between their scores. Experimentations were performed on three different datasets spoken in three languages: English, German and Arabic in order to validate language independency of the proposed scheme. Results confirm that the presented system has reached a satisfying accuracy rate and promising classification performance thanks to the discriminating abilities and diversity of the used features combined with mid-level statistics.

  16. Speech spectrum's correlation with speakers' Eysenck personality traits.

    Science.gov (United States)

    Hu, Chao; Wang, Qiandong; Short, Lindsey A; Fu, Genyue

    2012-01-01

    The current study explored the correlation between speakers' Eysenck personality traits and speech spectrum parameters. Forty-six subjects completed the Eysenck Personality Questionnaire. They were instructed to verbally answer the questions shown on a computer screen and their responses were recorded by the computer. Spectrum parameters of /sh/ and /i/ were analyzed by Praat voice software. Formant frequencies of the consonant /sh/ in lying responses were significantly lower than that in truthful responses, whereas no difference existed on the vowel /i/ speech spectrum. The second formant bandwidth of the consonant /sh/ speech spectrum was significantly correlated with the personality traits of Psychoticism, Extraversion, and Neuroticism, and the correlation differed between truthful and lying responses, whereas the first formant frequency of the vowel /i/ speech spectrum was negatively correlated with Neuroticism in both response types. The results suggest that personality characteristics may be conveyed through the human voice, although the extent to which these effects are due to physiological differences in the organs associated with speech or to a general Pygmalion effect is yet unknown.

  17. DEEP ATTRACTOR NETWORK FOR SINGLE-MICROPHONE SPEAKER SEPARATION.

    Science.gov (United States)

    Chen, Zhuo; Luo, Yi; Mesgarani, Nima

    2017-03-01

    Despite the overwhelming success of deep learning in various speech processing tasks, the problem of separating simultaneous speakers in a mixture remains challenging. Two major difficulties in such systems are the arbitrary source permutation and unknown number of sources in the mixture. We propose a novel deep learning framework for single channel speech separation by creating attractor points in high dimensional embedding space of the acoustic signals which pull together the time-frequency bins corresponding to each source. Attractor points in this study are created by finding the centroids of the sources in the embedding space, which are subsequently used to determine the similarity of each bin in the mixture to each source. The network is then trained to minimize the reconstruction error of each source by optimizing the embeddings. The proposed model is different from prior works in that it implements an end-to-end training, and it does not depend on the number of sources in the mixture. Two strategies are explored in the test time, K-means and fixed attractor points, where the latter requires no post-processing and can be implemented in real-time. We evaluated our system on Wall Street Journal dataset and show 5.49% improvement over the previous state-of-the-art methods.

  18. Phonological processing in Mandarin speakers with congenital amusia.

    Science.gov (United States)

    Wang, Xiao; Peng, Gang

    2014-12-01

    Although there is an emerging consensus that both musical and linguistic pitch processing can be problematic for individuals with a developmental disorder termed congenital amusia, the nature of such a pitch-processing deficit, especially that demonstrated in a speech setting, remains unclear. Therefore, this study tested the performance of native Mandarin speakers, both with and without amusia, on discrimination and imitation tasks for Cantonese level tones, aiming to shed light on this issue. Results suggest that the impact of the phonological deficit, coupled with that of the domain-general pitch deficit, could provide a more comprehensive interpretation of Mandarin amusics' speech impairment. Specifically, when there was a high demand for pitch sensitivity, as in fine-grained pitch discriminations, the operation of the pitch-processing deficit played the more predominant role in modulating amusics' speech performance. But when the demand was low, as in discriminating naturally produced Cantonese level tones, the impact of the phonological deficit was more pronounced compared to that of the pitch-processing deficit. However, despite their perceptual deficits, Mandarin amusics' imitation abilities were comparable to controls'. Such selective impairment in tonal perception suggests that the phonological deficit more severely implicates amusics' input pathways.

  19. Standardization of nasometry for normal Egyptian Arabic speakers.

    Science.gov (United States)

    Abou-Elsaad, Tamer; Quriba, Amal; Baz, Hemmat; Elkassaby, Rasha

    2012-01-01

    The aim of this study was to collect the normative values of nasalance for Egyptian Arabic speakers in different age groups, using Arabic speech samples in order to compare patients with disturbed nasality. This study included 300 normal Egyptian volunteers divided into three groups according to their ages: group I = children (n = 92; age 3 years, 3 months to 9 years), group II = teenagers (n = 76; age 9-18 years) and group III = adults (n = 132; age above 18 years). The Nasometer II 6400 was used for the analysis of speech samples. All subjects were asked to perform four speech tasks that were based on the MacKay-Kummer SNAP Test-R and modified to be applicable to the Arabic language, especially to the Egyptian dialect. The normative values for nasometric assessment in the different age groups were studied. The results demonstrated nasalance score variations according to age and gender. Most of the nasalance score norms of the Egyptian children demonstrated statistically significant differences when compared with the norms of children for the MacKay-Kummer Test-R. The Egyptian Arabic SNAP test is an easy, noninvasive and objective procedure that is suitable for all age groups. Copyright © 2013 S. Karger AG, Basel.

  20. Acoustic variation during passage reading for speakers with dysarthria and healthy controls.

    Science.gov (United States)

    Kuo, Christina; Tjaden, Kris

    2016-01-01

    Acoustic variation in a passage read by speakers with dysarthria and healthy speakers was examined. 15 speakers with Multiple Sclerosis (MS), 12 speakers with Parkinson's disease (PD), and 14 healthy speakers were studied. Acoustic variables included measures of global speech timing (e.g., articulation rate, pause characteristics), vocal intensity (e.g., mean sound pressure level and intensity modulation), and segmental articulation (i.e., utterance-level second formant interquartile range (F2 IQR)). Acoustic measures were obtained from three segments operationally defined to represent the beginning, middle, and end of a reading passage. Two speaking conditions associated with common treatment techniques for dysarthria were included for comparison to a habitual speaking condition. These conditions included a slower-than-habitual rate (Slow) and greater-than-habitual intensity (Loud). There was some degree of acoustic variation across the three operationally-defined segments of the reading passage. The Slow, Loud and Habitual conditions yielded comparable characteristics of variation. Patterns of acoustic variation across the three passage segments also were largely similar across speaker groups. Within-task acoustic variation during passage reading should be considered when making decisions regarding speech sampling in clinical practice and research. The contributions of speech disorder severity and linguistic variables to within-task acoustic change warrant further investigation. Readers will be able to (1) discuss the motivation for studying and understanding within-task variation in contextual speech, (2) describe patterns of acoustic variation for speakers with dysarthria and healthy speakers during passage reading, (3) discuss the relationship between non-habitual speaking conditions and within-task variation, (4) understand the need to consider within-speaker, within-task variation in speech sampling. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Popular Public Discourse at Speakers' Corner: Negotiating Cultural Identities in Interaction

    DEFF Research Database (Denmark)

    McIlvenny, Paul

    1996-01-01

    In this paper I examine how cultural identities are actively negotiated in popular debate at a multicultural public setting in London. Speakers at Speakers' Corner manage the local construction of group affiliation, audience response and argument in and through talk, within the context of ethnic...... in which participant 'citizens' in the public sphere can actively struggle over cultural representation and identities. Using transcribed examples of video data recorded at Speakers' Corner my paper will examine how cultural identity is invoked in the management of active participation. Audiences...... and their affiliations are regulated and made accountable through the routines of membership categorisation and the policing of cultural identities and their imaginary borders....

  2. Popular Public Discourse at Speakers' Corner: Negotiating Cultural Identities in Interaction

    DEFF Research Database (Denmark)

    McIlvenny, Paul

    1996-01-01

    in which participant 'citizens' in the public sphere can actively struggle over cultural representation and identities. Using transcribed examples of video data recorded at Speakers' Corner my paper will examine how cultural identity is invoked in the management of active participation. Audiences......In this paper I examine how cultural identities are actively negotiated in popular debate at a multicultural public setting in London. Speakers at Speakers' Corner manage the local construction of group affiliation, audience response and argument in and through talk, within the context of ethnic...... and their affiliations are regulated and made accountable through the routines of membership categorisation and the policing of cultural identities and their imaginary borders....

  3. The Perception of Breathiness in the Voices of Pediatric Speakers.

    Science.gov (United States)

    Kopf, Lisa M; Skowronski, Mark D; Anand, Supraja; Eddins, David A; Shrivastav, Rahul

    2017-11-18

    The perception of pediatric voice quality has been investigated using clinical protocols developed for adult voices and acoustic analyses designed to identify important physical parameters associated with normal and dysphonic pediatric voices. Laboratory investigations of adult dysphonia have included sophisticated methods, including a psychoacoustic approach that involves a single-variable matching task (SVMT), characterized by high inter- and intra-listener reliability, and analyses that include bio-inspired models of auditory perception that have provided valuable information regarding adult voice quality. To establish the utility of a psychoacoustic approach to the investigation of voice quality perception in the context of pediatric voices? Six listeners judged the breathiness of 20 synthetic vowel stimuli using an SVMT. To support comparisons with previous data, stimuli were modeled after four pediatric speakers and synthesized using Klatt with five parameter settings that influence the perception of breathiness. The population average breathiness judgments were modeled with acoustic measures of loudness ratio, pitch strength, and cepstral peak. Listeners reliably judged the perceived breathiness of pediatric voices, as with previous investigations of breathiness in adult dysphonic voices. Breathiness judgments were accurately modeled by loudness ratio (r 2  = 0.93), pitch strength (r 2  = 0.91), and cepstral peak (r 2  = 0.82). Model accuracy was not affected significantly by including stimulus fundamental frequency and was slightly higher for pediatric than for adult voices. The SVMT proved robust for pediatric voices spanning a wide range of breathiness. The data indicate that this is a promising approach for future investigation of pediatric voice quality. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  4. Voice Handicap Index in Persian Speakers with Various Severities of Hearing Loss.

    Science.gov (United States)

    Aghadoost, Ozra; Moradi, Negin; Dabirmoghaddam, Payman; Aghadoost, Alireza; Naderifar, Ehsan; Dehbokri, Siavash Mohammadi

    2016-01-01

    The purpose of this study was to assess and compare the total score and subscale scores of the Voice Handicap Index (VHI) in speakers with and without hearing loss. A further aim was to determine if a correlation exists between severities of hearing loss with total scores and VHI subscale scores. In this cross-sectional, descriptive analytical study, 100 participants, divided in 2 groups of participants with and without hearing loss, were studied. Background information was gathered by interview, and VHI questionnaires were filled in by all participants. For all variables, including mean total score and VHI subscale scores, there was a considerable difference in speakers with and without hearing loss (p handicap related to voice in speakers with hearing loss. In addition, increased severity of hearing loss leads to more severe voice handicap. This finding emphasizes the need for a multilateral assessment and treatment of voice disorders in speakers with hearing loss. © 2017 S. Karger AG, Basel.

  5. Identifying the nonlinear mechanical behaviour of micro-speakers from their quasi-linear electrical response

    Science.gov (United States)

    Zilletti, Michele; Marker, Arthur; Elliott, Stephen John; Holland, Keith

    2017-05-01

    In this study model identification of the nonlinear dynamics of a micro-speaker is carried out by purely electrical measurements, avoiding any explicit vibration measurements. It is shown that a dynamic model of the micro-speaker, which takes into account the nonlinear damping characteristic of the device, can be identified by measuring the response between the voltage input and the current flowing into the coil. An analytical formulation of the quasi-linear model of the micro-speaker is first derived and an optimisation method is then used to identify a polynomial function which describes the mechanical damping behaviour of the micro-speaker. The analytical results of the quasi-linear model are compared with numerical results. This study potentially opens up the possibility of efficiently implementing nonlinear echo cancellers.

  6. The Impact of Telephone Channels on the Accuracy of Automatic Speaker Recognition

    Directory of Open Access Journals (Sweden)

    V. Delić

    2011-11-01

    Full Text Available This paper presents an experimental study on the impact of telephone channels on the accuracy of automatic speaker recognition. Speaker models and the design of the recognizer used in this study are based on Hidden Markov models. In order to simulate telephonequality speech signals, several experimental conditions were introduced taking two control factors into consideration: the type of the applied codec and the probability of transmission errors. In addition, the impact of echo signals – that are often present in Internet telephony – on the accuracy of automatic speaker recognition systems is considered. Finally, the paper provides a brief overview of several methodologies for the adaptation of the recognizer to the expected environmental conditions that may enhance the robustness of the speaker recognizer.

  7. The native-speaker fever in English language teaching (ELT: Pitting pedagogical competence against historical origin

    Directory of Open Access Journals (Sweden)

    Anchimbe, Eric A.

    2006-01-01

    Full Text Available This paper discusses English language teaching (ELT around the world, and argues that as a profession, it should emphasise pedagogical competence rather than native-speaker requirement in the recruitment of teachers in English as a foreign language (EFL and English as a second language (ESL contexts. It establishes that being a native speaker does not make one automatically a competent speaker or, of that matter, a competent teacher of the language. It observes that on many grounds, including physical, sociocultural, technological and economic changes in the world as well as the status of English as official and national language in many post-colonial regions, the distinction between native and non-native speakers is no longer valid.

  8. Toward a semantic approach to the English passive : on the basis of the speaker's conceptualization

    OpenAIRE

    Okuyama, Mika

    1991-01-01

    Since Bolinger's (1975) analysis, many linguists have been analyzing passives in terms of our (the speaker's) conceptual content.1 In such studies, the passive is assumed to be one of the grammatical codings ...

  9. A cognitive analysis of discourse processing in native and non-native speakers of English

    OpenAIRE

    Fourali, Chahid El-Hak

    1987-01-01

    This study establishes a quantitative and qualitative difference in the pattern of text processing of native and non native speakers of English. The psychological nature of this difference is explored in five studies. They reveal the following influences. 1 - Non-native speakers are disadvantaged when text processing relies on mental operations which are based on schema representations of the language e.g. assumption, evaluation and interpretation. They are not disadva...

  10. New Missing Features Mask Estimation Method for Speaker Recognition in Noisy Environments

    Directory of Open Access Journals (Sweden)

    José Ramón Calvo de Lara

    2012-06-01

    Full Text Available Normal 0 21 false false false ES X-NONE X-NONE /* Style Definitions */ table.MsoNormalTable {mso-style-name:"Tabla normal"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-qformat:yes; mso-style-parent:""; mso-padding-alt:0cm 5.4pt 0cm 5.4pt; mso-para-margin-top:0cm; mso-para-margin-right:0cm; mso-para-margin-bottom:10.0pt; mso-para-margin-left:0cm; line-height:115%; mso-pagination:widow-orphan; font-size:11.0pt; font-family:"Calibri","sans-serif"; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-fareast-font-family:"Times New Roman"; mso-fareast-theme-font:minor-fareast; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin;} Currently, many speaker recognition applications must handle speech corrupted by environmental additive noise without having a priori knowledge about the characteristics of noise. Some previous works in speaker recognition have used Missing Feature (MF approach to compensate for noise. In most of those applications the spectral reliability decision step is done using the Signal to Noise Ratio (SNR criterion. This has the goal of enhancing signal power rather than noise power, which could be dangerous in speaker recognition tasks, because useful speaker information could be removed. This work proposes a new mask estimation method based on Speaker Discriminative Information (SDI for determining spectral reliability in speaker recognition applications based on the MF approach. The proposal was evaluated through speaker verification experiments in speech corrupted by additive noise. Experiments demonstrated that this new criterion has a promising performance in speaker verification tasks.

  11. A Novel Scheme for Speaker Recognition Using a Phonetically-Aware Deep Neural Network

    Science.gov (United States)

    2014-05-01

    A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY -AWARE DEEP NEURAL NETWORK Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren Speech...framework. Our use of a phonetically aware model is motivated by the fact that the speech content’s effect on the speech signal have been mostly ig...SUBTITLE A Novel Scheme for Speaker Recognition Using a Phonetically -Aware Deep Neural Network 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT

  12. Does training make French speakers more able to identify lexical stress?

    OpenAIRE

    Schwab, Sandra; Llisterri, Joaquim

    2013-01-01

    This research takes the stress deafness hypothesis as a starting point (e.g. Dupoux et al., 2008), and, more specifically, the fact that French speakers present difficulties in perceiving lexical stress in a free-stress language. In this framework, we aim at determining whether a prosodic training could improve the ability of French speakers to identify the stressed syllable in Spanish words. Three groups of participants took part in this experiment. The Native group was composed of 16 speake...

  13. Speaker identification for the improvement of the security communication between law enforcement units

    Science.gov (United States)

    Tovarek, Jaromir; Partila, Pavol

    2017-05-01

    This article discusses the speaker identification for the improvement of the security communication between law enforcement units. The main task of this research was to develop the text-independent speaker identification system which can be used for real-time recognition. This system is designed for identification in the open set. It means that the unknown speaker can be anyone. Communication itself is secured, but we have to check the authorization of the communication parties. We have to decide if the unknown speaker is the authorized for the given action. The calls are recorded by IP telephony server and then these recordings are evaluate using classification If the system evaluates that the speaker is not authorized, it sends a warning message to the administrator. This message can detect, for example a stolen phone or other unusual situation. The administrator then performs the appropriate actions. Our novel proposal system uses multilayer neural network for classification and it consists of three layers (input layer, hidden layer, and output layer). A number of neurons in input layer corresponds with the length of speech features. Output layer then represents classified speakers. Artificial Neural Network classifies speech signal frame by frame, but the final decision is done over the complete record. This rule substantially increases accuracy of the classification. Input data for the neural network are a thirteen Mel-frequency cepstral coefficients, which describe the behavior of the vocal tract. These parameters are the most used for speaker recognition. Parameters for training, testing and validation were extracted from recordings of authorized users. Recording conditions for training data correspond with the real traffic of the system (sampling frequency, bit rate). The main benefit of the research is the system developed for text-independent speaker identification which is applied to secure communication between law enforcement units.

  14. REFUSAL STRATEGIES USED BY MALAY ESL STUDENTSAND ENGLISH NATIVE SPEAKERS TO REFUSE A REQUEST

    OpenAIRE

    Saad, Norma; Bidin, Siti Jamilah; Shabdin, Ahmad Affendi

    2017-01-01

    The present study investigates similarities and differences of a speech act of refusal in English as realized by Malay Speakers of English (MSE) and Native Speakers of English (NSE). The study examined the types and also the contents of the strategies used by the two groups when refusing a request made by a higher status interlocutor. An Enhanced Open Role-play was utilized to obtain data on the types and content of refusal strategies. Participants of the study comprised 12 MSE undergraduate ...

  15. Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker Verification

    OpenAIRE

    Sarkar, A. K.; Tan, Zheng-Hua

    2016-01-01

    In this paper, we propose pass-phrase dependent background models (PBMs) for text-dependent (TD) speaker verification (SV) to integrate the pass-phrase identification process into the conventional TD-SV system, where a PBM is derived from a text-independent background model through adaptation using the utterances of a particular pass-phrase. During training, pass-phrase specific target speaker models are derived from the particular PBM using the training data for the respective target model. ...

  16. A Comparative Study on the Use of Compliment Response Strategies by Persian and English Native Speakers

    Directory of Open Access Journals (Sweden)

    Mansour Shabani

    2015-09-01

    Full Text Available The significance of pragmatic knowledge and politeness strategies has recently been emphasized in language learning and teaching. Most communication failures originate in the lack of pragmatic awareness which is evident among EFL learners while communicating with English native speakers. The present study aimed at investigating compliment response strategies, as a sub-category of politeness strategies, used by a group of Persian and English native speakers, and examining the effect of gender on the use of strategies to respond to compliments. To these ends and with the use of convenient sampling, thirty Iranian native speakers (15 females and 15 males in Iran and 26 English native speakers (13 females and 13 males in Canada, all college students with age range of 17-30, participated in this study. In order to collect a corpus of compliment responses, a researcher-made questionnaire in the form of a Discourse Completion Task was distributed among the participants. Using two-way ANOVAs, the findings indicated that there is a significant difference between Persian native speakers and Canadian English speakers (p .05.Considering the findings of the present study, materials developers and textbook writers can make more space in EFL textbooks for exercises about compliments (responses; and help in highlighting the significance of this aspect of pragmatic knowledge. Keywords: pragmatics, politeness, compliment, compliment response, EFL

  17. THE EFFECT OF NATIVE SPEAKER TEACHERS OF ENGLISH ON THE ATTITUDES AND ACHIEVEMENT OF LEARNERS

    Directory of Open Access Journals (Sweden)

    Dr. İsmet ŞAHİN

    2005-04-01

    Full Text Available Teachers are considered to be one of the most crucial elements affecting the success ofstudents learning a foreign language in non-English speaking countries.Native speakers of English were recruited as language teachers all over the non-English speaking world only because of their competency in the language for a long time. Butlater, it was admitted that being adept in a language doesn’t necessarily make anyone asuccessful language teacher. Skill in teaching is also very important in native speakers ofEnglish as language teachers.Attitude and motivation are proved to be the determinants of success in EFL learning.Native speaker teachers may also have an effect on the attitudes of learners as representativesof their culture and society. And they may be more advantageous in doing that because theyare also more competent in that culture compared to nonnative speakers.The study investigated the effect of native speaker teachers of English on the attitudesand success of EFL learners in Turkish setting.The results confirm that the learners who are exposed to native speaker teachers ofEnglish have more positive attitudes towards target language community and are moresuccessful in English lessons compared to those who are not exposed to any native speakerteachers of English. The results also verify that the attitude of EFL learners towards the targetcommunity correlates significantly with their success in language learning.

  18. Acoustic and tongue kinematic vowel space in speakers with and without dysarthria.

    Science.gov (United States)

    Lee, Jimin; Littlejohn, Meghan Anne; Simmons, Zachary

    2017-04-01

    The purpose is to investigate acoustic and tongue body kinematic vowel dispersion patterns and vowel space in speakers with and without dysarthria secondary to amyotrophic lateral sclerosis (ALS). Acoustic and tongue kinematic vowel spaces were examined at the same time sampling point using electromagnetic articulography in 11 speakers with dysarthria secondary to ALS and 11 speakers without dysarthria. Tongue kinematic data were collected from the tongue body sensor (∼25 mm posterior from the tongue apex). A number of acoustic and tongue body kinematic variables were tested. The result showed that the acoustic and tongue kinematic vowel dispersion patterns are different between the groups. Acoustic and tongue body kinematic vowel spaces are highly correlated; however, unlike acoustic vowel space, tongue body kinematic vowel space was not significantly different between the groups. Both acoustic and tongue kinematic vowel dispersion patterns are sensitive to the group difference, especially with high vowels. The tongue kinematic vowel space approach is too crude to differentiate the speakers with dysarthria secondary to ALS from speakers without dysarthria. To examine tongue range of motion in speakers with dysarthria, a more refined articulatory kinematic approach needs to be examined in the future.

  19. PENINGKATAN KUALITAS MELALUI IMPLEMENTASI FILOSOFI SIX SIGMA (Studi Kasus di sebuah Perusahaan Speaker

    Directory of Open Access Journals (Sweden)

    Jani Rahardjo

    2003-01-01

    Full Text Available The article describes the quality improvement program in a speaker company%2C through the implementation of six sigma philosophy; design a quality control system to support the result of the philosophy implementation%2C helped by the house of quality. The improvement of the quality level to 6ƒã is done by knowing the speaker¡¦s critical to quality (CTQ. The CTQ improved by minimized the highest defect. There are an analysis to six sigma indicator and the quality cost. The result is a quality improvement through the two type of speaker%2C 12¡¨ C-1230 PA ACR Pro New and 12¡¨ 30H120 SRW-38B ACR Pro New. Abstract in Bahasa Indonesia : Artikel ini membahas program peningkatan kualitas di sebuah perusahaan speaker melalui implementasi filosofi six sigma%2C perancangan perbaikan sistem pengendalian kualitas guna mendukung hasil implementasi%2C dengan bantuan rumah mutu. Peningkatan level kualitas ke arah 6ƒã dilakukan dengan mengetahui karakteritik kritis konsumen terhadap produk speaker%2C yang kemudian diperbaiki dengan meminimalkan kecacatan tertinggi. Analisa dilakukan terhadap indikator keberhasilan six sigma dan biaya kualitas. Perbandingan indikator keberhasilan menyatakan adanya peningkatan kualitas terhadap kedua tipe speaker yaitu 12¡¨ C-1230 PA ACR Pro New dan 12¡¨ 30H120 SRW-38B ACR Pro New. Six sigma%2C house of quality%2C critical to quality%2C quality cost.

  20. Bilingual children's social preferences hinge on accent.

    Science.gov (United States)

    DeJesus, Jasmine M; Hwang, Hyesung G; Dautel, Jocelyn B; Kinzler, Katherine D

    2017-12-01

    Past research finds that monolingual and bilingual children prefer native speakers to individuals who speak in unfamiliar foreign languages or accents. Do children in bilingual contexts socially distinguish among familiar languages and accents and, if so, how do their social preferences based on language and accent compare? The current experiments tested whether 5- to 7-year-olds in two bilingual contexts in the United States demonstrate social preferences among the languages and accents that are present in their social environments. We compared children's preferences based on language (i.e., English vs. their other native language) and their preferences based on accent (i.e., English with a native accent vs. English with a non-native [yet familiar] accent). In Experiment 1, children attending a French immersion school demonstrated no preference between English and French speakers but preferred American-accented English to French-accented English. In Experiment 2, bilingual Korean American children demonstrated no preference between English and Korean speakers but preferred American-accented English to Korean-accented English. Across studies, bilingual children's preferences based on accent (i.e., American-accented English over French- or Korean-accented English) were not related to their own language dominance. These results suggest that children from diverse linguistic backgrounds demonstrate social preferences for native-accented speakers. Implications for understanding the potential relation between social reasoning and language acquisition are discussed. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Proficiency in English sentence stress production by Cantonese speakers who speak English as a second language (ESL).

    Science.gov (United States)

    Ng, Manwa L; Chen, Yang

    2011-12-01

    The present study examined English sentence stress produced by native Cantonese speakers who were speaking English as a second language (ESL). Cantonese ESL speakers' proficiency in English stress production as perceived by English-speaking listeners was also studied. Acoustical parameters associated with sentence stress including fundamental frequency (F0), vowel duration, and intensity were measured from the English sentences produced by 40 Cantonese ESL speakers. Data were compared with those obtained from 40 native speakers of American English. The speech samples were also judged by eight native listeners who were native speakers of American English for placement, degree, and naturalness of stress. Results showed that Cantonese ESL speakers were able to use F0, vowel duration, and intensity to differentiate sentence stress patterns. Yet, both female and male Cantonese ESL speakers exhibited consistently higher F0 in stressed words than English speakers. Overall, Cantonese ESL speakers were found to be proficient in using duration and intensity to signal sentence stress, in a way comparable with English speakers. In addition, F0 and intensity were found to correlate closely with perceptual judgement and the degree of stress with the naturalness of stress.

  2. Neural decoding of attentional selection in multi-speaker environments without access to clean sources.

    Science.gov (United States)

    O'Sullivan, James; Chen, Zhuo; Herrero, Jose; McKhann, Guy M; Sheth, Sameer A; Mehta, Ashesh D; Mesgarani, Nima

    2017-10-01

    People who suffer from hearing impairments can find it difficult to follow a conversation in a multi-speaker environment. Current hearing aids can suppress background noise; however, there is little that can be done to help a user attend to a single conversation amongst many without knowing which speaker the user is attending to. Cognitively controlled hearing aids that use auditory attention decoding (AAD) methods are the next step in offering help. Translating the successes in AAD research to real-world applications poses a number of challenges, including the lack of access to the clean sound sources in the environment with which to compare with the neural signals. We propose a novel framework that combines single-channel speech separation algorithms with AAD. We present an end-to-end system that (1) receives a single audio channel containing a mixture of speakers that is heard by a listener along with the listener's neural signals, (2) automatically separates the individual speakers in the mixture, (3) determines the attended speaker, and (4) amplifies the attended speaker's voice to assist the listener. Using invasive electrophysiology recordings, we identified the regions of the auditory cortex that contribute to AAD. Given appropriate electrode locations, our system is able to decode the attention of subjects and amplify the attended speaker using only the mixed audio. Our quality assessment of the modified audio demonstrates a significant improvement in both subjective and objective speech quality measures. Our novel framework for AAD bridges the gap between the most recent advancements in speech processing technologies and speech prosthesis research and moves us closer to the development of cognitively controlled hearable devices for the hearing impaired.

  3. BEGINNING TAGALOG, A COURSE FOR SPEAKERS OF ENGLISH.

    Science.gov (United States)

    BOWEN, J. DONALD; AND OTHERS

    THE TWO MAJOR GOALS OF THIS COMPREHENSIVE COURSE ARE--FIRST, AN ORAL CONTROL OF TAGALOG AND SUFFICIENT MASTERY OF ITS STRUCTURE TO CONTINUE INDEPENDENT STUDY, AND SECOND, PRESENTATION OF UP-TO-DATE INFORMATION ABOUT THE SOCIAL CUSTOMS, STANDARDS, VALUES, AND ASPIRATIONS OF THE FILIPINO PEOPLE SO THAT THE LANGUAGE LEARNER MAY PARTICIPATE FULLY IN…

  4. K Ullas Karanth | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    Although the practice of such, much-needed conservation science offers exciting opportunities in India, it also faces serious challenges. These challenges can be scientific, technical or logistical, and often, systemic social ones. Using examples from his three decades of ecological research on India's national animal – the ...

  5. A R Venkatachalapathy | Speakers | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    He has published widely on the social, cultural and intellectual history of colonial Tamil Nadu. An accomplished writer in Tamil, Venkatachalapathy has authored and edited over 20 books in Tamil. His publications include The Province of the Book: Scholars, Scribes and Scribblers in Colonial Tamil Nadu; In Those Days ...

  6. Phoneme Error Pattern by Heritage Speakers of Spanish on an English Word Recognition Test.

    Science.gov (United States)

    Shi, Lu-Feng

    2017-04-01

    Heritage speakers acquire their native language from home use in their early childhood. As the native language is typically a minority language in the society, these individuals receive their formal education in the majority language and eventually develop greater competency with the majority than their native language. To date, there have not been specific research attempts to understand word recognition by heritage speakers. It is not clear if and to what degree we may infer from evidence based on bilingual listeners in general. This preliminary study investigated how heritage speakers of Spanish perform on an English word recognition test and analyzed their phoneme errors. A prospective, cross-sectional, observational design was employed. Twelve normal-hearing adult Spanish heritage speakers (four men, eight women, 20-38 yr old) participated in the study. Their language background was obtained through the Language Experience and Proficiency Questionnaire. Nine English monolingual listeners (three men, six women, 20-41 yr old) were also included for comparison purposes. Listeners were presented with 200 Northwestern University Auditory Test No. 6 words in quiet. They repeated each word orally and in writing. Their responses were scored by word, word-initial consonant, vowel, and word-final consonant. Performance was compared between groups with Student's t test or analysis of variance. Group-specific error patterns were primarily descriptive, but intergroup comparisons were made using 95% or 99% confidence intervals for proportional data. The two groups of listeners yielded comparable scores when their responses were examined by word, vowel, and final consonant. However, heritage speakers of Spanish misidentified significantly more word-initial consonants and had significantly more difficulty with initial /p, b, h/ than their monolingual peers. The two groups yielded similar patterns for vowel and word-final consonants, but heritage speakers made significantly

  7. The archival knowledge applied in the description of phonetic patterns of speakers

    Directory of Open Access Journals (Sweden)

    Tauana dos Santos Barbosa

    2017-04-01

    Full Text Available Introduction: Speaker identification is the activity that is able to determine if the same speaker produced two voices. The phonetician performs the exam by comparing the extracted phonetic patterns of speakers, which are elements that determine the speech traits of an individual. Objective: This study shows how the archival knowledge can be used to organize the phonetic patterns provided by phonetician, helping to build speaker identification systems. Methodology: The research is characterized as descriptive and qualitative and quantitative, with the interpretation, description, evaluation and indexing of phonetic patterns, besides the application of statistical concepts. Findings: The results of the investigation led to the proposal of a model description of the phonetic patterns, organized in a matrix form, with relevant information to the phonetician to be used in speaker identification systems. Conclusion: It is concluded that the practice of interdisciplinary of the two areas, Archivology and Phonetics, can make an important contribution in the results of organizing data and phonetic information, demonstrating the broad scope of archival knowledge on current problem-solving.

  8. Musical Sophistication and the Effect of Complexity on Auditory Discrimination in Finnish Speakers.

    Science.gov (United States)

    Dawson, Caitlin; Aalto, Daniel; Šimko, Juraj; Vainio, Martti; Tervaniemi, Mari

    2017-01-01

    Musical experiences and native language are both known to affect auditory processing. The present work aims to disentangle the influences of native language phonology and musicality on behavioral and subcortical sound feature processing in a population of musically diverse Finnish speakers as well as to investigate the specificity of enhancement from musical training. Finnish speakers are highly sensitive to duration cues since in Finnish, vowel and consonant duration determine word meaning. Using a correlational approach with a set of behavioral sound feature discrimination tasks, brainstem recordings, and a musical sophistication questionnaire, we find no evidence for an association between musical sophistication and more precise duration processing in Finnish speakers either in the auditory brainstem response or in behavioral tasks, but they do show an enhanced pitch discrimination compared to Finnish speakers with less musical experience and show greater duration modulation in a complex task. These results are consistent with a ceiling effect set for certain sound features which corresponds to the phonology of the native language, leaving an opportunity for music experience-based enhancement of sound features not explicitly encoded in the language (such as pitch, which is not explicitly encoded in Finnish). Finally, the pattern of duration modulation in more musically sophisticated Finnish speakers suggests integrated feature processing for greater efficiency in a real world musical situation. These results have implications for research into the specificity of plasticity in the auditory system as well as to the effects of interaction of specific language features with musical experiences.

  9. Indexical and linguistic processing by 12-month-olds: Discrimination of speaker, accent and vowel differences.

    Directory of Open Access Journals (Sweden)

    Karen E Mulak

    Full Text Available Infants preferentially discriminate between speech tokens that cross native category boundaries prior to acquiring a large receptive vocabulary, implying a major role for unsupervised distributional learning strategies in phoneme acquisition in the first year of life. Multiple sources of between-speaker variability contribute to children's language input and thus complicate the problem of distributional learning. Adults resolve this type of indexical variability by adjusting their speech processing for individual speakers. For infants to handle indexical variation in the same way, they must be sensitive to both linguistic and indexical cues. To assess infants' sensitivity to and relative weighting of indexical and linguistic cues, we familiarized 12-month-old infants to tokens of a vowel produced by one speaker, and tested their listening preference to trials containing a vowel category change produced by the same speaker (linguistic information, and the same vowel category produced by another speaker of the same or a different accent (indexical information. Infants noticed linguistic and indexical differences, suggesting that both are salient in infant speech processing. Future research should explore how infants weight these cues in a distributional learning context that contains both phonetic and indexical variation.

  10. Musical Sophistication and the Effect of Complexity on Auditory Discrimination in Finnish Speakers

    Science.gov (United States)

    Dawson, Caitlin; Aalto, Daniel; Šimko, Juraj; Vainio, Martti; Tervaniemi, Mari

    2017-01-01

    Musical experiences and native language are both known to affect auditory processing. The present work aims to disentangle the influences of native language phonology and musicality on behavioral and subcortical sound feature processing in a population of musically diverse Finnish speakers as well as to investigate the specificity of enhancement from musical training. Finnish speakers are highly sensitive to duration cues since in Finnish, vowel and consonant duration determine word meaning. Using a correlational approach with a set of behavioral sound feature discrimination tasks, brainstem recordings, and a musical sophistication questionnaire, we find no evidence for an association between musical sophistication and more precise duration processing in Finnish speakers either in the auditory brainstem response or in behavioral tasks, but they do show an enhanced pitch discrimination compared to Finnish speakers with less musical experience and show greater duration modulation in a complex task. These results are consistent with a ceiling effect set for certain sound features which corresponds to the phonology of the native language, leaving an opportunity for music experience-based enhancement of sound features not explicitly encoded in the language (such as pitch, which is not explicitly encoded in Finnish). Finally, the pattern of duration modulation in more musically sophisticated Finnish speakers suggests integrated feature processing for greater efficiency in a real world musical situation. These results have implications for research into the specificity of plasticity in the auditory system as well as to the effects of interaction of specific language features with musical experiences. PMID:28450829

  11. On Sociophonetic Competence: Phrase-Final Vowel Devoicing in Native and Advanced L2 Speakers of French

    Science.gov (United States)

    Dalola, Amanda; Bullock, Barbara E.

    2017-01-01

    The data from this study investigate phrase-final vowel devoicing in Metropolitan French among L1 and L2 speakers, in terms of number of times a speaker devoices a phrase-final high vowel and percentage of the vowel that is devoiced. The goal is to assess whether experienced L2 speakers use style-based variation in response to the same factors as…

  12. Language ideology and the native speaker ideal : Canadian and Norwegian attitudes toward ESL/EFL pedagogical models

    OpenAIRE

    Dykeman, Rachel Elizabeth

    2016-01-01

    Previous sociolinguistic studies done in Norway have explored attitudes toward native speaker and Norwegian accented varieties of English. This study adds a new angle by comparing the attitudes of first language speakers of English from Canada and second language speakers of English from Norway toward SC (Standard Canadian) and NE (Norwegian accented English) accents. An online survey was undertaken by 107 English teachers, of which 50 self-identified as Norwegian and 57 as Canadian teachers ...

  13. Towards PLDA-RBM based speaker recognition in mobile environment: Designing stacked/deep PLDA-RBM systems

    DEFF Research Database (Denmark)

    Nautsch, Andreas; Hao, Hong; Stafylakis, Themos

    2016-01-01

    The vast majority of text-independent speaker recognition systems rely on intermediate-sized vectors (i-vectors), which are compared by probabilistic linear discriminant analysis (PLDA). This paper proposes a PLDA-alike approach with restricted Boltzmann machines for i-vector based speaker...... for mobile speaker recognition with limited amounts of training data. The experiments show that the proposed system outperforms the baseline i-vector/PLDA approach by relative gains of 31% on female and 9% on male speakers in terms of half total error rate....

  14. Speaker verification system using acoustic data and non-acoustic data

    Science.gov (United States)

    Gable, Todd J [Walnut Creek, CA; Ng, Lawrence C [Danville, CA; Holzrichter, John F [Berkeley, CA; Burnett, Greg C [Livermore, CA

    2006-03-21

    A method and system for speech characterization. One embodiment includes a method for speaker verification which includes collecting data from a speaker, wherein the data comprises acoustic data and non-acoustic data. The data is used to generate a template that includes a first set of "template" parameters. The method further includes receiving a real-time identity claim from a claimant, and using acoustic data and non-acoustic data from the identity claim to generate a second set of parameters. The method further includes comparing the first set of parameters to the set of parameters to determine whether the claimant is the speaker. The first set of parameters and the second set of parameters include at least one purely non-acoustic parameter, including a non-acoustic glottal shape parameter derived from averaging multiple glottal cycle waveforms.

  15. A Posterior Union Model with Applications to Robust Speech and Speaker Recognition

    Directory of Open Access Journals (Sweden)

    Lin Jie

    2006-01-01

    Full Text Available This paper investigates speech and speaker recognition involving partial feature corruption, assuming unknown, time-varying noise characteristics. The probabilistic union model is extended from a conditional-probability formulation to a posterior-probability formulation as an improved solution to the problem. The new formulation allows the order of the model to be optimized for every single frame, thereby enhancing the capability of the model for dealing with nonstationary noise corruption. The new formulation also allows the model to be readily incorporated into a Gaussian mixture model (GMM for speaker recognition. Experiments have been conducted on two databases: TIDIGITS and SPIDRE, for speech recognition and speaker identification. Both databases are subject to unknown, time-varying band-selective corruption. The results have demonstrated the improved robustness for the new model.

  16. Language control in different contexts: the behavioural ecology of bilingual speakers

    Directory of Open Access Journals (Sweden)

    David William Green

    2011-05-01

    Full Text Available This paper proposes that different experimental contexts (single or dual language contexts permit different neural loci at which words in the target language can be selected. However, in order to develop a fuller understanding of the neural circuit mediating language control we need to consider the community context in which bilingual speakers typically use their two languages (the behavioural ecology of bilingual speakers. The contrast between speakers from code-switching and non-code switching communities offers a way to increase our understanding of the cortical, subcortical and, in particular, cerebellar structures involved in language control. It will also help us identify the non-verbal behavioural correlates associated with these control processes.

  17. The effect of speakers' sex on voice onset time in Mandarin stops.

    Science.gov (United States)

    Li, Fangfang

    2013-02-01

    The goal of the present study is to examine the effect of speakers' gender on voice onset time in Mandarin speakers' stop productions. Word-initial lingual stops were elicited from 10 male and 10 female Mandarin speakers using a word-repetition task. The results revealed differentiated voice onset time (VOT) patterns between the two genders for all four lingual stops on raw VOT values. After factoring out speech rate variation, gender-related differences remained for voiced stops only with females' VOTs being shorter than males. The results, together with previous findings from other languages, suggest a sociolinguistic/stylistic account on the relation between gender and VOT that vary in a language-specific manner.

  18. A cross-linguistic PET study of tone perception in Mandarin Chinese and English speakers.

    Science.gov (United States)

    Klein, D; Zatorre, R J; Milner, B; Zhao, V

    2001-04-01

    PET was used in a cross-linguistic study to determine whether neural mechanisms subserving pitch perception differ as a function of linguistic relevance. We compared tone perception in 12 native Mandarin speakers, who use tonal patterns to distinguish lexical meaning, with that of 12 native speakers of a nontone language, English. Subjects were scanned under two conditions: a silent resting baseline and a tonal task involving discrimination of pitch patterns in Mandarin words. Both groups showed common regions of CBF increase, but only Mandarin speakers showed additional activation in frontal, parietal, and parieto-occipital regions of the left hemisphere; this latter finding indicates that language experience may influence brain circuitry in the processing of auditory cues. In contrast, only the English group showed activity in the right inferior frontal cortex, consistent with a right-hemispheric role in pitch perception. Copyright 2001 Academic Press.

  19. Infant sensitivity to speaker and language in learning a second label.

    Science.gov (United States)

    Bhagwat, Jui; Casasola, Marianella

    2014-02-01

    Two experiments examined when monolingual, English-learning 19-month-old infants learn a second object label. Two experimenters sat together. One labeled a novel object with one novel label, whereas the other labeled the same object with a different label in either the same or a different language. Infants were tested on their comprehension of each label immediately following its presentation. Infants mapped the first label at above chance levels, but they did so with the second label only when requested by the speaker who provided it (Experiment 1) or when the second experimenter labeled the object in a different language (Experiment 2). These results show that 19-month-olds learn second object labels but do not readily generalize them across speakers of the same language. The results highlight how speaker and language spoken guide infants' acceptance of second labels, supporting sociopragmatic views of word learning. Copyright © 2013 Elsevier Inc. All rights reserved.

  20. A study on (K, Na) NbO3 based multilayer piezoelectric ceramics micro speaker

    Science.gov (United States)

    Gao, Renlong; Chu, Xiangcheng; Huan, Yu; Sun, Yiming; Liu, Jiayi; Wang, Xiaohui; Li, Longtu

    2014-10-01

    A flat panel micro speaker was fabricated from (K, Na) NbO3 (KNN)-based multilayer piezoelectric ceramics by a tape casting and cofiring process using Ag-Pd alloys as an inner electrode. The interface between ceramic and electrode was investigated by scanning electron microscope (SEM) and transmission electron microscope (TEM). The acoustic response was characterized by a standard audio test system. We found that the micro speaker with dimensions of 23 × 27 × 0.6 mm3, using three layers of 30 μm thickness KNN-based ceramic, has a high average sound pressure level (SPL) of 87 dB, between 100 Hz-20 kHz under five voltage. This result was even better than that of lead zirconate titanate (PZT)-based ceramics under the same conditions. The experimental results show that the KNN-based multilayer ceramics could be used as lead free piezoelectric micro speakers.

  1. Perceptual and acoustic analysis of lexical stress in Greek speakers with dysarthria.

    Science.gov (United States)

    Papakyritsis, Ioannis; Müller, Nicole

    2014-01-01

    The study reported in this paper investigated the abilities of Greek speakers with dysarthria to signal lexical stress at the single word level. Three speakers with dysarthria and two unimpaired control participants were recorded completing a repetition task of a list of words consisting of minimal pairs of Greek disyllabic words contrasted by lexical stress location only. Fourteen listeners were asked to determine the attempted stress location for each word pair. Acoustic analyses of duration and intensity ratios, both within and across words, were undertaken to identify possible acoustic correlates of the listeners' judgments concerning stress location. Acoustic and perceptual data indicate that while each participant with dysarthria in this study had some difficulty in signaling stress unambiguously, the pattern of difficulty was different for each speaker. Further, it was found that the relationship between the listeners' judgments of stress location and the acoustic data was not conclusive.

  2. Text Dependent Speaker Verification Using Un-supervised HMM-UBM and Temporal GMM-UBM

    DEFF Research Database (Denmark)

    Sarkar, Achintya Kumar; Tan, Zheng-Hua

    2016-01-01

    In this paper, we investigate the Hidden Markov Model (HMM) and the temporal Gaussian Mixture Model (GMM) systems based on the Universal Background Model (UBM) concept to capture temporal information of speech for Text Dependent (TD) Speaker Verification (SV). In TD-SV, target speakers...... enrollment. However, the temporal information is not considered in the classical GMM-UBM based TD-SV system. Moreover, no transcription knowledge of the speech is required in the HMM-UBM and temporal GMM-UBM based systems. We also study the fusion of the HMM-UBM, the temporal GMM-UBM and the classical GMM......-UBM systems in SV. We show that the HMM-UBM system yields better performance than the other systems in most cases. Further, fusion of the systems improve the overall speaker verification performance. The results are shown in the different tasks of the RedDots challenge 2016 database....

  3. Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker verification

    DEFF Research Database (Denmark)

    Sarkar, Achintya Kumar; Tan, Zheng-Hua

    2018-01-01

    is compared to conventional text-independent background model based TD-SV systems using either Gaussian mixture model (GMM)-universal background model (UBM) or Hidden Markov model (HMM)-UBM or i-vector paradigms. In addition, we consider two approaches to build PBMs: speaker-independent and speaker......In this paper, we propose pass-phrase dependent background models (PBMs) for text-dependent (TD) speaker verification (SV) to integrate the pass-phrase identification process into the conventional TD-SV system, where a PBM is derived from a text-independent background model through adaptation using...... and the selected PBM is then used for the log likelihood ratio (LLR) calculation with respect to the claimant model. The proposed method incorporates the pass-phrase identification step in the LLR calculation, which is not considered in conventional standalone TD-SV systems. The performance of the proposed method...

  4. Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data

    Directory of Open Access Journals (Sweden)

    J. Silovsky

    2009-09-01

    Full Text Available This paper presents a comparison of three different speaker recognition methods deployed in a broadcast news processing system. We focus on how the generative and discriminative nature of these methods affects the speaker recognition framework and we also deal with intersession variability compensation techniques in more detail, which are of great interest in broadcast processing domain. Performed experiments are specific particularly for the very limited amount of data used for both speaker enrollment (typically ranging from 30 to 60 seconds and recognition (typically ranging from 5 to 15 seconds. Our results show that the system based on Gaussian Mixture Models (GMMs outperforms both systems based on Support Vector Machines (SVMs but its drawback is higher computational cost.

  5. Continuing Medical Education Speakers with High Evaluation Scores Use more Image-based Slides.

    Science.gov (United States)

    Ferguson, Ian; Phillips, Andrew W; Lin, Michelle

    2017-01-01

    Although continuing medical education (CME) presentations are common across health professions, it is unknown whether slide design is independently associated with audience evaluations of the speaker. Based on the conceptual framework of Mayer's theory of multimedia learning, this study aimed to determine whether image use and text density in presentation slides are associated with overall speaker evaluations. This retrospective analysis of six sequential CME conferences (two annual emergency medicine conferences over a three-year period) used a mixed linear regression model to assess whether post-conference speaker evaluations were associated with image fraction (percentage of image-based slides per presentation) and text density (number of words per slide). A total of 105 unique lectures were given by 49 faculty members, and 1,222 evaluations (70.1% response rate) were available for analysis. On average, 47.4% (SD=25.36) of slides had at least one educationally-relevant image (image fraction). Image fraction significantly predicted overall higher evaluation scores [F(1, 100.676)=6.158, p=0.015] in the mixed linear regression model. The mean (SD) text density was 25.61 (8.14) words/slide but was not a significant predictor [F(1, 86.293)=0.55, p=0.815]. Of note, the individual speaker [χ 2 (1)=2.952, p=0.003] and speaker seniority [F(3, 59.713)=4.083, p=0.011] significantly predicted higher scores. This is the first published study to date assessing the linkage between slide design and CME speaker evaluations by an audience of practicing clinicians. The incorporation of images was associated with higher evaluation scores, in alignment with Mayer's theory of multimedia learning. Contrary to this theory, however, text density showed no significant association, suggesting that these scores may be multifactorial. Professional development efforts should focus on teaching best practices in both slide design and presentation skills.

  6. Continuing Medical Education Speakers with High Evaluation Scores Use more Image-based Slides

    Directory of Open Access Journals (Sweden)

    Ferguson, Ian

    2017-01-01

    Full Text Available Although continuing medical education (CME presentations are common across health professions, it is unknown whether slide design is independently associated with audience evaluations of the speaker. Based on the conceptual framework of Mayer’s theory of multimedia learning, this study aimed to determine whether image use and text density in presentation slides are associated with overall speaker evaluations. This retrospective analysis of six sequential CME conferences (two annual emergency medicine conferences over a three-year period used a mixed linear regression model to assess whether postconference speaker evaluations were associated with image fraction (percentage of image-based slides per presentation and text density (number of words per slide. A total of 105 unique lectures were given by 49 faculty members, and 1,222 evaluations (70.1% response rate were available for analysis. On average, 47.4% (SD=25.36 of slides had at least one educationally-relevant image (image fraction. Image fraction significantly predicted overall higher evaluation scores [F(1, 100.676=6.158, p=0.015] in the mixed linear regression model. The mean (SD text density was 25.61 (8.14 words/slide but was not a significant predictor [F(1, 86.293=0.55, p=0.815]. Of note, the individual speaker [χ2 (1=2.952, p=0.003] and speaker seniority [F(3, 59.713=4.083, p=0.011] significantly predicted higher scores. This is the first published study to date assessing the linkage between slide design and CME speaker evaluations by an audience of practicing clinicians. The incorporation of images was associated with higher evaluation scores, in alignment with Mayer’s theory of multimedia learning. Contrary to this theory, however, text density showed no significant association, suggesting that these scores may be multifactorial. Professional development efforts should focus on teaching best practices in both slide design and presentation skills.

  7. Neural decoding of attentional selection in multi-speaker environments without access to clean sources

    Science.gov (United States)

    O’Sullivan, James; Chen, Zhuo; Herrero, Jose; McKhann, Guy M.; Sheth, Sameer A.; Mehta, Ashesh D.; Mesgarani, Nima

    2017-10-01

    Objective. People who suffer from hearing impairments can find it difficult to follow a conversation in a multi-speaker environment. Current hearing aids can suppress background noise; however, there is little that can be done to help a user attend to a single conversation amongst many without knowing which speaker the user is attending to. Cognitively controlled hearing aids that use auditory attention decoding (AAD) methods are the next step in offering help. Translating the successes in AAD research to real-world applications poses a number of challenges, including the lack of access to the clean sound sources in the environment with which to compare with the neural signals. We propose a novel framework that combines single-channel speech separation algorithms with AAD. Approach. We present an end-to-end system that (1) receives a single audio channel containing a mixture of speakers that is heard by a listener along with the listener’s neural signals, (2) automatically separates the individual speakers in the mixture, (3) determines the attended speaker, and (4) amplifies the attended speaker’s voice to assist the listener. Main results. Using invasive electrophysiology recordings, we identified the regions of the auditory cortex that contribute to AAD. Given appropriate electrode locations, our system is able to decode the attention of subjects and amplify the attended speaker using only the mixed audio. Our quality assessment of the modified audio demonstrates a significant improvement in both subjective and objective speech quality measures. Significance. Our novel framework for AAD bridges the gap between the most recent advancements in speech processing technologies and speech prosthesis research and moves us closer to the development of cognitively controlled hearable devices for the hearing impaired.

  8. Speaker Recognition Using Real vs Synthetic Parallel Data for DNN Channel Compensation

    Science.gov (United States)

    2016-09-08

    effort speech does not appear to adversely affect the other microphone chan- nels and there are at most 3 microphone sessions per a speaker in Mixer 6...256 and a learning rate of 0.1. In most cases SGD training is completed in fewer than 20 epochs. The DNN DNN Training AVG POOL None (baseline) 11.5...talking speaker identification,” in EURASIP Journal on Audio, Speech, and Music Processing, 2015. [2] M. Karafiat, F. Grezl, L. Burget, I. Szoke, and J

  9. Gender parity trends for invited speakers at four prominent virology conference series.

    Science.gov (United States)

    Kalejta, Robert F; Palmenberg, Ann C

    2017-06-07

    Scientific conferences are most beneficial to participants when they showcase significant new experimental developments, accurately summarize the current state of the field, and provide strong opportunities for collaborative networking. A top-notch slate of invited speakers, assembled by conference organizers or committees, is key to achieving these goals. The perceived underrepresentation of female speakers at prominent scientific meetings is currently a popular topic for discussion, but one that often lacks supportive data. We compiled the full rosters of invited speakers over the last 35 years for four prominent international virology conferences, the American Society for Virology Annual Meeting (ASV), the International Herpesvirus Workshop (IHW), the Positive-Strand RNA Virus Symposium (PSR), and the Gordon Research Conference on Viruses & Cells (GRC). The rosters were cross-indexed by unique names, gender, year, and repeat invitations. When plotted as gender-dependent trends over time, all four conferences showed a clear proclivity for male-dominated invited speaker lists. Encouragingly, shifts toward parity are emerging within all units, but at different rates. Not surprisingly, both selection of a larger percentage of first time participants and the presence of a woman on the speaker selection committee correlated with improved parity. Session chair information was also collected for the IHW and GRC. These visible positions also displayed a strong male dominance over time that is eroding slowly. We offer our personal interpretation of these data to aid future organizers achieve improved equity among the limited number of available positions for session moderators and invited speakers. IMPORTANCE Politicians and media members have a tendency to cite anecdotes as conclusions without any supporting data. This happens so frequently now, that a name for it has emerged: fake news. Good science proceeds otherwise. The under representation of women as invited

  10. Voice Patterns in Adult English Speakers with Autism Spectrum Disorder

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Lambrechts, Anna; Yarrow, Kielan

    Individuals with Autism Spectrum Disorder (ASD) often display atypical modulation of speech described as awkward, monotone, or sing-songy (Shriberg et al., 2001). These patterns are a robust signal of social communication deficit (Paul et al., 2005) and contribute to reaching a diagnosis of ASD...... to be re-trained. Conclusions: The current data suggest than ASD adults produce highly regular patterns of speech (as measured by pitch and pause distribution). Importantly this provides a quantifiable measurement to capture some of the clinical reports which contribute to reaching a diagnosis of autism...

  11. Objective eye-gaze behaviour during face-to-face communication with proficient alaryngeal speakers: a preliminary study.

    Science.gov (United States)

    Evitts, Paul; Gallop, Robert

    2011-01-01

    There is a large body of research demonstrating the impact of visual information on speaker intelligibility in both normal and disordered speaker populations. However, there is minimal information on which specific visual features listeners find salient during conversational discourse. To investigate listeners' eye-gaze behaviour during face-to-face conversation with normal, laryngeal and proficient alaryngeal speakers. Sixty participants individually participated in a 10-min conversation with one of four speakers (typical laryngeal, tracheoesophageal, oesophageal, electrolaryngeal; 15 participants randomly assigned to one mode of speech). All speakers were > 85% intelligible and were judged to be 'proficient' by two certified speech-language pathologists. Participants were fitted with a head-mounted eye-gaze tracking device (Mobile Eye, ASL) that calculated the region of interest and mean duration of eye-gaze. Self-reported gaze behaviour was also obtained following the conversation using a 10 cm visual analogue scale. While listening, participants viewed the lower facial region of the oesophageal speaker more than the normal or tracheoesophageal speaker. Results of non-hierarchical cluster analyses showed that while listening, the pattern of eye-gaze was predominantly directed at the lower face of the oesophageal and electrolaryngeal speaker and more evenly dispersed among the background, lower face, and eyes of the normal and tracheoesophageal speakers. Finally, results show a low correlation between self-reported eye-gaze behaviour and objective regions of interest data. Overall, results suggest similar eye-gaze behaviour when healthy controls converse with normal and tracheoesophageal speakers and that participants had significantly different eye-gaze patterns when conversing with an oesophageal speaker. Results are discussed in terms of existing eye-gaze data and its potential implications on auditory-visual speech perception. © 2011 Royal College of Speech

  12. Language-Based Social Preferences among Children in South Africa

    Science.gov (United States)

    Kinzler, Katherine D.; Shutts, Kristin; Spelke, Elizabeth S.

    2012-01-01

    Monolingual English-speaking children in the United States express social preferences for speakers of their native language with a native accent. Here we explore the nature of children's language-based social preferences through research with children in South Africa, a multilingual nation. Like children in the United States, Xhosa South African…

  13. A sociolinguistic interpretation of the social meanings of kinship ...

    African Journals Online (AJOL)

    This article provides an interpretive analysis of Shona native speakers' use of kinship terms of address as forms of communicative resources to invoke social meanings in non-kin relations. Two types of data were used for the analysis of social meanings: (a) field notes from participant observations taken of naturally ...

  14. Effects of Peer Tutoring on Young Children's Social Interactions

    Science.gov (United States)

    Xu, Yaoying; Gelfer, Jeffrey I.; Sileo, Nancy; Filler, John; Perkins, Peggy G.

    2008-01-01

    This study examined the effects of peer tutoring on children's social interactions and compared social interaction behaviors between children who are English-language learners (ELL) and children who are primary English speakers (PES). Single-subject withdrawal design (ABA) was applied in this study and classwide peer tutoring was used as the…

  15. Social and Linguistic Cues Facilitate Children's Register Comprehension

    Science.gov (United States)

    Wagner, Laura; Vega-Mendoza, Mariana; Van Horn, Suzanne

    2014-01-01

    Speakers must command different linguistic registers to index various social-discourse elements, including the identity of the addressee. Previous work found that English-learning children could link registers to appropriate addressees by 5 years. Two experiments found that better cues to the linguistic form or to the social meaning of register…

  16. Selective social learning in infancy: looking for mechanisms.

    Science.gov (United States)

    Crivello, Cristina; Phillips, Sara; Poulin-Dubois, Diane

    2017-08-30

    Although there is mounting evidence that selective social learning begins in infancy, the psychological mechanisms underlying this ability are currently a controversial issue. The purpose of this study is to investigate whether theory of mind abilities and statistical learning skills are related to infants' selective social learning. Seventy-seven 18-month-olds were first exposed to a reliable or an unreliable speaker and then completed a word learning task, two theory of mind tasks, and a statistical learning task. If domain-general abilities are linked to selective social learning, then infants who demonstrate superior performance on the statistical learning task should perform better on the selective learning task, that is, should be less likely to learn words from an unreliable speaker. Alternatively, if domain-specific abilities are involved, then superior performance on theory of mind tasks should be related to selective learning performance. Findings revealed that, as expected, infants were more likely to learn a novel word from a reliable speaker. Importantly, infants who passed a theory of mind task assessing knowledge attribution were significantly less likely to learn a novel word from an unreliable speaker compared to infants who failed this task. No such effect was observed for the other tasks. These results suggest that infants who possess superior social-cognitive abilities are more apt to reject an unreliable speaker as informant. A video abstract of this article can be viewed at: https://youtu.be/zuuCniHYzqo. © 2017 John Wiley & Sons Ltd.

  17. Do Children with Autism Use the Speaker's Direction of Gaze Strategy to Crack the Code of Language?

    Science.gov (United States)

    Baron-Cohen, Simon; And Others

    1997-01-01

    Two studies of toddlers and children with autism, mentally handicapped children, and normal toddlers examined whether autistic toddlers used Speaker's Direction of Gaze (SDG) strategy or less powerful Listener's Direction of Gaze (LDG) strategy to learn a word for a novel object. Results suggest autistic toddlers are insensitive to speaker's gaze…

  18. Demographics of Adult Heritage Language Speakers in the United States: Differences by Region and Language and Their Implications

    Science.gov (United States)

    Nagano, Tomonori

    2015-01-01

    Heritage language (HL) speakers have received scholarly attention in recent years as an interdisciplinary research theme, but relatively less attention has been paid to their demographics. Existing studies of HL speakers' demographics often focus on young children in areas of high immigrant concentration (i.e., California, Florida, and New York);…

  19. Phraseology and Frequency of Occurrence on the Web: Native Speakers' Perceptions of Google-Informed Second Language Writing

    Science.gov (United States)

    Geluso, Joe

    2013-01-01

    Usage-based theories of language learning suggest that native speakers of a language are acutely aware of formulaic language due in large part to frequency effects. Corpora and data-driven learning can offer useful insights into frequent patterns of naturally occurring language to second/foreign language learners who, unlike native speakers, are…

  20. The Role of the Colloquial Varieties in the Acquisition of the Standard Variety: The Case of Arabic Heritage Speakers

    Science.gov (United States)

    Albirini, Abdulkafi

    2014-01-01

    The study investigated the acquisition of Modern Standard Arabic (MSA) by second language (L2) learners and by heritage speakers of the colloquial varieties of Arabic. The study focused on three questions: (1) whether heritage speakers who enroll in college-level elementary MSA classes have an advantage over their L2 counterparts, (2) whether any…

  1. Sensitivity to Speaker Control in the Online Comprehension of Conditional Tips and Promises: An Eye-Tracking Study

    Science.gov (United States)

    Stewart, Andrew J.; Haigh, Matthew; Ferguson, Heather J.

    2013-01-01

    Statements of the form if… then… can be used to communicate conditional speech acts such as tips and promises. Conditional promises require the speaker to have perceived control over the outcome event, whereas conditional tips do not. In an eye-tracking study, we examined whether readers are sensitive to information about perceived speaker control…

  2. The influence of instrumentality and name-relation to a noun on verb comprehension in Dutch aphasic speakers

    NARCIS (Netherlands)

    Jonkers, R; Bastiaanse, R

    Background: Different studies have examined the effect of verb type on comprehension, generally in non-fluent aphasic speakers. The current study includes the data of fluent aphasic speakers as well and considers the effect of conceptual and phonological factors on verb comprehension. Aims: The role

  3. "A miña variedade é defectuosa": a lexitimidade social das neofalas

    Directory of Open Access Journals (Sweden)

    Bernadette O'Rourke

    2013-01-01

    Full Text Available Although Galician survived several centuries of language contact with Spanish, a combination of sociopolitical and socioeconomic factors in the latter half of the twentieth century has accelerated the process of language shift. Language policy interventions since the 1980s have, however, attempted to curb this trend in an effort to maintain the language in areas where it continues to be spoken and to revive it elsewhere. While there has been a decline in the number of traditional native speakers, there has been a revitalization of the language amongst certain younger, urban, middleclass and traditionally Spanish-speaking sectors of Galician society. These neofalantes, or "neo-speakers" of Galician, frequently find themselves negotiating their right for linguistic space amongst native speakers of Spanish and Galician. In this paper we explore this linguistic space through an analysis of contemporary discourses both about neo-speakers and by neo-speakers of Galician. These discourses were generated from three discussion groups consisting of native and neospeakers of Galician. In our analysis of the data, we look at the role of the new speaker as a legitimate and legitimized speaker (Bourdieu 1982. The study concludes that the sociolinguistic space occupied by new ways of speaking and new speakers requires greater social recognition.

  4. Facework among L2 Speakers: A Close Look at Intercultural Communication

    Science.gov (United States)

    Jwa, Soomin

    2017-01-01

    Although significant attention has been devoted to the notion of facework and its functions, facework among L2 speakers, whose cultural backgrounds and language proficiencies vary, has remained unexplored. The present study attempts to explore situations of intercultural communication in which facework is used as a way to remedy moments of…

  5. The "Tse Tsa Watle" Speaker Series: An Example of Ensemble Leadership and Generative Adult Learning

    Science.gov (United States)

    McKendry, Virginia

    2017-01-01

    This chapter examines an Indigenous speaker series formed to foster intercultural partnerships at a Canadian university. Using ensemble leadership and generative learning theories to make sense of the project, the author argues that ensemble leadership is key to designing the generative learning adult learners need in an era of ambiguity.

  6. Prevalence of vocal fry in young adult male American English speakers.

    Science.gov (United States)

    Abdelli-Beruh, Nassima B; Wolk, Lesley; Slavin, Dianne

    2014-03-01

    The purpose of this study was to assess possible gender differences in the prevalence of vocal fry in the voices of young male college students. Results were compared with previously published findings derived from a matched sample of female speakers. Thirty-four male college students, native American English speakers, produced speech samples in two speaking conditions: (1) sustained isolated vowel /a/ and (2) reading task. Data analyses included perceptual evaluations by two licensed speech-language pathologists. Results showed that vocal fry was perceived significantly more frequently in sentences than in isolated vowel productions. When vocal fry occurred in sentences, it was detected significantly more often in sentence-final position than in initial- and/or mid-sentence position. Furthermore, the prevalence of vocal fry in sentences was significantly lower for male speakers than has previously been reported for female speakers. Possible physiological and sociolinguistic explanations are discussed. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  7. A Kaleidoscope of Models and Strategies for Teaching English to Speakers of Other Languages

    Science.gov (United States)

    Norland, Deborah; Pruett-Said, Terry

    2006-01-01

    Written by teachers for teachers, "A Kaleidoscope of Models and Strategies for Teaching English to Speakers of Other Languages," is a practical introduction to models and strategies employed in the teaching of English language learners. Each chapter discusses several models and/or strategies by focusing on particular methods and gives the…

  8. A comparison of three speaker-intrinsic vowel formant frequency normalization algorithms for sociophonetics

    DEFF Research Database (Denmark)

    Fabricius, Anne; Watt, Dominic; Johnson, Daniel Ezra

    2009-01-01

    from RP and Aberdeen English (northeast Scotland). We conclude that, for the data examined here, the S-centroid W&F procedures performs at least as well as the two most recognized speaker-intrinsic, vowel-extrinsic, formant-intrinsic normalization methods, Lobanov's (1971) z-score procedure and Nearey...

  9. Language Variation and Score Variation in the Testing of English Language Learners, Native Spanish Speakers

    Science.gov (United States)

    Solano-Flores, Guillermo; Li, Min

    2009-01-01

    We investigated language variation and score variation in the testing of English language learners, native Spanish speakers. We gave students the same set of National Assessment of Educational Progress mathematics items in both their first language and their second language. We examined the amount of score variation due to the main and interaction…

  10. Native Speakers of Arabic and ESL Texts: Evidence for the Transfer of Written Word Identification Processes

    Science.gov (United States)

    Hayes-Harb, Rachel

    2006-01-01

    English as a second language (ESL) teachers have long noted that native speakers of Arabic exhibit exceptional difficulty with English reading comprehension (e.g., Thompson-Panos & Thomas-Ruzic, 1983). Most existing work in this area has looked to higher level aspects of reading such as familiarity with discourse structure and cultural knowledge…

  11. Revealing Word Order: Using Serial Position in Binomials to Predict Properties of the Speaker

    Science.gov (United States)

    Iliev, Rumen; Smirnova, Anastasia

    2016-01-01

    Three studies test the link between word order in binomials and psychological and demographic characteristics of a speaker. While linguists have already suggested that psychological, cultural and societal factors are important in choosing word order in binomials, the vast majority of relevant research was focused on general factors and on broadly…

  12. Visual and auditory digit-span performance in native and nonnative speakers

    NARCIS (Netherlands)

    Olsthoorn, N.M.; Andringa, S.; Hulstijn, J.H.

    2014-01-01

    We compared 121 native and 114 non-native speakers of Dutch (with 35 different first languages) on four digit-span tasks, varying modality (visual/auditory) and direction (forward/backward). An interaction was observed between nativeness and modality, such that, while natives performed better than

  13. Studies on inter-speaker variability in speech and its application in ...

    Indian Academy of Sciences (India)

    In this review article, we study the variability in the acoustic signal among different speakers enunciating the same sound. ...... search, etc., nevertheless, the computational costs are high. Therefore, there have ... generation but will also help in accounting for the Jacobian of the transformation which is dis- cussed later in this ...

  14. "Feminism Lite?" Feminist Identification, Speaker Appearance, and Perceptions of Feminist and Antifeminist Messengers

    Science.gov (United States)

    Bullock, Heather E.; Fernald, Julian L.

    2003-01-01

    Drawing on a communications model of persuasion (Hovland, Janis, & Kelley, 1953), this study examined the effect of target appearance on feminists' and nonfeminists' perceptions of a speaker delivering a feminist or an antifeminist message. One hundred three college women watched one of four videotaped speeches that varied by content (profeminist…

  15. Myths and Misconceptions about Nonnative English Speakers in the TESOL (NNEST) Movement

    Science.gov (United States)

    Selvi, Ali Fuad

    2014-01-01

    Parallel to the growing recognition of English as an international language, the fundamental premises of the TESOL discipline (e.g., the ownership of the language, native speakers as a goal and model of competence for learning and teaching, linguistic standards and language variety/ies to be taught, monolingual/monocultural approach to teaching)…

  16. Selectivity in L1 Attrition: Differential Object Marking in Spanish Near-Native Speakers of English

    Science.gov (United States)

    Chamorro, Gloria; Sturt, Patrick; Sorace, Antonella

    2016-01-01

    Previous research has shown L1 attrition to be restricted to structures at the interfaces between syntax and pragmatics, but not to occur with syntactic properties that do not involve such interfaces ("Interface Hypothesis", Sorace and Filiaci in "Anaphora resolution in near-native speakers of Italian." "Second Lang…

  17. Eye Gaze and Production Accuracy Predict English L2 Speakers' Morphosyntactic Learning

    Science.gov (United States)

    McDonough, Kim; Trofimovich, Pavel; Dao, Phung; Dio, Alexandre

    2017-01-01

    This study investigated the relationship between second language (L2) speakers' success in learning a new morphosyntactic pattern and characteristics of one-on-one learning activities, including opportunities to comprehend and produce the target pattern, receive feedback from an interlocutor, and attend to the meaning of the pattern through self-…

  18. Longitudinal Speaker Clustering and Verification Corpus with Code-switching Frisian-Dutch Speech

    NARCIS (Netherlands)

    Yilmaz, E.; Dijkstra, J.E.; Van de Velde, Hans; Kampstra, F.; Algra, J.; van den Heuvel, H.; van Leeuwen, D.

    2017-01-01

    In this paper, we present a new longitudinal and bilingual broadcast database designed for speaker clustering and text- independent verification research. The broadcast data is ex- tracted from the archives of Omrop Fryslaˆn which is the re- gional broadcaster in the province of Fryslaˆn, located in

  19. Production of verbs in base position by Dutch agrammatic speakers : Inflection versus finiteness

    NARCIS (Netherlands)

    Bastiaanse, Y.R.M.

    Several hypotheses have been formulated to predict the pattern of performance on finite verbs by agrammatic speakers. The present study is focused on finite and nonfinite verbs in base-position. Three finite verb forms (third person singular in past and present tense and third person plural in

  20. A Nonword Repetition Task for Speakers with Misarticulations: The Syllable Repetition Task (SRT)

    Science.gov (United States)

    Shriberg, Lawrence D.; Lohmeier, Heather L.; Campbell, Thomas F.; Dollaghan, Christine A.; Green, Jordan R.; Moore, Christopher A.

    2009-01-01

    Purpose: Conceptual and methodological confounds occur when non(sense) word repetition tasks are administered to speakers who do not have the target speech sounds in their phonetic inventories or who habitually misarticulate targeted speech sounds. In this article, the authors (a) describe a nonword repetition task, the Syllable Repetition Task…

  1. Infants Prefer Tunes Previously Introduced by Speakers of Their Native Language

    Science.gov (United States)

    Soley, Gaye; Sebastián-Gallés, Núria

    2015-01-01

    Infants show attentional biases for certain individuals over others based on various cues. However, the role of these biases in shaping infants' preferences and learning is not clear. This study asked whether infants' preference for native speakers (Kinzler, Dupoux, & Spelke, 2007) would modulate their preferences for tunes. After getting…

  2. Native Speaker Norms and China English: From the Perspective of Learners and Teachers in China

    Science.gov (United States)

    He, Deyuan; Zhang, Qunying

    2010-01-01

    This article explores the question of whether the norms based on native speakers of English should be kept in English teaching in an era when English has become World Englishes. This is an issue that has been keenly debated in recent years, not least in the pages of "TESOL Quarterly." However, "China English" in such debates…

  3. Development of a Self-Report Tool to Evaluate Hearing Aid Outcomes among Chinese Speakers

    Science.gov (United States)

    Wong, Lena L. N.; Hang, Na

    2014-01-01

    Purpose: This article reports on the development of a self-report tool--the Chinese Hearing Aid Outcomes Questionnaire (CHAOQ)--to evaluate hearing aid outcomes among Chinese speakers. Method: There were 4 phases to construct the CHAOQ and evaluate its psychometric properties. First, items were selected to evaluate a range of culturally relevant…

  4. Speech overlap detection in a two-pass speaker diarization system

    NARCIS (Netherlands)

    Huijbregts, M.; Leeuwen, D.A. van; Jong, F.M.G. de

    2009-01-01

    In this paper we present the two-pass speaker diarization system that we developed for the NIST RT09s evaluation. In the first pass of our system a model for speech overlap detection is generated automatically. This model is used in two ways to reduce the diarization errors due to overlapping

  5. The Role of Statistical Learning and Working Memory in L2 Speakers' Pattern Learning

    Science.gov (United States)

    McDonough, Kim; Trofimovich, Pavel

    2016-01-01

    This study investigated whether second language (L2) speakers' morphosyntactic pattern learning was predicted by their statistical learning and working memory abilities. Across three experiments, Thai English as a Foreign Language (EFL) university students (N = 140) were exposed to either the transitive construction in Esperanto (e.g., "tauro…

  6. Discrepancy Between Native English Speaker Teachers' Teaching Styles and Chinese English Learners' Learning Styles

    Science.gov (United States)

    Zhou, Li; Fan, Zhi-zhong

    2007-01-01

    Recent decades have witnessed a growing number of native English speakers teaching English in Chinese classroom. However, their teaching does not gain expected ends. Extensive studies have found that the mismatch between learning styles and teaching styles is a possible reason for this phenomenon. This paper aims to investigate whether the…

  7. INTERPOL survey of the use of speaker identification by law enforcement agencies.

    Science.gov (United States)

    Morrison, Geoffrey Stewart; Sahito, Farhan Hyder; Jardine, Gaëlle; Djokic, Djordje; Clavet, Sophie; Berghs, Sabine; Goemans Dorny, Caroline

    2016-06-01

    A survey was conducted of the use of speaker identification by law enforcement agencies around the world. A questionnaire was circulated to law enforcement agencies in the 190 member countries of INTERPOL. 91 responses were received from 69 countries. 44 respondents reported that they had speaker identification capabilities in house or via external laboratories. Half of these came from Europe. 28 respondents reported that they had databases of audio recordings of speakers. The clearest pattern in the responses was that of diversity. A variety of different approaches to speaker identification were used: The human-supervised-automatic approach was the most popular in North America, the auditory-acoustic-phonetic approach was the most popular in Europe, and the spectrographic/auditory-spectrographic approach was the most popular in Africa, Asia, the Middle East, and South and Central America. Globally, and in Europe, the most popular framework for reporting conclusions was identification/exclusion/inconclusive. In Europe, the second most popular framework was the use of verbal likelihood ratio scales. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  8. Influential Factors in Incomplete Acquisition and Attrition of Young Heritage Speakers' Vocabulary Knowledge

    Science.gov (United States)

    Gharibi, Khadijeh; Boers, Frank

    2017-01-01

    This study investigates whether young heritage speakers, either simultaneous or sequential bilinguals, have limited vocabulary knowledge in their family language compared to matched monolingual counterparts and, if so, what factors help to account for this difference. These factors include age, age at emigration, length of emigration, frequency of…

  9. Emotion-Memory Effects in Bilingual Speakers: A Levels-of-Close Processing Approach

    Science.gov (United States)

    Aycicegi-Dinn, Ayse; Caldwell-Harris, Catherine L.

    2009-01-01

    Emotion-memory effects occur when emotion words are more frequently recalled than neutral words. Bilingual speakers report that taboo terms and emotional phrases generate a stronger emotional response when heard or spoken in their first language. This suggests that the basic emotion-memory will be stronger for words presented in a first language.…

  10. Linguistic skills of adult native speakers, as a function of age and level of education

    NARCIS (Netherlands)

    Mulder, K.; Hulstijn, J.H.

    2011-01-01

    This study assessed, in a sample of 98 adult native speakers of Dutch, how their lexical skills and their speaking proficiency varied as a function of their age and level of education and profession (EP). Participants, categorized in terms of their age (18-35, 36-50, and 51-76 years old) and the

  11. Spanish Is Foreign: Heritage Speakers' Interpretations of the Introductory Spanish Language Curriculum

    Science.gov (United States)

    DeFeo, Dayna Jean

    2015-01-01

    This article presents a case study of the perceptions of Spanish heritage speakers enrolled in introductory-level Spanish foreign language courses. Despite their own identities that were linked to the United States and Spanish of the Borderlands, the participants felt that the curriculum acknowledged the Spanish of Spain and foreign countries but…

  12. The Effects of Listener and Speaker Training on Emergent Relations in Children with Autism

    Science.gov (United States)

    Sprinkle, Evelyn C.; Miguel, Caio F.

    2012-01-01

    The current study assessed the use of standard conditional discrimination (i.e., listener) and textual/tact (i.e., speaker) training in the establishment of equivalence classes containing dictated names, tacts/textual responses, pictures and printed words. Four children (ages 5 to 7 years) diagnosed with autism were taught to select pictures and…

  13. Self-Disclosure in Initial Interactions amongst Speakers of American and Australian English

    Science.gov (United States)

    Haugh, Michael; Carbaugh, Donal

    2015-01-01

    Getting acquainted with others is one of the most basic interpersonal communication events. Yet there has only been a limited number of studies that have examined variation in the interactional practices through which unacquainted persons become acquainted and establish relationships across speakers of the same language. The current study focuses…

  14. Intonation Contrast in Cantonese Speakers with Hypokinetic Dysarthria Associated with Parkinson's Disease

    Science.gov (United States)

    Ma, Joan K.-Y.; Whitehill, Tara L.; So, Susanne Y.-S.

    2010-01-01

    Purpose: Speech produced by individuals with hypokinetic dysarthria associated with Parkinson's disease (PD) is characterized by a number of features including impaired speech prosody. The purpose of this study was to investigate intonation contrasts produced by this group of speakers. Method: Speech materials with a question-statement contrast…

  15. Bilingual Language Development and Disorders in Spanish-English Speakers. Second Edition

    Science.gov (United States)

    Goldstein, Brian A., Ed.

    2012-01-01

    Because dual language learners are the fastest--growing segment of the U.S. student population--and the majority speak Spanish as a first language--the new generation of SLPs must have comprehensive knowledge of how to work effectively with bilingual speakers. That's what they'll get in the second edition of this book, an ideal graduate-level text…

  16. Neural Control of Rising and Falling Tones in Mandarin Speakers Who Stutter

    Science.gov (United States)

    Howell, Peter; Jiang, Jing; Peng, Danling; Lu, Chunming

    2012-01-01

    Neural control of rising and falling tones in Mandarin people who stutter (PWS) was examined by comparing with that which occurs in fluent speakers [Howell, Jiang, Peng, and Lu (2012). Neural control of fundamental frequency rise and fall in Mandarin tones. "Brain and Language, 121"(1), 35-46]. Nine PWS and nine controls were scanned. Functional…

  17. Minority Language Speakers as Migrants: Some Preliminary Observations on the Sudanese Community in Melbourne

    Science.gov (United States)

    Musgrave, Simon; Hajek, John

    2013-01-01

    The language problems faced by migrants may be more complex when they come from a minority language group in their homeland. The new arrivals may find that there are few, or even no, speakers of their language in the community to which they have moved. Then decisions have to be made as to whether to attempt to maintain the native language and also…

  18. Memory for Discourse: The Effects of Inferences about the Speaker's Intentions.

    Science.gov (United States)

    Dodd, David H.; Housel, Thomas J.

    A listener's inferences about why speakers say what they do can influence what the listener remembers. Psychologists studying memory for discourse have neglected to give attention to the source of the information to be remembered. In a study to determine what effects adding information has on remembering initial discourse, subjects listening to…

  19. An Evaluation of Native-speaker Judgements of Foreign-accented British and American English

    NARCIS (Netherlands)

    Doel, W.Z. van den

    2006-01-01

    This study is the first ever to employ a large-scale Internet survey to investigate priorities in English pronunciation training. Well over 500 native speakers from throughout the English-speaking world, including North America, the British Isles, Australia and New Zealand, were asked to detect and

  20. The Use of Voice Cues for Speaker Gender Recognition in Cochlear Implant Recipients

    Science.gov (United States)

    Meister, Hartmut; Fürsen, Katrin; Streicher, Barbara; Lang-Roth, Ruth; Walger, Martin

    2016-01-01

    Purpose: The focus of this study was to examine the influence of fundamental frequency (F0) and vocal tract length (VTL) modifications on speaker gender recognition in cochlear implant (CI) recipients for different stimulus types. Method: Single words and sentences were manipulated using isolated or combined F0 and VTL cues. Using an 11-point…

  1. Acoustic Cues to Perception of Word Stress by English, Mandarin, and Russian Speakers

    Science.gov (United States)

    Chrabaszcz, Anna; Winn, Matthew; Lin, Candise Y.; Idsardi, William J.

    2014-01-01

    Purpose: This study investigated how listeners' native language affects their weighting of acoustic cues (such as vowel quality, pitch, duration, and intensity) in the perception of contrastive word stress. Method: Native speakers (N = 45) of typologically diverse languages (English, Russian, and Mandarin) performed a stress identification…

  2. Do Native Speakers of North American and Singapore English Differentially Perceive Comprehensibility in Second Language Speech?

    Science.gov (United States)

    Saito, Kazuya; Shintani, Natsuko

    2016-01-01

    The current study examined the extent to which native speakers of North American and Singapore English differentially perceive the comprehensibility (ease of understanding) of second language (L2) speech. Spontaneous speech samples elicited from 50 Japanese learners of English with various proficiency levels were first rated by 10 Canadian and 10…

  3. Art, Symbol and Royalty: A Case Study of the Yoruba Speakers in ...

    African Journals Online (AJOL)

    This paper set out to examine the relationship and importance of Art, Symbol and Royalty amongst the Yoruba speakers in Nigeria. While studies have been done in different areas of this topic, yet this area remains unattended to despite its importance in art history. The aim of this paper is to bring awareness and fill the ...

  4. DOMAIN MISMATCH COMPENSATION FOR SPEAKER RECOGNITION USING A LIBRARY OF WHITENERS

    Science.gov (United States)

    2015-05-29

    standardization [3], feature mapping [4], source normalization [5], inter-dataset variability compensation via nuisance attribute projection [6], and within-class...longstanding method of normalization that was originally developed to provide noise robust speech recognition [9] and was then applied to speaker...an acoustic feature vector post- processing method designed to introduce robustness under linear channel mismatch and additive noise conditions

  5. Insight into the Attitudes of Speakers of Urban Meccan Hijazi Arabic towards Their Dialect

    Science.gov (United States)

    Alahmadi, Sameeha D.

    2016-01-01

    The current study mainly aims to examine the attitudes of speakers of Urban Meccan Hijazi Arabic (UMHA) towards their dialect, which is spoken in Mecca, Saudi Arabia. It also investigates whether the participants' age, sex and educational level have any impact on their perception of their dialect. To this end, I designed a 5-point-Likert-scale…

  6. Embodied language in first- and second-language speakers: neural correlates of processing motor verbs.

    Science.gov (United States)

    De Grauwe, Sophie; Willems, Roel M; Rueschemeyer, Shirley-Ann; Lemhöfer, Kristin; Schriefers, Herbert

    2014-04-01

    The involvement of neural motor and sensory systems in the processing of language has so far mainly been studied in native (L1) speakers. In an fMRI experiment, we investigated whether non-native (L2) semantic representations are rich enough to allow for activation in motor and somatosensory brain areas. German learners of Dutch and a control group of Dutch native speakers made lexical decisions about visually presented Dutch motor and non-motor verbs. Region-of-interest (ROI) and whole-brain analyses indicated that L2 speakers, like L1 speakers, showed significantly increased activation for simple motor compared to non-motor verbs in motor and somatosensory regions. This effect was not restricted to Dutch-German cognate verbs, but was also present for non-cognate verbs. These results indicate that L2 semantic representations are rich enough for motor-related activations to develop in motor and somatosensory areas. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Impact of Industry Guest Speakers on Business Students' Perceptions of Employability Skills Development

    Science.gov (United States)

    Riebe, L.; Sibson, R.; Roepen, D.; Meakins, K.

    2013-01-01

    This study provides insights into the perceptions and expectations of Australian undergraduate business students (n=150) regarding the incorporation of guest speakers into the curriculum of a leadership unit focused on employability skills development. The authors adopted a mixed methods approach. A survey was conducted, with quantitative results…

  8. Encouraging Advanced Second Language Speakers to Recognise Their Language Difficulties: A Personalised Computer-Based Approach

    Science.gov (United States)

    Xu, Jing; Bull, Susan

    2010-01-01

    Despite holding advanced language qualifications, many overseas students studying at English-speaking universities still have difficulties in formulating grammatically correct sentences. This article introduces an "independent open learner model" for advanced second language speakers of English, which confronts students with the state of their…

  9. Improving Speaker Identification Performance Under the Shouted Talking Condition Using the Second-Order Hidden Markov Models

    Directory of Open Access Journals (Sweden)

    Shahin Ismail

    2005-01-01

    Full Text Available Speaker identification systems perform well under the neutral talking condition; however, they suffer sharp degradation under the shouted talking condition. In this paper, the second-order hidden Markov models (HMM2s have been used to improve the recognition performance of isolated-word text-dependent speaker identification systems under the shouted talking condition. Our results show that HMM2s significantly improve the speaker identification performance compared to the first-order hidden Markov models (HMM1s. The average speaker identification performance under the shouted talking condition based on HMM1s is . On the other hand, the average speaker identification performance based on HMM2s is .

  10. Evaluation of Speakers at a National Continuing Medical Education (CME Course

    Directory of Open Access Journals (Sweden)

    Jannette Collins, MD, MEd, FCCP

    2002-12-01

    Full Text Available Purpose: Evaluations of a national radiology continuing medical education (CME course in thoracic imaging were analyzed to determine what constitutes effective and ineffective lecturing. Methods and Materials: Evaluations of sessions and individual speakers participating in a five-day course jointly sponsored by the Society of Thoracic Radiology (STR and the Radiological Society of North America (RSNA were tallied by the RSNA Department of Data Management and three members of the STR Training Committee. Comments were collated and analyzed to determine the number of positive and negative comments and common themes related to ineffective lecturing. Results: Twenty-two sessions were evaluated by 234 (75.7% of 309 professional registrants. Eighty-one speakers were evaluated by an average of 153 registrants (range, 2 – 313. Mean ratings for 10 items evaluating sessions ranged from 1.28 – 2.05 (1=most positive, 4=least positive; SD .451 - .902. The average speaker rating was 5.7 (1=very poor, 7=outstanding; SD 0.94; range 4.3 – 6.4. Total number of comments analyzed was 862, with 505 (58.6% considered positive and 404 (46.9% considered negative (the total number exceeds 862 as a “comment” could consist of both positive and negative statements. Poor content was mentioned most frequently, making up 107 (26.5% of 404 negative comments, and applied to 51 (63% of 81 speakers. Other negative comments, in order of decreasing frequency, were related to delivery, image slides, command of the English language, text slides, and handouts. Conclusions: Individual evaluations of speakers at a national CME course provided information regarding the quality of lectures that was not provided by evaluations of grouped presentations. Systematic review of speaker evaluations provided specific information related to the types and frequency of features related to ineffective lecturing. This information can be used to design CME course evaluations, design future CME

  11. The Acquisition of English Focus Marking by Non-Native Speakers

    Science.gov (United States)

    Baker, Rachel Elizabeth

    This dissertation examines Mandarin and Korean speakers' acquisition of English focus marking, which is realized by accenting particular words within a focused constituent. It is important for non-native speakers to learn how accent placement relates to focus in English because appropriate accent placement and realization makes a learner's English more native-like and easier to understand. Such knowledge may also improve their English comprehension skills. In this study, 20 native English speakers, 20 native Mandarin speakers, and 20 native Korean speakers participated in four experiments: (1) a production experiment, in which they were recorded reading the answers to questions, (2) a perception experiment, in which they were asked to determine which word in a recording was the last prominent word, (3) an understanding experiment, in which they were asked whether the answers in recorded question-answer pairs had context-appropriate prosody, and (4) an accent placement experiment, in which they were asked which word they would make prominent in a particular context. Finally, a new group of native English speakers listened to utterances produced in the production experiment, and determined whether the prosody of each utterance was appropriate for its context. The results of the five experiments support a novel predictive model for second language prosodic focus marking acquisition. This model holds that both transfer of linguistic features from a learner's native language (L1) and features of their second language (L2) affect learners' acquisition of prosodic focus marking. As a result, the model includes two complementary components: the Transfer Component and the L2 Challenge Component. The Transfer Component predicts that prosodic structures in the L2 will be more easily acquired by language learners that have similar structures in their L1 than those who do not, even if there are differences between the L1 and L2 in how the structures are realized. The L2

  12. The effects of L2 proficiency level on the processing of wh-questions among Dutch second language speakers of English

    NARCIS (Netherlands)

    Jackson, C.N.; Hell, J.G. van

    2011-01-01

    Using a self-paced reading task, the present study explores how Dutch-English L2 speakers parse English wh-subject-extractions and wh-object-extractions. Results suggest that English native speakers and highly-proficient Dutch–English L2 speakers do not always exhibit measurable signs of on-line

  13. On the same wavelength: predictable language enhances speaker-listener brain-to-brain synchrony in posterior superior temporal gyrus.

    Science.gov (United States)

    Dikker, Suzanne; Silbert, Lauren J; Hasson, Uri; Zevin, Jason D

    2014-04-30

    Recent research has shown that the degree to which speakers and listeners exhibit similar brain activity patterns during human linguistic interaction is correlated with communicative success. Here, we used an intersubject correlation approach in fMRI to test the hypothesis that a listener's ability to predict a speaker's utterance increases such neural coupling between speakers and listeners. Nine subjects listened to recordings of a speaker describing visual scenes that varied in the degree to which they permitted specific linguistic predictions. In line with our hypothesis, the temporal profile of listeners' brain activity was significantly more synchronous with the speaker's brain activity for highly predictive contexts in left posterior superior temporal gyrus (pSTG), an area previously associated with predictive auditory language processing. In this region, predictability differentially affected the temporal profiles of brain responses in the speaker and listeners respectively, in turn affecting correlated activity between the two: whereas pSTG activation increased with predictability in the speaker, listeners' pSTG activity instead decreased for more predictable sentences. Listeners additionally showed stronger BOLD responses for predictive images before sentence onset, suggesting that highly predictable contexts lead comprehenders to preactivate predicted words.

  14. THE ROLE OF NON-NATIVE ENGLISH SPEAKER TEACHERS IN ENGLISH LANGUAGE LEARNING

    Directory of Open Access Journals (Sweden)

    Lutfi Ashar Mauludin

    2017-04-01

    Full Text Available Native-English Speaker Teachers (NESTs and Non-Native English Speaker Teachers (NNESTs have their own advantages and disadvantages. However, for English Language Learners (ELLs, NNESTs have more advantages in helping students to acquire English skills. At least there are three factors that can only be performed by NNESTs in English Language Learning. The factors are knowledge of the subject, effective communication, and understanding students‘ difficulties/needs. The NNESTs can effectively provide the clear explanation of knowledge of the language because they are supported by the same background and culture. NNESTs also can communicate with the students with all levels effectively. The use of L1 is effective to help students building their knowledge. Finally, NNESTs can provide the objectives and materials that are suitable with the needs of the students.

  15. Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech

    DEFF Research Database (Denmark)

    Sahidullah, Md; Gonzalez Hautamäki, Rosa; Thomsen, Dennis Alexander Lehmann

    2016-01-01

    Accuracy of automatic speaker recognition (ASV) systems degrades severely in the presence of background noise. In this paper, we study the use of additional side information provided by a body-conducted sensor, throat microphone. Throat microphone signal is much less affected by background noise...... in comparison to acoustic microphone signal. This makes throat microphones potentially useful for feature extraction or speech activity detection. This paper, firstly, proposes a new prototype system for simultaneous data-acquisition of acoustic and throat microphone signals. Secondly, we study the use...... of this additional information for both speech activity detection, feature extraction and fusion of the acoustic and throat microphone signals. We collect a pilot database consisting of 38 subjects including both clean and noisy sessions. We carry out speaker verification experiments using Gaussian mixture model...

  16. Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition

    DEFF Research Database (Denmark)

    Shepstone, Sven Ewan; Tan, Zheng-Hua; Jensen, Søren Holdt

    2013-01-01

    In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group...... profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile......, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate...

  17. Achieving Speaker Gender Equity at the American Society for Microbiology General Meeting.

    Science.gov (United States)

    Casadevall, Arturo

    2015-08-04

    In 2015, the American Society for Microbiology (ASM) General Meeting essentially achieved gender equity, with 48.5% of the oral presentations being given by women. The mechanisms associated with increased female participation were (i) making the Program Committee aware of gender statistics, (ii) increasing female representation among session convener teams, and (iii) direct instruction to try to avoid all-male sessions. The experience with the ASM General Meeting shows that it is possible to increase the participation of female speakers in a relatively short time and suggests concrete steps that may be taken to achieve this at other meetings. Public speaking is very important for academic advancement in science. Historically women have been underrepresented as speakers in many scientific meetings. This article describes concrete steps that were associated with achieving gender equity at a major meeting. Copyright © 2015 Casadevall.

  18. A Novel Approach in Text-Independent Speaker Recognition in Noisy Environment

    Directory of Open Access Journals (Sweden)

    Nona Heydari Esfahani

    2014-10-01

    Full Text Available In this paper, robust text-independent speaker recognition is taken into consideration. The proposed method performs on manual silence-removed utterances that are segmented into smaller speech units containing few phones and at least one vowel. The segments are basic units for long-term feature extraction. Sub-band entropy is directly extracted in each segment. A robust vowel detection method is then applied on each segment to separate a high energy vowel that is used as unit for pitch frequency and formant extraction. By applying a clustering technique, extracted short-term features namely MFCC coefficients are combined with long term features. Experiments using MLP classifier show that the average speaker accuracy recognition rate is 97.33% for clean speech and 61.33% in noisy environment for -2db SNR, that shows improvement compared to other conventional methods.

  19. Phonological knowledge of rhotics in English L2 of brazilian speakers

    Directory of Open Access Journals (Sweden)

    Thais Cristófaro Silva

    2016-01-01

    Full Text Available This article discusses the appropriation of rhotics in English-L2 spoken by Brazilian speakers. he results show that rhotics are quickly incorporated into the English-L2 speakers’ production. A detailed examination of the data indicates that the following factors are relevant in the appropriation of retrolex approximant in English-L2: proficiency, the individual (learner and the lexical item. he fact that the appropriation of the retrolex approximant quickly achieves excellent levels in English-L2 spoken by Brazilian speakers suggests that teaching of pronunciation is speciic and not global. Based on Multirepresentational Models, it is argued that grammatical knowledge is a dynamic construct, interlaced by various linguistic and non-linguistic factors.

  20. Does Grammatical Gender Influence Perception? A Study of Polish and French Speakers

    Directory of Open Access Journals (Sweden)

    Haertlé Izabella

    2017-12-01

    Full Text Available Can the perception of a word be influenced by its grammatical gender? Can it happen that speakers of one language perceive an object to have masculine features, while speakers of another language perceive the same object to have feminine features? Previous studies suggest that this is the case, and also that there is some supra-language gender categorisation of objects as natural/feminine and artefact/masculine. This study was an attempt to replicate these findings on another population of subjects. This is the first Polish study of this kind, comparing the perceptions of objects by Polish- and French-speaking individuals. The results of this study show that grammatical gender may cue people to assess objects as masculine or feminine. However, the findings of some previous studies, that feminine features are more often ascribed to natural objects than artifacts, were not replicated.

  1. MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech

    Directory of Open Access Journals (Sweden)

    J. Nouza

    2004-09-01

    Full Text Available The paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriorimethod for this purpose. The MAP based formulae for the adaptation ofthe HMM (Hidden Markov Model parameters are described. Severalalternative versions of this method have been implemented andexperimentally verified in two areas, first in the isolated-wordrecognition (IWR task and later also in the large vocabularycontinuous speech recognition (LVCSR system, both developed for theCzech language. The results show that the word error rate (WER can bereduced by more than 20% for a speaker who provides tens of words (incase of IWR or tens of sentences (in case of LVCSR for theadaptation. Recently, we have used the described methods in the designof two practical applications: voice dictation to a PC and automatictranscription of radio and TV news.

  2. Lip-Synching Using Speaker-Specific Articulation, Shape and Appearance Models

    Directory of Open Access Journals (Sweden)

    Breton Gaspard

    2009-01-01

    Full Text Available We describe here the control, shape and appearance models that are built using an original photogrammetric method to capture characteristics of speaker-specific facial articulation, anatomy, and texture. Two original contributions are put forward here: the trainable trajectory formation model that predicts articulatory trajectories of a talking face from phonetic input and the texture model that computes a texture for each 3D facial shape according to articulation. Using motion capture data from different speakers and module-specific evaluation procedures, we show here that this cloning system restores detailed idiosyncrasies and the global coherence of visible articulation. Results of a subjective evaluation of the global system with competing trajectory formation models are further presented and commented.

  3. Speaker-sensitive emotion recognition via ranking: Studies on acted and spontaneous speech☆

    Science.gov (United States)

    Cao, Houwei; Verma, Ragini; Nenkova, Ani

    2015-01-01

    We introduce a ranking approach for emotion recognition which naturally incorporates information about the general expressivity of speakers. We demonstrate that our approach leads to substantial gains in accuracy compared to conventional approaches. We train ranking SVMs for individual emotions, treating the data from each speaker as a separate query, and combine the predictions from all rankers to perform multi-class prediction. The ranking method provides two natural benefits. It captures speaker specific information even in speaker-independent training/testing conditions. It also incorporates the intuition that each utterance can express a mix of possible emotion and that considering the degree to which each emotion is expressed can be productively exploited to identify the dominant emotion. We compare the performance of the rankers and their combination to standard SVM classification approaches on two publicly available datasets of acted emotional speech, Berlin and LDC, as well as on spontaneous emotional data from the FAU Aibo dataset. On acted data, ranking approaches exhibit significantly better performance compared to SVM classification both in distinguishing a specific emotion from all others and in multi-class prediction. On the spontaneous data, which contains mostly neutral utterances with a relatively small portion of less intense emotional utterances, ranking-based classifiers again achieve much higher precision in identifying emotional utterances than conventional SVM classifiers. In addition, we discuss the complementarity of conventional SVM and ranking-based classifiers. On all three datasets we find dramatically higher accuracy for the test items on whose prediction the two methods agree compared to the accuracy of individual methods. Furthermore on the spontaneous data the ranking and standard classification are complementary and we obtain marked improvement when we combine the two classifiers by late-stage fusion.

  4. A One-Class Classification Approach to Generalised Speaker Verification Spoofing Countermeasures using Local Binary Patterns

    OpenAIRE

    Alegre, Federico; Amehraye, Asmaa; Evans, Nicholas

    2013-01-01

    International audience; The vulnerability of automatic speaker verification systems to spoofing is now well accepted. While recent work has shown the potential to develop countermeasures capable of detecting spoofed speech signals, existing solutions typically function well only for specific attacks on which they are optimised. Since the exact nature of spoofing attacks can never be known in practice, there is thus a need for generalised countermeasures which can detect previously unseen spoo...

  5. ON THE VULNERABILITY OF AUTOMATIC SPEAKER RECOGNITION TO SPOOFING ATTACKS WITH ARTIFICIAL SIGNALS

    OpenAIRE

    Alegre, Federico; Vipperla, Ravichander; Evans, Nicholas; Fauve, Benoît

    2012-01-01

    International audience; Automatic speaker verification (ASV) systems are increasingly being used for biometric authentication even if their vulnerability to imposture or spoofing is now widely acknowledged. Recent work has proposed different spoofing approaches which can be used to test vulnerabilities. This paper introduces a new approach based on artificial, tone-like signals which provoke higher ASV scores than genuine client tests. Experimental results show degradations in the equal error...

  6. Hemispheric lateralization of linguistic prosody recognition in comparison to speech and speaker recognition.

    Science.gov (United States)

    Kreitewolf, Jens; Friederici, Angela D; von Kriegstein, Katharina

    2014-11-15

    Hemispheric specialization for linguistic prosody is a controversial issue. While it is commonly assumed that linguistic prosody and emotional prosody are preferentially processed in the right hemisphere, neuropsychological work directly comparing processes of linguistic prosody and emotional prosody suggests a predominant role of the left hemisphere for linguistic prosody processing. Here, we used two functional magnetic resonance imaging (fMRI) experiments to clarify the role of left and right hemispheres in the neural processing of linguistic prosody. In the first experiment, we sought to confirm previous findings showing that linguistic prosody processing compared to other speech-related processes predominantly involves the right hemisphere. Unlike previous studies, we controlled for stimulus influences by employing a prosody and speech task using the same speech material. The second experiment was designed to investigate whether a left-hemispheric involvement in linguistic prosody processing is specific to contrasts between linguistic prosody and emotional prosody or whether it also occurs when linguistic prosody is contrasted against other non-linguistic processes (i.e., speaker recognition). Prosody and speaker tasks were performed on the same stimulus material. In both experiments, linguistic prosody processing was associated with activity in temporal, frontal, parietal and cerebellar regions. Activation in temporo-frontal regions showed differential lateralization depending on whether the control task required recognition of speech or speaker: recognition of linguistic prosody predominantly involved right temporo-frontal areas when it was contrasted against speech recognition; when contrasted against speaker recognition, recognition of linguistic prosody predominantly involved left temporo-frontal areas. The results show that linguistic prosody processing involves functions of both hemispheres and suggest that recognition of linguistic prosody is based on

  7. Decoding speech perception by native and non-native speakers using single-trial electrophysiological data.

    Directory of Open Access Journals (Sweden)

    Alex Brandmeyer

    Full Text Available Brain-computer interfaces (BCIs are systems that use real-time analysis of neuroimaging data to determine the mental state of their user for purposes such as providing neurofeedback. Here, we investigate the feasibility of a BCI based on speech perception. Multivariate pattern classification methods were applied to single-trial EEG data collected during speech perception by native and non-native speakers. Two principal questions were asked: 1 Can differences in the perceived categories of pairs of phonemes be decoded at the single-trial level? 2 Can these same categorical differences be decoded across participants, within or between native-language groups? Results indicated that classification performance progressively increased with respect to the categorical status (within, boundary or across of the stimulus contrast, and was also influenced by the native language of individual participants. Classifier performance showed strong relationships with traditional event-related potential measures and behavioral responses. The results of the cross-participant analysis indicated an overall increase in average classifier performance when trained on data from all participants (native and non-native. A second cross-participant classifier trained only on data from native speakers led to an overall improvement in performance for native speakers, but a reduction in performance for non-native speakers. We also found that the native language of a given participant could be decoded on the basis of EEG data with accuracy above 80%. These results indicate that electrophysiological responses underlying speech perception can be decoded at the single-trial level, and that decoding performance systematically reflects graded changes in the responses related to the phonological status of the stimuli. This approach could be used in extensions of the BCI paradigm to support perceptual learning during second language acquisition.

  8. Some complications in the methodology of teaching Tuvan to non-native speakers

    Directory of Open Access Journals (Sweden)

    Nadezhda D. Suvandii

    2018-03-01

    Full Text Available The foundation of the language policies of the Republic of Tuva is based on strategy of preserving and distributing the Tuvan language as a state language in the territory of Tuva, declared in the State Program for Developing and Preserving the Tuvan Language for the period from 2017 to 2020. One of the named pending problems of methodology of teaching Tuvan is teaching Tuvan to non-native speakers. At the same time, we leak special scientific studies in teaching the Tuvan language to non-native speakers in common methodologies of teaching Tuvan. Teachers giving Tuvan language to non-native speakers are facing a challenge of solving this issue. Article describes some aspects in methodology of teaching Tuvan to Russian-speaking residents of Tuva and foreigners. It is based on the authors’ expertise in 2016-2017. Special letters ө, ү, ң among 36 letters of Tuvan alphabet present a special task of learning their pronunciation. One of most challenging tasks for Russian-speakers in learning Tuvan is assimilation of long and pharyngealized vowels. Long vowels in writing are presented with paired same letters and extemporaneous students would try to pronounce them as two sounds. Special attention when learning Tuvan should be paid to specific sounds articulated with throat. In Tuvan language these vowels are called pharyngealized. In comparison with other Turkic languages pharyngealization is a distinctive feature of language vocalism in Tuvan. Another challenge in learning Tuvan is assimilation of harmony of vowels – a distinctive feature of all Turkic languages. Another significant feature of Tuvan language is presence of strong aspirate consonants. Authors believe that studies in methodological basis should be continued and it is essential for teachers to receive a special education and additional training.

  9. Linear array of photodiodes to track a human speaker for video recording

    Science.gov (United States)

    DeTone, D.; Neal, H.; Lougheed, R.

    2012-12-01

    Communication and collaboration using stored digital media has garnered more interest by many areas of business, government and education in recent years. This is due primarily to improvements in the quality of cameras and speed of computers. An advantage of digital media is that it can serve as an effective alternative when physical interaction is not possible. Video recordings that allow for viewers to discern a presenter's facial features, lips and hand motions are more effective than videos that do not. To attain this, one must maintain a video capture in which the speaker occupies a significant portion of the captured pixels. However, camera operators are costly, and often do an imperfect job of tracking presenters in unrehearsed situations. This creates motivation for a robust, automated system that directs a video camera to follow a presenter as he or she walks anywhere in the front of a lecture hall or large conference room. Such a system is presented. The system consists of a commercial, off-the-shelf pan/tilt/zoom (PTZ) color video camera, a necklace of infrared LEDs and a linear photodiode array detector. Electronic output from the photodiode array is processed to generate the location of the LED necklace, which is worn by a human speaker. The computer controls the video camera movements to record video of the speaker. The speaker's vertical position and depth are assumed to remain relatively constant- the video camera is sent only panning (horizontal) movement commands. The LED necklace is flashed at 70Hz at a 50% duty cycle to provide noise-filtering capability. The benefit to using a photodiode array versus a standard video camera is its higher frame rate (4kHz vs. 60Hz). The higher frame rate allows for the filtering of infrared noise such as sunlight and indoor lighting-a capability absent from other tracking technologies. The system has been tested in a large lecture hall and is shown to be effective.

  10. Reflecting on the dichotomy native-non native speakers in an EFL context

    OpenAIRE

    Mariño, Claudia

    2011-01-01

    This article provides a discussion based on constructs about the dichotomy betweennative and non-native speakers. Several models and examples are displayed about thespreading of the English language with the intention of understanding its developmentin the whole world and in Colombia, specifically. Then, some possible definitions aregiven to the term “native speaker” and its conceptualization is described as both realityand myth. One of the main reasons for writing this article is grounded on...

  11. Teaching Arabic to Native Speakers Educational Games in a New Curriculum

    DEFF Research Database (Denmark)

    Alshikhabobakr, Hanan; Papadopoulos, Pantelis M.; Ibrahim, Zeinab

    2016-01-01

    This paper presents nine educational games and activities for learning the Arabic language developed for Arabiyyatii project, a three-year endeavor that involves re-conceptualization of the standard Arabic language learning curriculum as a first language for kindergarten students. The applications...... presented in this paper are developed for tabletop surface computers, which support a collaborative and interactive learning environment. These applications focus on speaking, word production, and sentence recognition of the Modern Standard Arabic to young native speakers. This work employs...

  12. Phrase Structure Patterning and Licensing for English and Serbian Speaker-Oriented Adverb Subclasses

    Directory of Open Access Journals (Sweden)

    Dimković-Telebaković Gordana

    2016-12-01

    Full Text Available This paper attempts to set phrase structure rules for English and Serbian speaker-oriented adverb subclasses. Adverbs are looked at here as specifiers licensed by the semantic feature [ILLOCUTIONARY FORCE]. The results suggest that illocutionary, evaluative and evidential adverbs normally merge within the complementizer layer and the inflectional layer, and that English epistemic adverbs are in most cases preferably integrated into the inflectional layer, whereas Serbian epistemic adverbs tend to occur in the sentence-initial position.

  13. Neural control of rising and falling tones in Mandarin speakers who stutter

    OpenAIRE

    Howell, P.; Jiang, J.; Peng, D.; Lu, C.

    2012-01-01

    Neural control of rising and falling tones in Mandarin people who stutter (PWS) was examined by comparing with that which occurs in fluent speakers [Howell, Jiang, Peng, and Lu (2012). Neural control of fundamental frequency rise and fall in Mandarin tones. Brain and Language, 121(1), 35–46]. Nine PWS and nine controls were scanned. Functional connectivity analysis showed that the connections between the insula and LMC and between the LMC and the putamen differed significantly between PWS and...

  14. Pitch perception and production in congenital amusia: Evidence from Cantonese speakers

    OpenAIRE

    Liu, Fang; Chan, Alice H. D.; Ciocca, Valter; Roquet, Catherine; Peretz, Isabelle; Wong, Patrick C. M.

    2016-01-01

    This study investigated pitch perception and production in speech and music in individuals with congenital amusia (a disorder of musical pitch processing) who are native speakers of Cantonese, a tone language with a highly complex tonal system. Sixteen Cantonese-speaking congenital amusics and 16 controls performed a set of lexical tone perception, production, singing, and psychophysical pitch threshold tasks. Their tone production accuracy and singing proficiency were subsequently judged by ...

  15. The Mechanism of Speech Processing in Congenital Amusia: Evidence from Mandarin Speakers

    OpenAIRE

    Liu, Fang; Jiang, Cunmei; Thompson, William Forde; Xu, Yi; Yang, Yufang; Stewart, Lauren

    2012-01-01

    Congenital amusia is a neuro-developmental disorder of pitch perception that causes severe problems with music processing but only subtle difficulties in speech processing. This study investigated speech processing in a group of Mandarin speakers with congenital amusia. Thirteen Mandarin amusics and thirteen matched controls participated in a set of tone and intonation perception tasks and two pitch threshold tasks. Compared with controls, amusics showed impaired performance on word discrimin...

  16. Linear array of photodiodes to track a human speaker for video recording

    International Nuclear Information System (INIS)

    DeTone, D; Neal, H; Lougheed, R

    2012-01-01

    Communication and collaboration using stored digital media has garnered more interest by many areas of business, government and education in recent years. This is due primarily to improvements in the quality of cameras and speed of computers. An advantage of digital media is that it can serve as an effective alternative when physical interaction is not possible. Video recordings that allow for viewers to discern a presenter's facial features, lips and hand motions are more effective than videos that do not. To attain this, one must maintain a video capture in which the speaker occupies a significant portion of the captured pixels. However, camera operators are costly, and often do an imperfect job of tracking presenters in unrehearsed situations. This creates motivation for a robust, automated system that directs a video camera to follow a presenter as he or she walks anywhere in the front of a lecture hall or large conference room. Such a system is presented. The system consists of a commercial, off-the-shelf pan/tilt/zoom (PTZ) color video camera, a necklace of infrared LEDs and a linear photodiode array detector. Electronic output from the photodiode array is processed to generate the location of the LED necklace, which is worn by a human speaker. The computer controls the video camera movements to record video of the speaker. The speaker's vertical position and depth are assumed to remain relatively constant– the video camera is sent only panning (horizontal) movement commands. The LED necklace is flashed at 70Hz at a 50% duty cycle to provide noise-filtering capability. The benefit to using a photodiode array versus a standard video camera is its higher frame rate (4kHz vs. 60Hz). The higher frame rate allows for the filtering of infrared noise such as sunlight and indoor lighting–a capability absent from other tracking technologies. The system has been tested in a large lecture hall and is shown to be effective.

  17. Lexical and grammatical development in trilingual speakers of isiXhosa, English and Afrikaans.

    Science.gov (United States)

    Potgieter, Anneke P

    2016-05-20

    There is a dearth of normative data on linguistic development among child speakers of Southern African languages, especially in the case of the multilingual children who constitute the largest part of this population. This inevitably impacts on the accuracy of developmental assessments of such speakers. Already negative lay opinion on the effect of early multilingualism on language development rates could be exacerbated by the lack of developmental data, ultimately affecting choices regarding home and school language policies. To establish whether trilinguals necessarily exhibit developmental delay when compared to monolinguals and, if so, whether this delay (1) occurs in terms of both lexical and grammatical development; and (2) in all three the trilinguals' languages, regardless of input quantity. Focusing on isiXhosa, South African English and Afrikaans, the study involved a comparison of 11 four-year-old developing trilinguals' acquisition of vocabulary and passive constructions with that of 10 age-matched monolingual speakers of each language. The trilinguals proved to be monolingual-like in their lexical development in the language to which, on average, they had been exposed most over time, that is, isiXhosa. No developmental delay was found in the trilinguals' acquisition of passive constructions, regardless of the language of testing. As previously found for bilingual development, necessarily reduced quantity of exposure does not hinder lexical development in the trilinguals' input dominant language. The overall lack of delay in their acquisition of the passive is interpreted as possible evidence of cross-linguistic bootstrapping and support for early multilingual exposure.

  18. Verbal short-term memory and language impairments in Cantonese speakers after stroke.

    Science.gov (United States)

    Ho, Diana Wai-Lam; Kong, Anthony Pak-Hin; Koon, Nim-Ting

    2017-02-21

    The study examined the relationship between verbal short-term memory (STM) and language impairment in Cantonese speakers after stroke. It is hypothesised that Cantonese speakers with left-hemisphere (LH) stroke would perform worse than those with right hemisphere (RH) stroke and normal controls. Specific linguistic factors of Cantonese might affect results in the tasks. Fifteen participants with LH stroke, 10 with RH stroke and 25 healthy controls were tested with auditory-verbal immediate serial recall (ISR) tasks and auditory linguistic tasks. All stroke participants were assessed with the Cantonese version of Western Aphasia Battery (CAB). The LH group performed significantly worse than the RH and healthy control groups in the auditory verbal ISR and auditory linguistic tasks. There were significant lexicality, frequency and imageability effects in most tasks. Auditory discrimination and word comprehension tasks, but not the auditory word recognition task had correlations with ISR tasks. Verbal STM and language performance of Cantonese-speakers with history of LH stroke were inferior to RH stroke and healthy controls. The effects of lexicality, word frequency and imageability on verbal STM memory performance were found. Cantonese tones have effects on performance in auditory word recognition task, similar to onset, nucleus and rime.

  19. The effect of different speaker accents on sentence comprehension in children with speech sound disorder.

    Science.gov (United States)

    Harte, Jennifer; Frizelle, Pauline; Gibbon, Fiona

    2017-12-26

    There is substantial evidence that a speaker's accent, specifically an unfamiliar accent, can affect the listener's comprehension. In general, this effect holds true for both adults and children as well as those with typical and impaired language. Previous studies have investigated the effect of different accents on individuals with language disorders, but children with speech sound disorders (SSDs) have received little attention. The current study aims to learn more about the ability of children with SSD to process different speaker accents. Fifteen children with SSD aged between 4;01 and 5;11 years, and 16 typically developing children matched on language ability, age, socioeconomic status, gender and cognitive ability participated in the current study. A sentence comprehension task was carried out with each child, requiring them to follow instructions of increasing length spoken in three different accents - (i) a local Irish (Cork) accent, (ii) a regional North American accent and (iii) a non-native Indian English accent. Results showed no significant group difference and speaker accent did not significantly impact children's performance on the task. The results are discussed in relation to factors that influence accent comprehension, and their implications for children's underlying phonological representations.

  20. Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms.

    Science.gov (United States)

    Bentz, Christian; Verkerk, Annemarie; Kiela, Douwe; Hill, Felix; Buttery, Paula

    2015-01-01

    Explaining the diversity of languages across the world is one of the central aims of typological, historical, and evolutionary linguistics. We consider the effect of language contact-the number of non-native speakers a language has-on the way languages change and evolve. By analysing hundreds of languages within and across language families, regions, and text types, we show that languages with greater levels of contact typically employ fewer word forms to encode the same information content (a property we refer to as lexical diversity). Based on three types of statistical analyses, we demonstrate that this variance can in part be explained by the impact of non-native speakers on information encoding strategies. Finally, we argue that languages are information encoding systems shaped by the varying needs of their speakers. Language evolution and change should be modeled as the co-evolution of multiple intertwined adaptive systems: On one hand, the structure of human societies and human learning capabilities, and on the other, the structure of language.

  1. Degrees of subjunctive vitality among monolingual speakers of Peninsular and Argentinian Spanish

    Directory of Open Access Journals (Sweden)

    Muriel Gallego

    2014-10-01

    Full Text Available This investigation seeks to expand the current understanding of mood use and vitality among monolingual native speakers of two varieties of Spanish. It focuses on establishing a cross-dialectal analysis of mood use in semi-spontaneous oral production. A total of 112 adult monolingual native speakers of Spanish participated in the study (N=56 from Rosario, Argentina and N=56 from Toledo, Spain. A controlled interview was administered to elicit the frequency and vitality of subjunctive use. Results indicate that the use of present subjunctive in complement clauses was found stable and vital among older speakers (51-73 years old in both regions. However, while the youngest generation (18-30 years old shows prevalence of non-subjunctive forms in Rosario, this age group favors the use of present subjunctive in Toledo. A similar pattern is observed with middle-aged participants. Results indicate an overall reduction of present subjunctive use among younger generations in Rosario, and vitality in every age group in Toledo.

  2. Native-speaker and English as a lingua franca pronunciation norms: English majors’ views

    Directory of Open Access Journals (Sweden)

    Aleksandra Wach

    2011-08-01

    Full Text Available Within the communicative approach to English as a foreign language (EFL teaching, the aims of instruction are primarily to enable learners to communicate; hence, functional and communicative intelligibility has become the goal of pronunciation training. On the other hand, contemporary approaches to EFL teaching leave sufficient room for accommodating the individual learner and contextual factors which largely influence the choice of the target pronunciation models. Moreover, in a globalized world, where English has become a contemporary lingua franca for intercultural communication, the pronunciation norms of English as a Lingua Franca (ELF appear to meet the needs and expectations of learners of English in international settings, coexisting with or replacing native-speaker pronunciation models as the target of instruction. The ELF approach and the Lingua Franca Core elaborated by Jenkins (2000, 2002 have aroused controversy among both researchers and EFL teachers. The paper presents the findings of a questionnaire study involving 234 Polish students, English majors, which aimed to determine their preferences and opinions concerning native-speaker and ELF norms as pronunciation instruction targets. The findings revealed a strong preference for native-like pronunciation models in the subjects’ own language development and a less strong preference for such models in pronunciation teaching at all levels of proficiency. Moreover, the results pointed to the significant role played by the intensity of pronunciation training and the level of awareness of native-speaker pronunciation models in shaping the subjects’ attitudes toward native-like and ELF pronunciation norms.

  3. An Investigation of Wavelet Average Framing LPC for Noisy Speaker Identification Environment

    Directory of Open Access Journals (Sweden)

    Khaled Daqrouq

    2015-01-01

    Full Text Available In the presented research paper, an average framing linear prediction coding (AFLPC method for a text-independent speaker identification system is studied. AFLPC was proposed in our previous work. Generally, linear prediction coding (LPC has been used in numerous speech recognition tasks. Here, an investigative procedure was based on studying the AFLPC speaker recognition system in a noisy environment. In the stage of feature extraction, the speaker-specific resonances of the vocal tract were extracted using the AFLPC technique. In the phase of classification, a probabilistic neural network (PNN and Bayesian classifier (BC were applied for comparison. In the performed investigation, the quality of different wavelet transforms with AFLPC techniques was compared with each other. In addition, the capability analysis of the proposed system was examined for comparison with other systems suggested in the literature. In response to an achieved experimental result in a noisy environment, the PNN classifier could have a better performance with the fusion of wavelets and AFLPC as a feature extraction technique termed WFALPCF.

  4. Lexical access in a bilingual speaker with dementia: Changes over time.

    Science.gov (United States)

    Lind, Marianne; Simonsen, Hanne Gram; Ribu, Ingeborg Sophie Bjønness; Svendsen, Bente Ailin; Svennevig, Jan; de Bot, Kees

    2017-10-18

    In this article, we explore the naming skills of a bilingual English-Norwegian speaker diagnosed with Primary Progressive Aphasia, in each of his languages across three different speech contexts: confrontation naming, semi-spontaneous narrative (picture description), and conversation, and at two points in time: 12 and 30 months post diagnosis, respectively. The results are discussed in light of two main theories of lexical retrieval in healthy, elderly speakers: the Transmission Deficit Hypothesis and the Inhibitory Deficit Theory. Our data show that, consistent with the participant's premorbid use of and proficiency in the two languages, his performance in his L2 is lower than in his L1, but this difference diminishes as the disease progresses. This is the case across the three speech contexts; however, the difference is smaller in the narrative task, where his performance is very low in both languages already at the first measurement point. Despite his word finding problems, he is able to take active part in conversation, particularly in his L1 and more so at the first measurement point. In addition to the task effect, we find effects of word class, frequency, and cognateness on his naming skills. His performance seems to support the Transmission Deficit Hypothesis. By combining different tools and methods of analysis, we get a more comprehensive picture of the impact of the dementia on the speaker's languages from an intra-individual as well as an inter-individual perspective, which may be useful in research as well as in clinical practice.

  5. Effect of Level of Presentation to Listeners on Scaled Speech Intelligibility of Speakers with Dysarthria

    Science.gov (United States)

    Kim, Yunjung; Kuo, Christina

    2011-01-01

    Objectivea This study examined the effect of intensity level of presentation on scaling of speech intelligibility in speakers with and without dysarthria. Patients and Methods A total of 50 utterances produced by speakers with dysarthria and healthy speakers were played to 60 listeners in four conditions, which consisted of two different presentation levels (‘high’ vs. ‘low’) and equalization of levels across utterances (‘adjusted’ vs. ‘unadjusted’). Speech intelligibility was scaled by using a direct magnitude estimation technique with and without modulus. Results A significant decrease in speech intelligibility was indicated when the stimuli were adjusted to have fixed intensity on the most intense vocalic nuclei of each word, while no significant change was found between ‘high’ and ‘low’ presentation level conditions. Conclusion The findings suggest that an increase in presentation level alone does not result in significant improvement in speech intelligibility ratings. The results are discussed by considering clinical implications in conducting speech therapy with emphasis on intensity variation. PMID:21952405

  6. Disrupted behaviour in grammatical morphology in French speakers with autism spectrum disorders.

    Science.gov (United States)

    Le Normand, Marie-Thérèse; Blanc, Romuald; Caldani, Simona; Bonnet-Brilhault, Frédérique

    2018-01-18

    Mixed and inconsistent findings have been reported across languages concerning grammatical morphology in speakers with Autism Spectrum Disorders (ASD). Some researchers argue for a selective sparing of grammar whereas others claim to have identified grammatical deficits. The present study aimed to investigate this issue in 26 participants with ASD speaking European French who were matched on age, gender and SES to 26 participants with typical development (TD). The groups were compared regarding their productivity and accuracy of syntactic and agreement categories using the French MOR part-of-speech tagger available from the CHILDES. The groups significantly differed in productivity with respect to nouns, adjectives, determiners, prepositions and gender markers. Error analysis revealed that ASD speakers exhibited a disrupted behaviour in grammatical morphology. They made gender, tense and preposition errors and they omitted determiners and pronouns in nominal and verbal contexts. ASD speakers may have a reduced sensitivity to perceiving and processing the distributional structure of syntactic categories when producing grammatical morphemes and agreement categories. The theoretical and cross-linguistic implications of these findings are discussed.

  7. Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms

    Science.gov (United States)

    Bentz, Christian; Verkerk, Annemarie; Kiela, Douwe; Hill, Felix; Buttery, Paula

    2015-01-01

    Explaining the diversity of languages across the world is one of the central aims of typological, historical, and evolutionary linguistics. We consider the effect of language contact-the number of non-native speakers a language has-on the way languages change and evolve. By analysing hundreds of languages within and across language families, regions, and text types, we show that languages with greater levels of contact typically employ fewer word forms to encode the same information content (a property we refer to as lexical diversity). Based on three types of statistical analyses, we demonstrate that this variance can in part be explained by the impact of non-native speakers on information encoding strategies. Finally, we argue that languages are information encoding systems shaped by the varying needs of their speakers. Language evolution and change should be modeled as the co-evolution of multiple intertwined adaptive systems: On one hand, the structure of human societies and human learning capabilities, and on the other, the structure of language. PMID:26083380

  8. Claps and Claptrap: The Analysis of Speaker-Audience Interaction in Political Speeches

    Directory of Open Access Journals (Sweden)

    Peter Bull

    2016-07-01

    Full Text Available Significant insights have been gained into how politicians interact with live audiences through the detailed microanalysis of video and audio recordings, especially of rhetorical techniques used by politicians to invite applause. The overall aim of this paper is to propose a new theoretical model of speaker-audience interaction in set-piece political speeches, based on the concept of dialogue between speaker and audience. Research is reviewed not only on applause, but also on other audience responses, such as laughter, cheering, chanting, and booing. Research is also reviewed on other factors besides rhetorical devices, in particular, delivery, speech content, and uninvited applause. Although these analyses are based primarily on British speeches, they also include recent studies of speeches delivered in both Japan and the USA. This cross-cultural perspective, it is proposed, provides significant insights into the role of political rhetoric in speaker-audience interaction, which may be usefully conceptualized in terms of broader cross-cultural differences between collectivist and individualist societies.

  9. Reading emotion cues: social communication difficulties in pediatric populations.

    Science.gov (United States)

    Timler, Geralyn R

    2003-05-01

    Speech-language pathologists frequently address social communication difficulties in children with diverse linguistic profiles. Of consequence to effective management of social communication skills is that some children with language disorders may also have difficulty understanding emotional cues. The ability to recognize and comprehend the emotional meaning of messages is accomplished through integration of linguistic cues (e.g., what the speaker says), nonlinguistic cues (e.g., the speaker's facial expressions), and situational cues (e.g., predicting how the speaker is likely to feel about the particular topic). This article explores children's comprehension of emotion as expressed through facial expressions and situational cues. First, development of emotion comprehension in children with normal development is summarized. This is followed by a brief review of studies investigating emotion comprehension in clinical populations. Suggestions for assessment and intervention of children's emotion comprehension skills are presented.

  10. Context and Linguistic Skills Factors Affecting the Pronunciation of Arabic Proper Names in Speakers of Bahdini Kurdish

    Directory of Open Access Journals (Sweden)

    Dilgash Mohammed Shareef

    2017-12-01

    Full Text Available The purpose of this research is to explore the pronunciation of Arabic proper names by Bahdini Kurds living in Duhok and the districts around. Thirty-two respondents were selected to say fifty Arabic proper names commonly used by Kurdish today’s society. The variables identified are linguistic skills in Kurdish and Arabic, and the extent of the formality of the context. The study concluded that being a fluent speaker of Arabic gives a Bahdini Kurdish speaker the ability to pronounce the Arabic proper names in a native-like accent. Yet, those speakers have revealed a tendency towards the use of a Kurdish pronunciation of such names when the context was informal. Dissimilarly, the Kurdish pronunciation was regularly used by speakers skilled or unskilled in Kurdish language and linguistics in both formal and informal situations.

  11. Segmentation and accuracy-based scores for the automatic assessment of oral proficiency for proficient L2 speakers

    CSIR Research Space (South Africa)

    De Wet, Febe

    2010-11-01

    Full Text Available This paper reports on the automatic assessment of oral proficiency for advanced second language speakers. A spoken dialogue system is used to guide students through an oral test and to record their answers. Indicators of oral proficiency...

  12. Speakers comfort and voice use in different environments and babble-noise. What are the effects on effort and cognition?

    DEFF Research Database (Denmark)

    Lyberg-Åhlander, Viveka; von Lochow, Heike; Brunskog, Jonas

    2017-01-01

    the speaker-comfort of an environment. Teachers with voice problems use the room differently than their voice-healthy controls. The aim of this study was to investigate what vocal changes speakers do in different acoustical environments and noise conditions. Nine female speakers, voice patients, and voice...... freely for 3-5 min in three noise conditions in each setting: silence, classroom noise (60 dBA), and day-care noise (75 dBA). Questionnaires on effort needed were completed by speakers and listeners. There was a co-play between the rooms and the subjectively assessed vocal- and listening effort and also...... a correlation to cognitive aspects. Listener assessments and the data from the voice accumulator will be presented. This knowledge may contribute to the area of classroom acoustics and speakers’ comfort in general....

  13. The acquisition of English prepositions in first language speakers of Northern Sotho and Afrikaans : a cognitive linguistic study

    OpenAIRE

    2013-01-01

    M.A. (Applied Linguistics) This study seeks to understand the acquisition of English prepositions by second language (L2) speakers of English. Prepositions are notoriously difficult linguistic items to acquire, and the reasons for this are manifold. This study looks into the relationship between the linguistic similarities between prepositions in the first language (L1) and the English prepositions in, on, to and into. The study focuses on two particular groups of L1 speakers: (i) Northern...

  14. The Impact of Gender Difference on Effective Communication Between Learners of English Language and Native Speakers of English

    OpenAIRE

    KOÇOĞLU, ZEYNEP

    2014-01-01

    This study investigated the influence of gender on English learners' production of communication strategies while interacting with native speakers of English. Ten Turkish EFL students were paired with 10 native speakers of English, producing a total of 20 (10-minute long) conversations. Data collection procedures consisted of a background questionnaire, observation, a post-session questionnaire and an interview. A modified version of Faerch and Kasper's (1983) framework of communica...

  15. Thermal Stresses Analysis and Optimized TTP Processes to Achieved CNT-Based Diaphragm for Thin Panel Speakers

    Directory of Open Access Journals (Sweden)

    Feng-Min Lai

    2016-01-01

    Full Text Available Industrial companies popularly used the powder coating, classing, and thermal transfer printing (TTP technique to avoid oxidation on the metallic surface and stiffened speaker diaphragm. This study developed a TTP technique to fabricate a carbon nanotubes (CNTs stiffened speaker diaphragm for thin panel speaker. The self-developed TTP stiffening technique did not require a high curing temperature that decreased the mechanical property of CNTs. In addition to increasing the stiffness of diaphragm substrate, this technique alleviated the middle and high frequency attenuation associated with the smoothing sound pressure curve of thin panel speaker. The advantage of TTP technique is less harmful to the ecology, but it causes thermal residual stresses and some unstable connections between printed plates. Thus, this study used the numerical analysis software (ANSYS to analyze the stress and thermal of work piece which have not delaminated problems in transfer interface. The Taguchi quality engineering method was applied to identify the optimal manufacturing parameters. Finally, the optimal manufacturing parameters were employed to fabricate a CNT-based diaphragm, which was then assembled onto a speaker. The result indicated that the CNT-based diaphragm improved the sound pressure curve smoothness of the speaker, which produced a minimum high frequency dip difference (ΔdB value.

  16. Student perceptions of native and non-native speaker language instructors: A comparison of ESL and Spanish

    Directory of Open Access Journals (Sweden)

    Laura Callahan

    2006-12-01

    Full Text Available The question of the native vs. non-native speaker status of second and foreign language instructors has been investigated chiefly from the perspective of the teacher. Anecdotal evidence suggests that students have strong opinions on the relative qualities of instruction by native and non-native speakers. Most research focuses on students of English as a foreign or second language. This paper reports on data gathered through a questionnaire administered to 55 university students: 31 students of Spanish as FL and 24 students of English as SL. Qualitative results show what strengths students believe each type of instructor has, and quantitative results confirm that any gap students may perceive between the abilities of native and non-native instructors is not so wide as one might expect based on popular notions of the issue. ESL students showed a stronger preference for native-speaker instructors overall, and were at variance with the SFL students' ratings of native-speaker instructors' performance on a number of aspects. There was a significant correlation in both groups between having a family member who is a native speaker of the target language and student preference for and self-identification with a native speaker as instructor. (English text

  17. The Effect of Noise on Relationships Between Speech Intelligibility and Self-Reported Communication Measures in Tracheoesophageal Speakers.

    Science.gov (United States)

    Eadie, Tanya L; Otero, Devon Sawin; Bolt, Susan; Kapsner-Smith, Mara; Sullivan, Jessica R

    2016-08-01

    The purpose of this study was to examine how sentence intelligibility relates to self-reported communication in tracheoesophageal speakers when speech intelligibility is measured in quiet and noise. Twenty-four tracheoesophageal speakers who were at least 1 year postlaryngectomy provided audio recordings of 5 sentences from the Sentence Intelligibility Test. Speakers also completed self-reported measures of communication-the Voice Handicap Index-10 and the Communicative Participation Item Bank short form. Speech recordings were presented to 2 groups of inexperienced listeners who heard sentences in quiet or noise. Listeners transcribed the sentences to yield speech intelligibility scores. Very weak relationships were found between intelligibility in quiet and measures of voice handicap and communicative participation. Slightly stronger, but still weak and nonsignificant, relationships were observed between measures of intelligibility in noise and both self-reported measures. However, 12 speakers who were more than 65% intelligible in noise showed strong and statistically significant relationships with both self-reported measures (R2 = .76-.79). Speech intelligibility in quiet is a weak predictor of self-reported communication measures in tracheoesophageal speakers. Speech intelligibility in noise may be a better metric of self-reported communicative function for speakers who demonstrate higher speech intelligibility in noise.

  18. Theory of mind and context processing in schizophrenia: the role of social knowledge

    Directory of Open Access Journals (Sweden)

    Maud eChampagne-Lavau

    2015-07-01

    Full Text Available The present study sought to determine whether social knowledge such as speaker occupation stereotypes may impact theory of mind (ToM ability in patients with schizophrenia. Thirty individuals with schizophrenia (SZ and 30 matched healthy control (HC participants were tested individually on their ToM ability using a paradigm showing that stereotypes such as speaker occupation influences the extent to which speaker ironic intent is understood. ToM ability was assessed with open questions on the speaker ironic intent, irony rating and mockery rating. Social perception was also assessed through politeness rating. The main results showed that SZ participants, like HC participants, were sensitive to the social stereotypes. They used these stereotypes adequately to attribute mental states such as speaker ironic intent to a protagonist while they found it difficult to explicitly judge and attribute negative attitude and emotion, as evidenced by mockery rating. No difference was found between the two groups regarding social perception ability. These performances were not associated with clinical symptoms. The integration of contextual information seems to be a good target for cognitive remediation aiming to increase social cognition ability.

  19. Multilingualism and Social Inclusion

    DEFF Research Database (Denmark)

    2017-01-01

    This is a thematic issue on the relation between multilingualism and social inclusion. Due to globalization, Europeanization, supranational and transnational regulations linguistic diversity and multilingualism are on the rise. Migration and old and new forms of mobility play an important role...... into practice. This lead to forced linguistic assimilation and the elimination of cultural and linguistic heritage. Now, in the present age of globalization, linguistic diversity at the national state level has been recognized and multilingual states have been developing where all types of languages can be used...... in governance and daily life protected by a legal framework. This does not mean that there is full equality of languages. This carries over to the fair and just social inclusion of the speakers of these weaker, dominated languages as well. There is always a power question related to multilingualism. The ten...

  20. Multilingualism and Social Identity: The Case of Singapore.

    Science.gov (United States)

    Kamwangamalu, Knonko M.

    1993-01-01

    A questionnaire surveyed how multilingual speakers in Singapore express their social identities through language, how they relate to their language, and how they perceive the various English accents to which they are exposed. Respondents related more to the local Singapore accent than to British or American English. Implications for the…

  1. Informal Language Learning Setting: Technology or Social Interaction?

    Science.gov (United States)

    Bahrani, Taher; Sim, Tam Shu

    2012-01-01

    Based on the informal language learning theory, language learning can occur outside the classroom setting unconsciously and incidentally through interaction with the native speakers or exposure to authentic language input through technology. However, an EFL context lacks the social interaction which naturally occurs in an ESL context. To explore…

  2. Meeting the Social Needs of Preschool English Learners

    Science.gov (United States)

    Luchtel, Molly; Hughes, Kere; Luze, Gayle; Bruna, Katherine Richardson; Peterson, Carla

    2010-01-01

    This article summarizes the findings of a study that examined the differences between preschool English learners and preschool English speakers in the areas of classroom conduct, social skills, and teacher-child relationship quality, as rated by their teachers. Theoretical and practical implications of the findings for early care and education…

  3. Normalizing Social Networking in a Beginners' Japanese Course

    Science.gov (United States)

    Morofushi, Mari; Pasfield-Neofitou, Sarah Ellen

    2014-01-01

    With the spread of the Internet, students now have greater opportunities to use Japanese outside of the classroom. For example, they can interact with other Japanese speakers through instant messaging or social networking, or utilize online dictionaries and translation tools to decipher websites in ways that would be impossible with traditional…

  4. Occurrence Frequencies of Acoustic Patterns of Vocal Fry in American English Speakers.

    Science.gov (United States)

    Abdelli-Beruh, Nassima B; Drugman, Thomas; Red Owl, R H

    2016-11-01

    The goal of this study was to analyze the occurrence frequencies of three individual acoustic patterns (A, B, C) and of vocal fry overall (A + B + C) as a function of gender, word position in the sentence (Not Last Word vs. Last Word), and sentence length (number of words in a sentence). This is an experimental design. Twenty-five male and 29 female American English (AE) speakers read the Grandfather Passage. The recordings were processed by a Matlab toolbox designed for the analysis and detection of creaky segments, automatically identified using the Kane-Drugman algorithm. The experiment produced subsamples of outcomes, three that reflect a single, discrete acoustic pattern (A, B, or C) and the fourth that reflects the occurrence frequency counts of Vocal Fry Overall without regard to any specific pattern. Zero-truncated Poisson regression analyses were conducted with Gender and Word Position as predictors and Sentence Length as a covariate. The results of the present study showed that the occurrence frequencies of the three acoustic patterns and vocal fry overall (A + B + C) are greatest at the end of sentences but are unaffected by sentence length. The findings also reveal that AE female speakers exhibit Pattern C significantly more frequently than Pattern B, and the converse holds for AE male speakers. Future studies are needed to confirm such outcomes, assess the perceptual salience of these acoustic patterns, and determine the physiological correlates of these acoustic patterns. The findings have implications for the design of new excitation models of vocal fry. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  5. Lexical and grammatical development in trilingual speakers of isiXhosa, English and Afrikaans

    Directory of Open Access Journals (Sweden)

    Anneke P. Potgieter

    2016-05-01

    Full Text Available Background: There is a dearth of normative data on linguistic development among child speakers of Southern African languages, especially in the case of the multilingual children who constitute the largest part of this population. This inevitably impacts on the accuracy of developmental assessments of such speakers. Already negative lay opinion on the effect of early multilingualism on language development rates could be exacerbated by the lack of developmental data, ultimately affecting choices regarding home and school language policies. Objectives: To establish whether trilinguals necessarily exhibit developmental delay when compared to monolinguals and, if so, whether this delay (1 occurs in terms of both lexical and grammatical development; and (2 in all three the trilinguals’ languages, regardless of input quantity. Method: Focusing on isiXhosa, South African English and Afrikaans, the study involved a comparison of 11 four-year-old developing trilinguals’ acquisition of vocabulary and passive constructions with that of 10 age-matched monolingual speakers of each language. Results: The trilinguals proved to be monolingual-like in their lexical development in the language to which, on average, they had been exposed most over time, that is, isiXhosa. No developmental delay was found in the trilinguals’ acquisition of passive constructions, regardless of the language of testing. Conclusion: As previously found for bilingual development, necessarily reduced quantity of exposure does not hinder lexical development in the trilinguals’ input dominant language. The overall lack of delay in their acquisition of the passive is interpreted as possible evidence of cross-linguistic bootstrapping and support for early multilingual exposure.

  6. Lexical and grammatical development in trilingual speakers of isiXhosa, English and Afrikaans

    Science.gov (United States)

    2016-01-01

    Background There is a dearth of normative data on linguistic development among child speakers of Southern African languages, especially in the case of the multilingual children who constitute the largest part of this population. This inevitably impacts on the accuracy of developmental assessments of such speakers. Already negative lay opinion on the effect of early multilingualism on language development rates could be exacerbated by the lack of developmental data, ultimately affecting choices regarding home and school language policies. Objectives To establish whether trilinguals necessarily exhibit developmental delay when compared to monolinguals and, if so, whether this delay (1) occurs in terms of both lexical and grammatical development; and (2) in all three the trilinguals’ languages, regardless of input quantity. Method Focusing on isiXhosa, South African English and Afrikaans, the study involved a comparison of 11 four-year-old developing trilinguals’ acquisition of vocabulary and passive constructions with that of 10 age-matched monolingual speakers of each language. Results The trilinguals proved to be monolingual-like in their lexical development in the language to which, on average, they had been exposed most over time, that is, isiXhosa. No developmental delay was found in the trilinguals’ acquisition of passive constructions, regardless of the language of testing. Conclusion As previously found for bilingual development, necessarily reduced quantity of exposure does not hinder lexical development in the trilinguals’ input dominant language. The overall lack of delay in their acquisition of the passive is interpreted as possible evidence of cross-linguistic bootstrapping and support for early multilingual exposure. PMID:27245133

  7. An Investigation of Refusal Strategies as Used by Bahdini Kurdish and Syriac Aramaic Speakers

    Directory of Open Access Journals (Sweden)

    Dilgash M.Shareef

    2018-02-01

    Full Text Available For the purpose of achieving a successful communication, issues such as the appropriateness of speech acts and face saving become essential. Therefore, it is very important to achieve a high level of pragmatic competence in speech acts. Bearing this in mind, this study was conducted to investigate the preferred refusal strategies Kurdish and Syriac native speakers use when faced with offers and requests from equal status interlocutors.  The current study has used a modified Written Discourse Completion Test (WDCT consisting of six situations (three of which elicit refusals to offerings and the other three to requests. Forty subjects participated in this study: 20 native speakers of the Kurdish language (10 male and 10 female students and 20 native speakers of Syriac language (10 male and 10 female students. All participants are currently pre-graduate students attending Zakho University. The participants were asked to provide written data that express their refusals to these situations. The data collected have then been analyzed descriptively according to frequency and number of occurrences of semantic formulas used by Beebe et al (1990.  The results showed that a the Syriac Native Participants (SNPs frequently preferred indirect and adjunct strategies for refusals rather than direct ones, b the Kurdish Native Participants (KNPs often preferred direct and indirect strategies more than adjunct ones, c the results also revealed that gender has a great influence on the use of refusal strategies in various ways. Finally, this study concludes that both KNPs and SNPs tended to use more strategies when refusing requests than offers whereas gender has shown to play a significant role in the choice and number of the refusal strategies used by both groups of participants.

  8. Extended turn construction and test question sequences in the conversations of three speakers with agrammatic aphasia.

    Science.gov (United States)

    Beeke, Suzanne; Beckley, Firle; Best, Wendy; Johnson, Fiona; Edwards, Susan; Maxim, Jane

    2013-01-01

    The application of Conversation Analysis (CA) to the investigation of agrammatic aphasia reveals that utterances produced by speakers with agrammatism engaged in everyday conversation differ significantly from utterances produced in response to decontextualised assessment and therapy tasks. Early studies have demonstrated that speakers with agrammatism construct turns from sequences of nouns, adjectives, discourse markers and conjunctions, packaged by a distinct pattern of prosody. This article presents examples of turn construction methods deployed by three people with agrammatism as they take an extended turn, in order to recount a past event, initiate a discussion or have a disagreement. This is followed by examples of sequences occurring in the talk of two of these speakers that result in different, and more limited, turn construction opportunities, namely "test" questions asked in order to initiate a new topic of talk, despite the conversation partner knowing the answer. The contrast between extended turns and test question sequences illustrates the effect of interactional context on aphasic turn construction practices, and the potential of less than optimal sequences to mask turn construction skills. It is suggested that the interactional motivation for test question sequences in these data are to invite people with aphasia to contribute to conversation, rather than to practise saying words in an attempt to improve language skills. The idea that test question sequences may have their origins in early attempts to deal with acute aphasia, and the potential for conversation partnerships to become "stuck" in such interactional patterns after they may have outlived their usefulness, are discussed with a view to clinical implications.

  9. Time-Contrastive Learning Based DNN Bottleneck Features for Text-Dependent Speaker Verification

    DEFF Research Database (Denmark)

    Sarkar, Achintya Kumar; Tan, Zheng-Hua

    2017-01-01

    In this paper, we present a time-contrastive learning (TCL) based bottleneck (BN) feature extraction method for speech signals with an application to text-dependent (TD) speaker verification (SV). It is well-known that speech signals exhibit quasi-stationary behavior in and only in a short interval......, and the TCL method aims to exploit this temporal structure. More specifically, it trains deep neural networks (DNNs) to discriminate temporal events obtained by uniformly segmenting speech signals, in contrast to existing DNN based BN feature extraction methods that train DNNs using labeled data...

  10. Parametric Representation of the Speaker's Lips for Multimodal Sign Language and Speech Recognition

    Science.gov (United States)

    Ryumin, D.; Karpov, A. A.

    2017-05-01

    In this article, we propose a new method for parametric representation of human's lips region. The functional diagram of the method is described and implementation details with the explanation of its key stages and features are given. The results of automatic detection of the regions of interest are illustrated. A speed of the method work using several computers with different performances is reported. This universal method allows applying parametrical representation of the speaker's lipsfor the tasks of biometrics, computer vision, machine learning, and automatic recognition of face, elements of sign languages, and audio-visual speech, including lip-reading.

  11. An automatic speech recognition system with speaker-independent identification support

    Science.gov (United States)

    Caranica, Alexandru; Burileanu, Corneliu

    2015-02-01

    The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.

  12. Further optimisations of constant Q cepstral processing for integrated utterance and text-dependent speaker verification

    DEFF Research Database (Denmark)

    Delgado, Hector; Todisco, Massimiliano; Sahidullah, Md

    2016-01-01

    performance, but also provide an ancillary level of security. This can take the form of explicit utterance verification (UV). An integrated UV + ASV system should then verify access attempts which contain not just the expected speaker, but also the expected text content. This paper presents such a system...... and introduces new features which are used for both UV and ASV tasks. Based upon multi-resolution, spectro-temporal analysis and when fused with more traditional parameterisations, the new features not only generally outperform Mel-frequency cepstral coefficients, but also are shown to be complementary when...

  13. 2012 Gordon Research Conference on Microbial Stress Response, Schedule and Speaker/Poster Program

    Energy Technology Data Exchange (ETDEWEB)

    Donohue, Timothy J. [Univ. of Wisconsin, Madison, WI (United States)

    2012-07-20

    The Gordon Research Conference on Microbial Stress Response was held at Mount Holyoke College, South Hadley, Massachusetts, July 15-20, 2012. The Conference was well-attended with 180 participants. The 2012 Microbial Stress Responses Gordon Research Conference will provide a forum for the open reporting of recent discoveries on the diverse mechanisms employed by microbes to respond to stress. Approaches range from analysis at the molecular level (how are signals perceived and transmitted to change gene expression or function) to cellular and microbial community responses. Attached is a copy of the formal schedule and speaker program and the poster program.

  14. Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data

    DEFF Research Database (Denmark)

    Sarkar, Achintya Kumar; Sahidullah, Md; Tan, Zheng-Hua

    2017-01-01

    Automatic speaker verification (ASV) systems are vulnerable to spoofing attacks using speech generated by voice conversion and speech synthesis techniques. Commonly, a countermeasure (CM) system is integrated with an ASV system for improved protection against spoofing attacks. But integration...... of the two systems is challenging and often leads to increased false rejection rates. Furthermore, the performance of CM severely degrades if in-domain development data are unavailable. In this study, therefore, we propose a solution that uses two separate background models – one from human speech...

  15. Objective measures to improve the selection of training speakers in HMM-based child speech synthesis

    CSIR Research Space (South Africa)

    Govender, Avashna

    2016-12-01

    Full Text Available to those of female voices than male voices. Therefore, for the remainder of the study, only the MCD measures were used. TABLE III: Objective measures calculated for speaker inde- pendent models Voice Average MCD [db] Average RMSE logF0 [cent] bdl rms 2....11 68 bdl clb 2.22 31 bdl slt 2.36 36 rms clb 1.79 45 rms slt 2.01 47 clb slt 2.05 16 Since [9] showed that the best initial model for child speech synthesis is a gender-independent model, the hypothesis tested in this study was that the closest female...

  16. Social class, social capital, social practice and language in British sociolinguistics

    DEFF Research Database (Denmark)

    Fabricius, Anne H.

    2017-01-01

    Social class has recently re-emerged strongly within academic sociology in the UK, and I argue in this paper that sociolinguists benefit from an awareness of these currents in our work with speakers and communities in the UK setting. The discussion will elaborate on the approaches to social class...... ideological construct within British society all have ramifications for the resonance of social class in sociolinguistics and real-time corpus work. I will look at several research traditions of social class analysis and examine their potential contributions to sociolinguistic research. The importance of fine...... analysis and coding that I and others have pursued in studies of variation and change in several contexts, including the study of the elite/establishment sociolect commonly known as RP. Changing socio-economic and political circumstances as well as the changing nature of ‘class’ as an enregistered...

  17. L2 speakers decompose morphologically complex verbs: fMRI evidence from priming of transparent derived verbs.

    Science.gov (United States)

    De Grauwe, Sophie; Lemhöfer, Kristin; Willems, Roel M; Schriefers, Herbert

    2014-01-01

    In this functional magnetic resonance imaging (fMRI) long-lag priming study, we investigated the processing of Dutch semantically transparent, derived prefix verbs. In such words, the meaning of the word as a whole can be deduced from the meanings of its parts, e.g., wegleggen "put aside." Many behavioral and some fMRI studies suggest that native (L1) speakers decompose transparent derived words. The brain region usually implicated in morphological decomposition is the left inferior frontal gyrus (LIFG). In non-native (L2) speakers, the processing of transparent derived words has hardly been investigated, especially in fMRI studies, and results are contradictory: some studies find more reliance on holistic (i.e., non-decompositional) processing by L2 speakers; some find no difference between L1 and L2 speakers. In this study, we wanted to find out whether Dutch transparent derived prefix verbs are decomposed or processed holistically by German L2 speakers of Dutch. Half of the derived verbs (e.g., omvallen "fall down") were preceded by their stem (e.g., vallen "fall") with a lag of 4-6 words ("primed"); the other half (e.g., inslapen "fall asleep") were not ("unprimed"). L1 and L2 speakers of Dutch made lexical decisions on these visually presented verbs. Both region of interest analyses and whole-brain analyses showed that there was a significant repetition suppression effect for primed compared to unprimed derived verbs in the LIFG. This was true both for the analyses over L2 speakers only and for the analyses over the two language groups together. The latter did not reveal any interaction with language group (L1 vs. L2) in the LIFG. Thus, L2 speakers show a clear priming effect in the LIFG, an area that has been associated with morphological decomposition. Our findings are consistent with the idea that L2 speakers engage in decomposition of transparent derived verbs rather than processing them holistically.

  18. L2 speakers decompose morphologically complex verbs: fMRI evidence from priming of transparent derived verbs

    Directory of Open Access Journals (Sweden)

    Sophie eDe Grauwe

    2014-10-01

    Full Text Available In this fMRI long-lag priming study, we investigated the processing of Dutch semantically transparent, derived prefix verbs. In such words, the meaning of the word as a whole can be deduced from the meanings of its parts, e.g. wegleggen ‘put aside’. Many behavioral and some fMRI studies suggest that native (L1 speakers decompose transparent derived words. The brain region usually implicated in morphological decomposition is the left inferior frontal gyrus (LIFG. In non-native (L2 speakers, the processing of transparent derived words has hardly been investigated, especially in fMRI studies, and results are contradictory: Some studies find more reliance on holistic (i.e. non-decompositional processing by L2 speakers; some find no difference between L1 and L2 speakers. In this study, we wanted to find out whether Dutch transparent derived prefix verbs are decomposed or processed holistically by German L2 speakers of Dutch. Half of the derived verbs (e.g. omvallen ‘fall down’ were preceded by their stem (e.g. vallen ‘fall’ with a lag of 4 to 6 words (‘primed’; the other half (e.g. inslapen ‘fall asleep’ were not (‘unprimed’. L1 and L2 speakers of Dutch made lexical decisions on these visually presented verbs. Both ROI analyses and whole-brain analyses showed that there was a significant repetition suppression effect for primed compared to unprimed derived verbs in the LIFG. This was true both for the analyses over L2 speakers only and for the analyses over the two language groups together. The latter did not reveal any interaction with language group (L1 vs. L2 in the LIFG. Thus, L2 speakers show a clear priming effect in the LIFG, an area that has been associated with morphological decomposition. Our findings are consistent with the idea that L2 speakers engage in decomposition of transparent derived verbs rather than processing them holistically.

  19. Looking at the evidence in visual world: eye-movements reveal how bilingual and monolingual Turkish speakers process grammatical evidentiality

    Science.gov (United States)

    Arslan, Seçkin; Bastiaanse, Roelien; Felser, Claudia

    2015-01-01

    This study presents pioneering data on how adult early bilinguals (heritage speakers) and late bilingual speakers of Turkish and German process grammatical evidentiality in a visual world setting in comparison to monolingual speakers of Turkish. Turkish marks evidentiality, the linguistic reference to information source, through inflectional affixes signaling either direct (-DI) or indirect (-mIş) evidentiality. We conducted an eye-tracking-during-listening experiment where participants were given access to visual ‘evidence’ supporting the use of either a direct or indirect evidential form. The behavioral results indicate that the monolingual Turkish speakers comprehended direct and indirect evidential scenarios equally well. In contrast, both late and early bilinguals were less accurate and slower to respond to direct than to indirect evidentials. The behavioral results were also reflected in the proportions of looks data. That is, both late and early bilinguals fixated less frequently on the target picture in the direct than in the indirect evidential condition while the monolinguals showed no difference between these conditions. Taken together, our results indicate reduced sensitivity to the semantic and pragmatic function of direct evidential forms in both late and early bilingual speakers, suggesting a simplification of the Turkish evidentiality system in Turkish heritage grammars. We discuss our findings with regard to theories of incomplete acquisition and first language attrition. PMID:26441762

  20. Language contact phenomena in the language use of speakers of German descent and the significance of their language attitudes

    Directory of Open Access Journals (Sweden)

    Ries, Veronika

    2014-03-01

    Full Text Available Within the scope of my investigation on language use and language attitudes of People of German Descent from the USSR, I find almost regular different language contact phenomena, such as viel bliny habn=wir gbackt (engl.: 'we cooked lots of pancakes' (cf. Ries 2011. The aim of analysis is to examine both language use with regard to different forms of language contact and the language attitudes of the observed speakers. To be able to analyse both of these aspects and synthesize them, different types of data are required. The research is based on the following two data types: everyday conversations and interviews. In addition, the individual speakers' biography is a key part of the analysis, because it allows one to draw conclusions about language attitudes and use. This qualitative research is based on morpho-syntactic and interactional linguistic analysis of authentic spoken data. The data arise from a corpus compiled and edited by myself. My being a member of the examined group allowed me to build up an authentic corpus. The natural language use is analysed from the perspective of different language contact phenomena and potential functions of language alternations. One central issue is: How do speakers use the languages available to them, German and Russian? Structural characteristics such as code switching and discursive motives for these phenomena are discussed as results, together with the socio-cultural background of the individual speaker. Within the scope of this article I present exemplarily the data and results of one speaker.

  1. Revisiting the role of language in spatial cognition: Categorical perception of spatial relations in English and Korean speakers.

    Science.gov (United States)

    Holmes, Kevin J; Moty, Kelsey; Regier, Terry

    2017-12-01

    The spatial relation of support has been regarded as universally privileged in nonlinguistic cognition and immune to the influence of language. English, but not Korean, obligatorily distinguishes support from nonsupport via basic spatial terms. Despite this linguistic difference, previous research suggests that English and Korean speakers show comparable nonlinguistic sensitivity to the support/nonsupport distinction. Here, using a paradigm previously found to elicit cross-language differences in color discrimination, we provide evidence for a difference in sensitivity to support/nonsupport between native English speakers and native Korean speakers who were late English learners and tested in a context that privileged Korean. Whereas the former group showed categorical perception (CP) when discriminating spatial scenes capturing the support/nonsupport distinction, the latter did not. An additional group of native Korean speakers-relatively early English learners tested in an English-salient context-patterned with the native English speakers in showing CP for support/nonsupport. These findings suggest that obligatory marking of support/nonsupport in one's native language can affect nonlinguistic sensitivity to this distinction, contra earlier findings, but that such sensitivity may also depend on aspects of language background and the immediate linguistic context.

  2. The Structure of Clinical Consultation: A Case of Non-Native Speakers of English as Participants

    Science.gov (United States)

    Bagheri, H.; Ibrahim, N. A.; Habil, H.

    2015-01-01

    Background: In many parts of the world, patients may find it difficult to visit doctors who share the same language and culture due to the intermingling of people and international recruitment of doctors among many other reasons. In these multilingual multicultural settings (MMSs), doctor-patient interactions face new communication challenges. This study aims to identify the structure of clinical consultation and its phases in an MMS where both doctors and patients are non-native speakers (NNSs) of English. Method: This study takes on a discourse analytic approach to examine the structure of clinical consultation as an activity type. 25 clinical consultation sessions between non-native speakers of English in a public healthcare centre in Malaysia were audio-recorded. Findings and Discussion: The results show that there are some deviations from the mainstream structure of clinical consultations although, in general, the pattern is compatible with previous studies. Deviations are particularly marked in the opening and closing phases of consultation. Conclusion: In almost all interactions, there is a straightforward manner of beginning medical consultations. The absence of greetings may have naturally reduced the length of talk. Hence, by directly entering medical talks, the doctors voice their concern on the curing aspects of the consultation rather than its caring facets. The preference of curing priority to caring is more goal-oriented and in alignment with the consultation as an activity type. PMID:25560336

  3. Shaping your Identity as a Speaker of English: The Struggles of a Beginner Language Learner

    Directory of Open Access Journals (Sweden)

    Janeth María Ortiz Medina

    2017-08-01

    Full Text Available In Colombia, political and educational campaigns have promoted English as the language of success and as a fundamental requirement to comply with the demands of a globalized world; nevertheless, little attention has been paid to the individual experiences of Colombians as they learn this language or to what happens to their identities during this process. This paper reports on a study that explored how three young adult learners of English constructed their identities as speakers of English through their positioning in oral tasks in an English class. It focuses on the story of one of the participants: a beginner female English learner. Data collection methods included class video and audio-recordings, interviews, and diaries kept by the participants. Findings indicated that learners’ positioning in spoken interactions not only directly affected their use of English, but also led them to non-linear transformations of their identities as speakers of English. These results pinpoint the need to gain awareness of the centrality of students’ identities in both their learning process and in language teaching practices and the need to observe how positioning operates in the language classroom.

  4. [Understanding the symbolic values of Japanese onomatopoeia: comparison of Japanese and Chinese speakers].

    Science.gov (United States)

    Haryu, Etsuko; Zhao, Lihua

    2007-10-01

    Do non-native speakers of the Japanese language understand the symbolic values of Japanese onomatopoeia matching a voiced/unvoiced consonant with a big/small sound made by a big/small object? In three experiments, participants who were native speakers of Japanese, Japanese-learning Chinese, or Chinese without knowledge of the Japanese language were shown two pictures. One picture was of a small object making a small sound, such as a small vase being broken, and the other was of a big object making a big sound, such as a big vase being broken. Participants were presented with two novel onomatopoetic words with voicing contrasts, e.g.,/dachan/vs./tachan/, and were told that each word corresponded to one of the two pictures. They were then asked to match the words to the corresponding pictures. Chinese without knowledge of Japanese performed only at chance level, whereas Japanese and Japanese-learning Chinese successfully matched a voiced/unvoiced consonant with a big/small object respectively. The results suggest that the key to understanding the symbolic values of voicing contrasts in Japanese onomatopoeia is some basic knowledge that is intrinsic to the Japanese language.

  5. Numerical investigation on vibration characteristics of a micro-speaker diaphragm considering thermoforming effects

    International Nuclear Information System (INIS)

    Kim, Kyeong Min; Park, Ke Un

    2013-01-01

    Micro-speaker diaphragms play an important role in generating desired sound responses, and are designed to have thin membrane shapes for flexibility in the axial direction. The micro-speaker diaphragms are formed from thin polymer film through the thermoforming process, in which local thickness reductions occur due to strain localization. This thickness reduction results in a change in vibration characteristics of the diaphragm and different sound responses from that of the original design. In this study, the effect of this thickness change in the diaphragm on its vibration characteristics is numerically investigated by coupling thermoforming simulation, structural analysis and modal analysis. Thus, the thickness change in the diaphragm is calculated from the thermoforming simulation, and reflected in the further structural and modal analyses in order to estimate the relevant stiffness and vibration modes. Comparing these simulation results with those from a diaphragm with the uniform thickness, it is found that a local thickness reduction results in the stiffness reduction and the relevant change in the natural frequencies and the corresponding vibration modes.

  6. The relation between working memory and language comprehension in signers and speakers.

    Science.gov (United States)

    Emmorey, Karen; Giezen, Marcel R; Petrich, Jennifer A F; Spurgeon, Erin; O'Grady Farnady, Lucinda

    2017-06-01

    This study investigated the relation between linguistic and spatial working memory (WM) resources and language comprehension for signed compared to spoken language. Sign languages are both linguistic and visual-spatial, and therefore provide a unique window on modality-specific versus modality-independent contributions of WM resources to language processing. Deaf users of American Sign Language (ASL), hearing monolingual English speakers, and hearing ASL-English bilinguals completed several spatial and linguistic serial recall tasks. Additionally, their comprehension of spatial and non-spatial information in ASL and spoken English narratives was assessed. Results from the linguistic serial recall tasks revealed that the often reported advantage for speakers on linguistic short-term memory tasks does not extend to complex WM tasks with a serial recall component. For English, linguistic WM predicted retention of non-spatial information, and both linguistic and spatial WM predicted retention of spatial information. For ASL, spatial WM predicted retention of spatial (but not non-spatial) information, and linguistic WM did not predict retention of either spatial or non-spatial information. Overall, our findings argue against strong assumptions of independent domain-specific subsystems for the storage and processing of linguistic and spatial information and furthermore suggest a less important role for serial encoding in signed than spoken language comprehension. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Cross-Cultural Analysis of Prototypicality Norms Used by Male and Female Persian and American Speakers.

    Science.gov (United States)

    Biria, Reza; Bahadoran-Baghbaderani, Ali

    2016-12-01

    Human beings generally cut the realities of the world in various ways in order to express their thoughts and interact with other people. Naturally, words are linguistic categories which carry the intended concepts; however, humans tend to economize by bundling words into different conceptual classes. Accordingly, the present study sought to explore the prototypicality norms used by male and female Persian and American speakers adopting a cross cultural analysis. For this purpose, from the existing prototypes, four conceptual categories; namely, vehicles, vegetables, furniture, and clothes were randomly selected and used as the tertium comparationis for determining the way 120 male and female monolingual American and Persian speakers, sixty from each language, ranked the category membership of different conceptual members belonging to the targeted prototypes. The analysis of the obtained data based on participants' ranking of the prototypes along with the respective frequency values revealed that gender and culture played a significant role in identifying the category membership of various members of a given concept across different languages. The findings can have important implications for language teachers, material designers, and second language students since culture is an overriding factor in language learning.

  8. Analysis of error type and frequency in apraxia of speech among Portuguese speakers

    Directory of Open Access Journals (Sweden)

    Maysa Luchesi Cera

    Full Text Available Abstract Most studies characterizing errors in the speech of patients with apraxia involve English language. Objectives: To analyze the types and frequency of errors produced by patients with apraxia of speech whose mother tongue was Brazilian Portuguese. Methods: 20 adults with apraxia of speech caused by stroke were assessed. The types of error committed by patients were analyzed both quantitatively and qualitatively, and frequencies compared. Results: We observed the presence of substitution, omission, trial-and-error, repetition, self-correction, anticipation, addition, reiteration and metathesis, in descending order of frequency, respectively. Omission type errors were one of the most commonly occurring whereas addition errors were infrequent. These findings differed to those reported in English speaking patients, probably owing to differences in the methodologies used for classifying error types; the inclusion of speakers with apraxia secondary to aphasia; and the difference in the structure of Portuguese language to English in terms of syllable onset complexity and effect on motor control. Conclusions: The frequency of omission and addition errors observed differed to the frequency reported for speakers of English.

  9. Teaching Semantic Radicals Facilitates Inferring New Character Meaning in Sentence Reading for Nonnative Chinese Speakers

    Science.gov (United States)

    Nguyen, Thi Phuong; Zhang, Jie; Li, Hong; Wu, Xinchun; Cheng, Yahua

    2017-01-01

    This study investigates the effects of teaching semantic radicals in inferring the meanings of unfamiliar characters among nonnative Chinese speakers. A total of 54 undergraduates majoring in Chinese Language from a university in Hanoi, Vietnam, who had 1 year of learning experience in Chinese were assigned to two experimental groups that received instructional intervention, called “old-for-new” semantic radical teaching, through two counterbalanced sets of semantic radicals, with one control group. All of the students completed pre- and post-tests of a sentence cloze task where they were required to choose an appropriate character that fit the sentence context among four options. The four options shared the same phonetic radicals but had different semantic radicals. The results showed that the pre-test and post-test score increases were significant for the experimental groups, but not for the control group. Most importantly, the experimental groups successfully transferred the semantic radical strategy to figure out the meanings of unfamiliar characters containing semantic radicals that had not been taught. The results demonstrate the effectiveness of teaching semantic radicals for lexical inference in sentence reading for nonnative speakers, and highlight the ability of transfer learning to acquire semantic categories of sub-lexical units (semantic radicals) in Chinese characters among foreign language learners. PMID:29109694

  10. From body motion to cheers: Speakers' body movements as predictors of applause.

    Science.gov (United States)

    Koppensteiner, Markus; Stephan, Pia; Jäschke, Johannes Paul Michael

    2015-02-01

    Appearance cues and brief displays of behavior are related to people's personality, to their performance at work and to the outcomes of elections. Thus, people present themselves to others on different communication channels, while their interaction partners form first impressions on the basis of the displayed cues. In the current study we examined whether people are able to read information from politicians' body motion. For a rating experiment we translated short video clips of politicians giving a speech into animated stick-figures and had these animations rated on trustworthiness, dominance, competence and the Big Five personality dimensions. Afterwards we correlated the ratings with the applause and the hecklings that the speakers received throughout their entire speech. This revealed that speakers whose body movements were perceived as high on dominance, as high on extraversion and as low on agreeableness received more applause. Although the results obtained need support from additional studies they indicate that body motion is an informative cue in real life settings.

  11. Analysis of error type and frequency in apraxia of speech among Portuguese speakers.

    Science.gov (United States)

    Cera, Maysa Luchesi; Minett, Thaís Soares Cianciarullo; Ortiz, Karin Zazo

    2010-01-01

    Most studies characterizing errors in the speech of patients with apraxia involve English language. To analyze the types and frequency of errors produced by patients with apraxia of speech whose mother tongue was Brazilian Portuguese. 20 adults with apraxia of speech caused by stroke were assessed. The types of error committed by patients were analyzed both quantitatively and qualitatively, and frequencies compared. We observed the presence of substitution, omission, trial-and-error, repetition, self-correction, anticipation, addition, reiteration and metathesis, in descending order of frequency, respectively. Omission type errors were one of the most commonly occurring whereas addition errors were infrequent. These findings differed to those reported in English speaking patients, probably owing to differences in the methodologies used for classifying error types; the inclusion of speakers with apraxia secondary to aphasia; and the difference in the structure of Portuguese language to English in terms of syllable onset complexity and effect on motor control. The frequency of omission and addition errors observed differed to the frequency reported for speakers of English.

  12. A multisite community-based health literacy intervention for Spanish speakers.

    Science.gov (United States)

    Soto Mas, F; Cordova, C; Murrietta, A; Jacobson, H E; Ronquillo, F; Helitzer, D

    2015-06-01

    The National Action Plan to Improve Health Literacy emphasizes the importance of community-based opportunities for education, such as English as a second language (ESL) programs. It recommends collaborations among the adult literacy and ESL communities. However, limited attention has been given to researching the effectiveness of community-based interventions that combine ESL and health literacy. The purpose of this study was to explore the feasibility of using different community settings for improving health literacy among adult Spanish speakers through an English language program. The study used a pre-experimental, single arm pretest-posttest design, and implemented the Health Literacy and ESL Curriculum. A collaborative was established between the community and university researchers. Participants were recruited at three distinctive sites. Health literacy was assessed using the Spanish version of the Test of Functional Health Literacy in Adults (TOFHLA). Analysis included descriptive and paired-group t test. Forty-nine participants completed the intervention and post-tests (92% retention rate). Overall--all sites--posttest scores significantly improved for total TOFHLA, raw numeracy, and reading comprehension (p literacy/language instruction to Spanish speaking adults. The study also points to community engagement and ESL programs as two essential components of effective health literacy interventions among Spanish speakers.

  13. On the native/nonnative speaker notion and World Englishes: Debating with K. Rajagopalan

    Directory of Open Access Journals (Sweden)

    John Robert SCHMITZ

    Full Text Available ABSTRACT In a series of three articles published in the Journal of Pragmatics (1995, henceforth JP, the purpose of the papers is to question the division of English spoken in the world into, on one hand, "native" varieties (British English, American English. Australian English and, on the other, "new/nonnative" varieties (Indian English, Singaporean English, Nigerian English. The JP articles are indeed groundbreaking for they mark one of the first interactions among scholars from the East with researchers in the West with regard to the growth and spread of the language as well as the roles English is made to play by its impressive number of users. The privileged position of prestige and power attributed to the inner circle varieties (USA, UK, Canada, Australia and New Zealand is questioned. Rajagopalan (1997, motivated by his reading of the JP papers, adds another dimension to this questioning by pointing to the racial and discriminatory stance underlying the notions "native speaker" and "nonnative speaker" (henceforth, respectively NS and NNS. Rajagopalan has written extensively on the issue of nativity or "nativeness"; over the years, Schmitz has also written on the same topic. There appears, in some cases, to be a number of divergent views with regard to subject on hand on the part of both authors. The purpose of this article is to engage in a respectful debate to uncover misreading and possible misunderstanding on the part of Schmitz. Listening to one another and learning from each another are essential in all academic endeavors.

  14. Communicating with the crowd: speakers use abstract messages when addressing larger audiences.

    Science.gov (United States)

    Joshi, Priyanka D; Wakslak, Cheryl J

    2014-02-01

    Audience characteristics often shape communicators' message framing. Drawing from construal level theory, we suggest that when speaking to many individuals, communicators frame messages in terms of superordinate characteristics that focus attention on the essence of the message. On the other hand, when communicating with a single individual, communicators increasingly describe events and actions in terms of their concrete details. Using different communication tasks and measures of construal, we show that speakers communicating with many individuals, compared with 1 person, describe events more abstractly (Study 1), describe themselves as more trait-like (Study 2), and use more desirability-related persuasive messages (Study 3). Furthermore, speakers' motivation to communicate with their audience moderates their tendency to frame messages based on audience size (Studies 3 and 4). This audience-size abstraction effect is eliminated when a large audience is described as homogeneous, suggesting that people use abstract construal strategically in order to connect across a disparate group of individuals (Study 5). Finally, we show that participants' experienced fluency in communication is influenced by the match between message abstraction and audience size (Study 6).

  15. Foreign language learning in French speakers is associated with rhythm perception, but not with melody perception.

    Science.gov (United States)

    Bhatara, Anjali; Yeung, H Henny; Nazzi, Thierry

    2015-04-01

    There has been increasing interest in links between language and music. Here, we investigate the relation between foreign language learning and music perception. We administered tests measuring melody and rhythm perception as well as a questionnaire on musical and foreign language experience to 147 monolingual French speakers. As expected, we found that musicians had better melody and rhythm perception than nonmusicians and that, among musicians, there was a positive correlation between the total number of years of music training and test scores. Crucially, we also found a positive correlation between the total number of years learning foreign languages and rhythm perception, but we found no such relation with melody perception. Moreover, the degree to which participants were better at rhythm than melody perception was also related to foreign language experience. Results suggest that both music training and learning foreign languages (primarily English, Spanish, and German in our sample) are related to French speakers' perception of rhythm, but not to their perception of melody. These results are discussed with respect to the rhythmic properties of French and suggest a common perceptual basis for rhythm in language and music. (c) 2015 APA, all rights reserved.

  16. "Necesita una vacuna": what Spanish-speakers want in text-message immunization reminders.

    Science.gov (United States)

    Ahlers-Schmidt, Carolyn R; Chesser, Amy; Brannon, Jennifer; Lopez, Venessa; Shah-Haque, Sapna; Williams, Katherine; Hart, Traci

    2013-08-01

    Appointment reminders help parents deal with complex immunization schedules. Preferred content of text-message reminders has been identified for English-speakers. Spanish-speaking parents of children under three years old were recruited to develop Spanish text-message immunization reminders. Structured interviews included questions about demographic characteristics, use of technology, and willingness to receive text reminders. Each participant was assigned to one user-centered design (UCD) test: card sort, needs analysis or comprehension testing. Respondents (N=54) were female (70%) and averaged 27 years of age (SD=7). A card sort of 20 immunization-related statements resulted in identification of seven pieces of critical information, which were compiled into eight example texts. These texts were ranked in the needs assessment and the top two were assessed for comprehension. All participants were able to understand the content and describe intention to act. Utilizing UCD testing, Spanish-speakers identified short, specific text content that differed from preferred content of English-speaking parents.

  17. The relationship between vocabulary and short-term memory measures in monolingual and bilingual speakers

    Science.gov (United States)

    Kaushanskaya, Margarita; Blumenfeld, Henrike K.; Marian, Viorica

    2012-01-01

    Previous studies have indicated that bilingualism may influence the efficiency of lexical access in adults. The goals of this research were (1) to compare bilingual and monolingual adults on their native-language vocabulary performance, and (2) to examine the relationship between short-term memory skills and vocabulary performance in monolinguals and bilinguals. In Experiment 1, English-speaking monolingual adults and simultaneous English–Spanish bilingual adults were administered measures of receptive English vocabulary and of phonological short-term memory. In Experiment 2, monolingual adults were compared to sequential English–Spanish bilinguals, and were administered the same measures as in Experiment 1, as well as a measure of expressive English vocabulary. Analyses revealed comparable levels of performance on the vocabulary and the short-term memory measures in the monolingual and the bilingual groups across both experiments. There was a stronger effect of digit-span in the bilingual group than in the monolingual group, with high-span bilinguals outperforming low-span bilinguals on vocabulary measures. Findings indicate that bilingual speakers may rely on short-term memory resources to support word retrieval in their native language more than monolingual speakers. PMID:22518091

  18. Listener-speaker perceived distance predicts the degree of motor contribution to speech perception.

    Science.gov (United States)

    Bartoli, Eleonora; D'Ausilio, Alessandro; Berry, Jeffrey; Badino, Leonardo; Bever, Thomas; Fadiga, Luciano

    2015-02-01

    Listening speech sounds activates motor and premotor areas in addition to temporal and parietal brain regions. These activations are somatotopically localized according to the effectors recruited in the production of particular phonemes. Previous work demonstrated that transcranial magnetic stimulation (TMS) of speech motor centers somatotopically altered speech perception, suggesting a role for the motor system. However, these effects seemed to occur only under adverse listening conditions, suggesting that degraded speech may stimulate listeners to adopt unnatural neural strategies relying on motor centers. Here, we investigated whether naturally occurring interspeaker variability, which did not affect task difficulty, made a speech discrimination task sensitive to TMS interference. In this paradigm, TMS over tongue and lips motor representations somatotopically altered the discrimination time of speech. Furthermore, the TMS-induced effect correlated with listeners' similarity judgments between listeners' and speakers' speech productions. Thus, the degree of motor recruitment depends on the perceived distance between listener and speaker. This result supports the claim that discriminating others' speech pattern requires the contribution of the listener's own motor repertoire. We conclude that motor recruitment in speech perception can be a natural product of discriminating speech in a normally variable and unpredictable environment, not merely related to task difficulty. © The Author 2013. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  19. Semantic Categories in the Domain of Motion Verbs by Adult Speakers of Danish, German, and Turkish

    Directory of Open Access Journals (Sweden)

    Jessen, Moiken

    2013-12-01

    Full Text Available Languages differ in the ways they divide the world. This study applies cluster analysis to understand how and why languages differ in the way they express motion events. It further lays out what the parameters of the structure of the semantic space of motion are, based on data collected from participants who were adult speakers of Danish, German, and Turkish. The participants described 37 video clips depicting a large variety of motion events. The results of the study show that the segmentation of the semantic space displays a great deal of variation across all three groups. Turkish differs from German and Danish with respect to the features used to segment the semantic space – namely by using vector orientation. German and Danish differ greatly with respect to (a how fine-grained the distinctions made are, and (b how motion verbs with a common Germanic root are distributed across the semantic space. Consequently, this study illustrates that the parameters applied for categorization by speakers are, to some degree, related to typological membership, in relation to Talmy's typological framework for the expression of motion events. Finally, the study shows that the features applied for categorization differ across languages and that typological membership is not necessarily a predictor of elaboration of the motion verb lexicon.

  20. Assessing the Performance of Automatic Speech Recognition Systems When Used by Native and Non-Native Speakers of Three Major Languages in Dictation Workflows

    DEFF Research Database (Denmark)

    Zapata, Julián; Kirkedal, Andreas Søeborg

    2015-01-01

    In this paper, we report on a two-part experiment aiming to assess and compare the performance of two types of automatic speech recognition (ASR) systems on two different computational platforms when used to augment dictation workflows. The experiment was performed with a sample of speakers...... of three major languages and with different linguistic profiles: non-native English speakers; non-native French speakers; and native Spanish speakers. The main objective of this experiment is to examine ASR performance in translation dictation (TD) and medical dictation (MD) workflows without manual...

  1. More than use it or lose it: the number-of-speakers effect on heritage language proficiency.

    Science.gov (United States)

    Gollan, Tamar H; Starr, Jennie; Ferreira, Victor S

    2015-02-01

    Acquiring a heritage language (HL), a minority language spoken primarily at home, is often a major step toward achieving bilingualism. Two studies examined factors that promote HL proficiency. Chinese-English and Spanish-English undergraduates and Hebrew-English children named pictures in both their languages, and they or their parents completed language history questionnaires. HL picture-naming ability correlated positively with the number of different HL speakers participants spoke to as children, independently of each language's frequency of use, and without negatively affecting English picture-naming ability. HL performance increased also when primary caregivers had lower English proficiency, with later English age of acquisition, and (in children) with increased age. These results suggest a prescription for increasing bilingual proficiency is regular interaction with multiple HL speakers. Responsible cognitive mechanisms could include greater variety of words used by different speakers, representational robustness from exposure to variations in form, or multiple retrieval cues, perhaps analogous to contextual diversity effects.

  2. A Novel Approach to Speaker Weight Estimation Using a Fusion of the i-vector and NFA Frameworks

    DEFF Research Database (Denmark)

    Poorjam, Amir Hossein; Bahari, Mohamad Hasan; Van hamme, Hogo

    2017-01-01

    This paper proposes a novel approach for automatic speaker weight estimation from spontaneous telephone speech signals. In this method, each utterance is modeled using the i-vector framework which is based on the factor analysis on Gaussian Mixture Model (GMM) mean supervectors, and the Non......-negative Factor Analysis (NFA) framework which is based on a constrained factor analysis on GMM weight supervectors. Then, the available information in both Gaussian means and Gaussian weights is exploited through a feature-level fusion of the i-vectors and the NFA vectors. Finally, a least-squares support vector...... regression is employed to estimate the weight of speakers from the given utterances. The proposed approach is evaluated on spontaneous telephone speech signals of National Institute of Standards and Technology 2008 and 2010 Speaker Recognition Evaluation corpora. To investigate the effectiveness...

  3. A Study of the Effect of Emotional State upon the Variation of the Fundamental Frequency of a Speaker

    Directory of Open Access Journals (Sweden)

    Marius Vasile GHIURCAU

    2010-01-01

    Full Text Available Telephone banking or brokering, building accesssystems or forensics are some of the areas in which speakerrecognition is continuously developing. Fundamental frequencyrepresents an important speech feature used in theseapplications. In this paper we present a study of the effect ofemotional state of a speaker upon the variation of thefundamental frequency of the speech signal. Human beings arequite frequently overwhelmed by various emotions and most ofthe time one can not really control these emotional states. Forthe purpose of our work we have used the Berlin emotionalspeech database which contains utterances of 10 speakers indifferent emotional situations: happy, angry, fearful, bored andneutral. The mean fundamental frequency and also the standarddeviation for every speaker in all the emotional states werecomputed. The results show a very strong influence of theemotional state upon frequency variation.

  4. The impact of compression of speech signal, background noise and acoustic disturbances on the effectiveness of speaker identification

    Science.gov (United States)

    Kamiński, K.; Dobrowolski, A. P.

    2017-04-01

    The paper presents the architecture and the results of optimization of selected elements of the Automatic Speaker Recognition (ASR) system that uses Gaussian Mixture Models (GMM) in the classification process. Optimization was performed on the process of selection of individual characteristics using the genetic algorithm and the parameters of Gaussian distributions used to describe individual voices. The system that was developed was tested in order to evaluate the impact of different compression methods used, among others, in landline, mobile, and VoIP telephony systems, on effectiveness of the speaker identification. Also, the results were presented of effectiveness of speaker identification at specific levels of noise with the speech signal and occurrence of other disturbances that could appear during phone calls, which made it possible to specify the spectrum of applications of the presented ASR system.

  5. Correlation between low-proficiency in English and negative perceptions of what it means to be an English speaker

    Directory of Open Access Journals (Sweden)

    Kavarljit Kaur Gill

    2013-01-01

    Full Text Available Learning another language is very much affected by positive or negative connotations attached to the new language by the language learner. Entering Malaysian public universities there are many students with a low proficiency in English, despite spending eleven years studying English in schools. Could it be that the lack of progress among these students could be attributed to a negative view of what it means to be a speaker of English? This study investigated the perceptions of students at a public university, to determine whether there is a correlation between low-proficiency and negative perceptions of what it means to be an English speaker. Analysis of the results showed that Malaysian students have a very positive perception of what it means to be an English speaker.

  6. Assessing public speaking fear with the short form of the Personal Report of Confidence as a Speaker scale: confirmatory factor analyses among a French-speaking community sample.

    Science.gov (United States)

    Heeren, Alexandre; Ceschi, Grazia; Valentiner, David P; Dethier, Vincent; Philippot, Pierre

    2013-01-01

    The main aim of this study was to assess the reliability and structural validity of the French version of the 12-item version of the Personal Report of Confidence as Speaker (PRCS), one of the most promising measurements of public speaking fear. A total of 611 French-speaking volunteers were administered the French versions of the short PRCS, the Liebowitz Social Anxiety Scale, the Fear of Negative Evaluation scale, as well as the Trait version of the Spielberger State-Trait Anxiety Inventory and the Beck Depression Inventory-II, which assess the level of anxious and depressive symptoms, respectively. Regarding its structural validity, confirmatory factor analyses indicated a single-factor solution, as implied by the original version. Good scale reliability (Cronbach's alpha = 0.86) was observed. The item discrimination analysis suggested that all the items contribute to the overall scale score reliability. The French version of the short PRCS showed significant correlations with the Liebowitz Social Anxiety Scale (r = 0.522), the Fear of Negative Evaluation scale (r = 0.414), the Spielberger State-Trait Anxiety Inventory (r = 0.516), and the Beck Depression Inventory-II (r = 0.361). The French version of the short PRCS is a reliable and valid measure for the evaluation of the fear of public speaking among a French-speaking sample. These findings have critical consequences for the measurement of psychological and pharmacological treatment effectiveness in public speaking fear among a French-speaking sample.

  7. 2012 Gordon Research Conference on Bioinspired Materials - Formal Schedule and Speaker/Poster Program

    Energy Technology Data Exchange (ETDEWEB)

    Chilkoti, Ashutosk [Duke Univ., Durham, NC (United States)

    2012-06-29

    The emerging, interdisciplinary field of Bioinspired Materials focuses on developing a fundamental understanding of the synthesis, directed self-assembly and hierarchical organization of natural occurring materials, and uses this understanding to engineer new bioinspired artificial materials for diverse applications. The inaugural 2012 Gordon Conference on Bioinspired Materials seeks to capture the excitement of this burgeoning field by a cutting-edge scientific program and roster of distinguished invited speakers and discussion leaders who will address the key issues in the field. The Conference will feature a wide range of topics, such as materials and devices from DNA, reprogramming the genetic code for design of new materials, peptide, protein and carbohydrate based materials, biomimetic systems, complexity in self-assembly, and biomedical applications of bioinspired materials.

  8. Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions

    DEFF Research Database (Denmark)

    Ma, Ning; Brown, Guy J.; May, Tobias

    2015-01-01

    This paper presents a novel machine-hearing system that exploits deep neural networks (DNNs) and head movements for binaural localisation of multiple speakers in reverberant conditions. DNNs are used to map binaural features, consisting of the complete crosscorrelation function (CCF) and interaural...... level differences (ILDs), to the source azimuth. Our approach was evaluated using a localisation task in which sources were located in a full 360-degree azimuth range. As a result, front-back confusions often occurred due to the similarity of binaural features in the front and rear hemifields....... To address this, a head movement strategy was incorporated in the DNN-based model to help reduce the front-back errors. Our experiments show that, compared to a system based on a Gaussian mixture model (GMM) classifier, the proposed DNN system substantially reduces localisation errors under challenging...

  9. Classroom acoustics design guidelines based on the optimization of speaker conditions

    DEFF Research Database (Denmark)

    Pelegrin Garcia, David; Brunskog, Jonas

    2012-01-01

    School teachers suffer frequently from voice problems due to the high vocal load that they experience and the not-always-ideal conditions under which they have to teach. Traditionally, the purpose of the acoustic design of classrooms has been to optimize speech intelligibility. New guidelines...... and noise level measurements in classrooms. Requirements of optimum vocal comfort, average A-weighted speech levels across the audience higher than 50 dB, and a physical volume higher than 6 m3/student are combined to extract optimum acoustic conditions, which depend on the number of students....... These conditions, which are independent on the position of the speaker, cannot be optimum for more than 50 students. For classrooms with 10 students, the reverberation time in occupied conditions shall be between 0.5 and 0.65 s, and the volume between 60 and 170 m3. For classrooms with 40 students...

  10. TEACHING NON-INDIGENOUS SPEAKERS OF ISIXHOSA: A CRITICAL EVALUATION OF OWN PRACTICE

    Directory of Open Access Journals (Sweden)

    Phumla Kese

    2012-09-01

    Full Text Available In this article, the author reports on a mid-year critical evaluation of her teaching practice at a multilingual university with students who are non-indigenous speakers of isiXhosa. Data is gleaned from participatory observations, critical-incident journal entries, students’ assessments, views of an outside observer, and student feedback. The conclusion is drawn that to achieve communicative competence in the Xhosa language, the following are vital towards meeting the challenges of students’ anxieties about the course: increasing student motivation, confronting the differences between students’ home languages and isiXhosa, using tutors who are proficient in the students’ mother tongue, as well as encouraging creativity together by keeping a balance between oral and written tasks.

  11. Study of the characteristic parameters of the normal voices of Argentinian speakers

    Directory of Open Access Journals (Sweden)

    Edgardo Bonzi

    2014-02-01

    Full Text Available The voice laboratory permits to study the human voices using a method that is objective and noninvasive. In this work, we have studied the parameters of the human voice such as pitch, formant, jitter, shimmer and harmonic-noise ratio of a group of young people. This statistical information of parameters is obtained from Argentinian speakers. Received: 29 December 2013, Accepted: 27 May 2014; Reviewed by: J. Brum, Instituto de Física, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay; Edited by: E. Mizraji; DOI: http://dx.doi.org/10.4279/PIP.060002 Cite as: E V Bonzi, G B Grad, A M Maggi, M R Muñoz, Papers in Physics 6, 060002 (2014

  12. Genre Analysis of MA Thesis Abstracts by native and (Iraqi non-native Speakers of English

    Directory of Open Access Journals (Sweden)

    Firas Abdul-Munim Jawad

    2017-12-01

    Full Text Available Abstract         The purpose of the present study is to investigate the macro-structure (move-step analysis of abstract section of MA theses in linguistics. There are two corpora consist of fifteen MA theses written by American native and fifteen others by Iraqi Arab non-native speakers of English. Based on the analysis, some similarities and differences were found regarding cyclicity (move structure and move-step frequency of occurrence. The findings of the present study are useful for Iraqi non-native writers by presenting the important factors that make them familiar with rhetorical move-step structure of MA thesis abstracts.

  13. Issues In Teaching English To Speakers Of Other Languages (Tesol: A Review Of Literature

    Directory of Open Access Journals (Sweden)

    Sri Rejeki Murtiningsih

    2011-05-01

    Full Text Available The article aims to review English language teaching to speakers of other languages and other related aspects such as the use of technology and the first language in classroom practices. As the need for English language competency develops, efforts to support students’ learning also change. In addition to the teaching techniques, the technology development has also gained attention for improving effective teaching and learning. Because English is a language that is not used in daily communication and the variety of students’ language competence, the use of first language in a foreign language classroom has also become another point to encourage the students’ foreign language acquisition.

  14. Some features of a typical house as perceived by native speakers of English and of Serbian

    Directory of Open Access Journals (Sweden)

    Dilparić Branislava M.

    2015-01-01

    Full Text Available The paper reports findings from one of the two differently-designed surveys conducted among groups of both native speakers of English and native speakers of Serbian with a common general objective to obtain a picture of better candidates for a role of the whole (to be analyzed into its constituent elements in the contrastive study on the lexical field a house and its parts in English and Serbian. The specific objective of the survey presented here, however, was to build up the target picture with some of the features of the ideal example of the house category, such as the shape of the house, the key materials its principal structural elements (foundations, walls, a roof are made of, the number of residential units in the house and the type of the household that occupies it, the number of the house levels, the minimum of its interior spatial components and their functions, the types of systemic parts in the house, the status and position of the house relative to surround­ing buildings, etc. Also, taking into consideration that the demographic profiles of the survey participants reflected various cultural backgrounds (which significantly influence the formation of mental images of a typical sample of the category, the survey aimed to compare the similarities and differences between the 'English' and the 'Serbian' typical house, that is the features assigned to a typical house by most of the surveyed representa­tives of Anglo-American and by those of Serbian culture. Judging exclusively by the features observed and the results obtained, the study concludes that the 'English' and the 'Serbian' typical house look very similar in many aspect and that the two different cultures are not as distant as they may seem.

  15. Dietary restrictions in healing among speakers of Iquito, an endangered language of the Peruvian Amazon.

    Science.gov (United States)

    Jernigan, Kevin A

    2011-07-11

    Ethno botanical research was carried out with speakers of Iquitos, a critically endangered Amazonian language of the Zaparoan family. The study focused on the concept of "dieting" (siyan++ni in Iquitos), a practice involving prohibitions considered necessary to the healing process. These restrictions include: 1) foods and activities that can exacerbate illness, 2) environmental influences that conflict with some methods of healing (e.g. steam baths or enemas) and 3) foods and activities forbidden by the spirits of certain powerful medicinal plants. The study tested the following hypotheses: H1--Each restriction will correlate with specific elements in illness explanatory models and H2--Illnesses whose explanatory models have personality elements will show a greater number and variety of restrictions than those based on naturalistic reasoning. The work was carried out in 2009 and 2010 in the Alto Nanay region of Peru. In structured interviews, informants gave explanatory models for illness categories, including etiologies, pathophysiologies, treatments and dietary restrictions necessary for 49 illnesses. Seventeen botanical vouchers for species said to have powerful spirits that require diets were also collected. All restrictions found correspond to some aspect of illness explanatory models. Thirty-five percent match up with specific illness etiologies, 53% correspond to particular pathophysiologies, 18% correspond with overall seriousness of the illness and 18% are only found with particular forms of treatment. Diets based on personalistic reasoning have a significantly higher average number of restrictions than those based on naturalistic reasoning. Dieting plays a central role in healing among Iquitos speakers. Specific prohibitions can be explained in terms of specific aspects of illness etiologies, pathophysiologies and treatments. Although the Amazonian literature contains few studies focusing on dietary proscriptions over a wide range of illnesses, some

  16. Lexical Access in Persian Normal Speakers: Picture Naming, Verbal Fluency and Spontaneous Speech

    Directory of Open Access Journals (Sweden)

    Zahra Sadat Ghoreishi

    2014-06-01

    Full Text Available Objectives: Lexical access is the process by which the basic conceptual, syntactical and morpho-phonological information of words are activated. Most studies of lexical access have focused on picture naming. There is hardly any previous research on other parameters of lexical access such as verbal fluency and analysis of connected speech in Persian normal participants. This study investigates the lexical access performance in normal speakers in different issues such as age, sex and education. Methods: The performance of 120 adult Persian speakers in three tasks including picture naming, verbal fluency and connected speech, was examined using "Persian Lexical Access Assessment Package”. The performance of participants between two gender groups (male/female, three education groups (below 5 years, above 12 years, between 5 and 12 years and three age groups (18-35 years, 36-55 years, 56-75 years were compared. Results: According to findings, picture naming increased with increasing education and decreased with increasing age. The performance of participants in phonological and semantic verbal fluency showed improvement with age and education. No significant difference was seen between males and females in verbal fluency task. In the analysis of connected speech there were no significant differences between different age and education groups and just mean length of utterance in males was significantly higher than females. Discussion: The findings could be a primitive scale for comparison between normal subjects and patients in lexical access tasks, furthermore it could be a horizon for planning of treatment goals in patients with word finding problem according to age, gender and education.

  17. Brain Plasticity in Speech Training in Native English Speakers Learning Mandarin Tones

    Science.gov (United States)

    Heinzen, Christina Carolyn

    The current study employed behavioral and event-related potential (ERP) measures to investigate brain plasticity associated with second-language (L2) phonetic learning based on an adaptive computer training program. The program utilized the acoustic characteristics of Infant-Directed Speech (IDS) to train monolingual American English-speaking listeners to perceive Mandarin lexical tones. Behavioral identification and discrimination tasks were conducted using naturally recorded speech, carefully controlled synthetic speech, and non-speech control stimuli. The ERP experiments were conducted with selected synthetic speech stimuli in a passive listening oddball paradigm. Identical pre- and post- tests were administered on nine adult listeners, who completed two-to-three hours of perceptual training. The perceptual training sessions used pair-wise lexical tone identification, and progressed through seven levels of difficulty for each tone pair. The levels of difficulty included progression in speaker variability from one to four speakers and progression through four levels of acoustic exaggeration of duration, pitch range, and pitch contour. Behavioral results for the natural speech stimuli revealed significant training-induced improvement in identification of Tones 1, 3, and 4. Improvements in identification of Tone 4 generalized to novel stimuli as well. Additionally, comparison between discrimination of across-category and within-category stimulus pairs taken from a synthetic continuum revealed a training-induced shift toward more native-like categorical perception of the Mandarin lexical tones. Analysis of the Mismatch Negativity (MMN) responses in the ERP data revealed increased amplitude and decreased latency for pre-attentive processing of across-category discrimination as a result of training. There were also laterality changes in the MMN responses to the non-speech control stimuli, which could reflect reallocation of brain resources in processing pitch patterns

  18. Dietary restrictions in healing among speakers of Iquito, an endangered language of the Peruvian Amazon

    Directory of Open Access Journals (Sweden)

    Jernigan Kevin A

    2011-07-01

    Full Text Available Abstract Background Ethnobotanical research was carried out with speakers of Iquito, a critically endangered Amazonian language of the Zaparoan family. The study focused on the concept of "dieting" (siyan++ni in Iquito, a practice involving prohibitions considered necessary to the healing process. These restrictions include: 1 foods and activities that can exacerbate illness, 2 environmental influences that conflict with some methods of healing (e.g. steam baths or enemas and 3 foods and activities forbidden by the spirits of certain powerful medicinal plants. The study tested the following hypotheses: H1 - Each restriction will correlate with specific elements in illness explanatory models and H2 - Illnesses whose explanatory models have personalistic elements will show a greater number and variety of restrictions than those based on naturalistic reasoning. Methods The work was carried out in 2009 and 2010 in the Alto Nanay region of Peru. In structured interviews, informants gave explanatory models for illness categories, including etiologies, pathophysiologies, treatments and dietary restrictions necessary for 49 illnesses. Seventeen botanical vouchers for species said to have powerful spirits that require diets were also collected. Results All restrictions found correspond to some aspect of illness explanatory models. Thirty-five percent match up with specific illness etiologies, 53% correspond to particular pathophysiologies, 18% correspond with overall seriousness of the illness and 18% are only found with particular forms of treatment. Diets based on personalistic reasoning have a significantly higher average number of restrictions than those based on naturalistic reasoning. Conclusions Dieting plays a central role in healing among Iquito speakers. Specific prohibitions can be explained in terms of specific aspects of illness etiologies, pathophysiologies and treatments. Although the Amazonian literature contains few studies focusing on

  19. An Alternative Understanding of Education and Empowerment: Local-Level Perspectives of Refugee Social Integration in the United Kingdom

    Science.gov (United States)

    Klenk, Hazel

    2017-01-01

    This article draws on research with a group of refugee women who attend English for speakers of other languages (ESOL) classes at a community center in London. It considers the role education can play in their social integration processes, employing an approach that has been developed from feminist notions of empowerment and social practice…

  20. Social learning and sociality

    NARCIS (Netherlands)

    Reader, S.M.; Lefebvre, L.

    2001-01-01

    Sociality may not be a defining feature of social learning. Complex social systems have been predicted to favour the evolution of social learning, but the evidence for this relationship is weak. In birds, only one study supports the hypothesis that social learning is an adaptive