audio-enhanced personal digital: Topics by WorldWideScience.org

Sample records for audio-enhanced personal digital

ESA personal communications and digital audio broadcasting systems based on non-geostationary satellites

Science.gov (United States)

Logalbo, P.; Benedicto, J.; Viola, R.

1993-01-01

Personal Communications and Digital Audio Broadcasting are two new services that the European Space Agency (ESA) is investigating for future European and Global Mobile Satellite systems. ESA is active in promoting these services in their various mission options including non-geostationary and geostationary satellite systems. A Medium Altitude Global Satellite System (MAGSS) for global personal communications at L and S-band, and a Multiregional Highly inclined Elliptical Orbit (M-HEO) system for multiregional digital audio broadcasting at L-band are described. Both systems are being investigated by ESA in the context of future programs, such as Archimedes, which are intended to demonstrate the new services and to develop the technology for future non-geostationary mobile communication and broadcasting satellites.
Making the Switch to Digital Audio

Directory of Open Access Journals (Sweden)

Shannon Gwin Mitchell

2004-12-01

Full Text Available In this article, the authors describe the process of converting from analog to digital audio data. They address the step-by-step decisions that they made in selecting hardware and software for recording and converting digital audio, issues of system integration, and cost considerations. The authors present a brief description of how digital audio is being used in their current research project and how it has enhanced the “quality” of their qualitative research.
Digital signal processor for silicon audio playback devices; Silicon audio saisei kikiyo digital signal processor

Energy Technology Data Exchange (ETDEWEB)

NONE

2000-03-01

The digital audio signal processor (DSP) TC9446F series has been developed silicon audio playback devices with a memory medium of, e.g., flash memory, DVD players, and AV devices, e.g., TV sets. It corresponds to AAC (advanced audio coding) (2ch) and MP3 (MPEG1 Layer3), as the audio compressing techniques being used for transmitting music through an internet. It also corresponds to compressed types, e.g., Dolby Digital, DTS (digital theater system) and MPEG2 audio, being adopted for, e.g., DVDs. It can carry a built-in audio signal processing program, e.g., Dolby ProLogic, equalizer, sound field controlling, and 3D sound. TC9446XB has been lined up anew. It adopts an FBGA (fine pitch ball grid array) package for portable audio devices. (translated by NEDO)
DAFX Digital Audio Effects

CERN Document Server

Zö

2011-01-01

The rapid development in various fields of Digital Audio Effects, or DAFX, has led to new algorithms and this second edition of the popular book, DAFX: Digital Audio Effects has been updated throughout to reflect progress in the field. It maintains a unique approach to DAFX with a lecture-style introduction into the basics of effect processing. Each effect description begins with the presentation of the physical and acoustical phenomena, an explanation of the signal processing techniques to achieve the effect, followed by a discussion of musical applications and the control of effect parameter
Digital Augmented Reality Audio Headset

Directory of Open Access Journals (Sweden)

Jussi Rämö

2012-01-01

Full Text Available Augmented reality audio (ARA combines virtual sound sources with the real sonic environment of the user. An ARA system can be realized with a headset containing binaural microphones. Ideally, the ARA headset should be acoustically transparent, that is, it should not cause audible modification to the surrounding sound. A practical implementation of an ARA mixer requires a low-latency headphone reproduction system with additional equalization to compensate for the attenuation and the modified ear canal resonances caused by the headphones. This paper proposes digital IIR filters to realize the required equalization and evaluates a real-time prototype ARA system. Measurements show that the throughput latency of the digital prototype ARA system can be less than 1.4 ms, which is sufficiently small in practice. When the direct and processed sounds are combined in the ear, a comb filtering effect is brought about and appears as notches in the frequency response. The comb filter effect in speech and music signals was studied in a listening test and it was found to be inaudible when the attenuation is 20 dB. Insert ARA headphones have a sufficient attenuation at frequencies above about 1 kHz. The proposed digital ARA system enables several immersive audio applications, such as a virtual audio tourist guide and audio teleconferencing.
106-17 Telemetry Standards Digitized Audio Telemetry Standard Chapter 5

Science.gov (United States)

2017-07-01

Digitized Audio Telemetry Standard 5.1 General This chapter defines continuously variable slope delta (CVSD) modulation as the standard for digitizing...audio signal. The CVSD modulator is, in essence , a 1-bit analog-to-digital converter. The output of this 1-bit encoder is a serial bit stream, where
Securing Digital Audio using Complex Quadratic Map

Science.gov (United States)

Suryadi, MT; Satria Gunawan, Tjandra; Satria, Yudi

2018-03-01

In This digital era, exchanging data are common and easy to do, therefore it is vulnerable to be attacked and manipulated from unauthorized parties. One data type that is vulnerable to attack is digital audio. So, we need data securing method that is not vulnerable and fast. One of the methods that match all of those criteria is securing the data using chaos function. Chaos function that is used in this research is complex quadratic map (CQM). There are some parameter value that causing the key stream that is generated by CQM function to pass all 15 NIST test, this means that the key stream that is generated using this CQM is proven to be random. In addition, samples of encrypted digital sound when tested using goodness of fit test are proven to be uniform, so securing digital audio using this method is not vulnerable to frequency analysis attack. The key space is very huge about 8.1×l031 possible keys and the key sensitivity is very small about 10-10, therefore this method is also not vulnerable against brute-force attack. And finally, the processing speed for both encryption and decryption process on average about 450 times faster that its digital audio duration.
Removable Watermarking Sebagai Pengendalian Terhadap Cyber Crime Pada Audio Digital

Directory of Open Access Journals (Sweden)

Reyhani Lian Putri

2017-08-01

Full Text Available Perkembangan teknologi informasi yang pesat menuntut penggunanya untuk lebih berhati-hati seiring semakin meningkatnya cyber crime.Banyak pihak telah mengembangkan berbagai teknik perlindungan data digital, salah satunya adalah watermarking. Teknologi watermarking berfungsi untuk memberikan identitas, melindungi, atau menandai data digital, baik audio, citra, ataupun video, yang mereka miliki. Akan tetapi, teknik tersebut masih dapat diretas oleh oknum-oknum yang tidak bertanggung jawab.Pada penelitian ini, proses watermarking diterapkan pada audio digital dengan menyisipkan watermark yang terdengar jelas oleh indera pendengaran manusia (perceptible pada audio host.Hal ini bertujuan agar data audio dapat terlindungi dan apabila ada pihak lain yang ingin mendapatkan data audio tersebut harus memiliki “kunci” untuk menghilangkan watermark. Proses removable watermarking ini dilakukan pada data watermark yang sudah diketahui metode penyisipannya, agar watermark dapat dihilangkan sehingga kualitas audio menjadi lebih baik. Dengan menggunakan metode ini diperoleh kinerja audio watermarking pada nilai distorsi tertinggi dengan rata-rata nilai SNR sebesar7,834 dB dan rata-rata nilai ODG sebesar -3,77.Kualitas audio meningkat setelah watermark dihilangkan, di mana rata-rata SNR menjadi sebesar 24,986 dB dan rata-rata ODG menjadi sebesar -1,064 serta nilai MOS sebesar 4,40.
37 CFR 201.28 - Statements of Account for digital audio recording devices or media.

Science.gov (United States)

2010-07-01

... conjunction with an annual audit of the manufacturing or importing party's financial statements. (ii) The CPA... Certified Public Accountants. (5) Manufacturing or importing party refers to any person or entity that... general class of products made up of functionally equivalent digital audio recording products with...
El Digital Audio Tape Recorder. Contra autores y creadores

Directory of Open Access Journals (Sweden)

Jun Ono

2015-01-01

Full Text Available La llamada "DAT" (abreviatura por "digital audio tape recorder" / grabadora digital de audio ha recibido cobertura durante mucho tiempo en los medios masivos de Japón y otros países, como un producto acústico electrónico nuevo y controversial de la industria japonesa de artefactos electrónicos. ¿Qué ha pasado con el objeto de esta controversia?
Distortion-Free 1-Bit PWM Coding for Digital Audio Signals

Directory of Open Access Journals (Sweden)

John Mourjopoulos

2007-01-01

Full Text Available Although uniformly sampled pulse width modulation (UPWM represents a very efficient digital audio coding scheme for digital-to-analog conversion and full-digital amplification, it suffers from strong harmonic distortions, as opposed to benign non-harmonic artifacts present in analog PWM (naturally sampled PWM, NPWM. Complete elimination of these distortions usually requires excessive oversampling of the source PCM audio signal, which results to impractical realizations of digital PWM systems. In this paper, a description of digital PWM distortion generation mechanism is given and a novel principle for their minimization is proposed, based on a process having some similarity to the dithering principle employed in multibit signal quantization. This conditioning signal is termed “jither” and it can be applied either in the PCM amplitude or the PWM time domain. It is shown that the proposed method achieves significant decrement of the harmonic distortions, rendering digital PWM performance equivalent to that of source PCM audio, for mild oversampling (e.g., ×4 resulting to typical PWM clock rates of 90 MHz.
Distortion-Free 1-Bit PWM Coding for Digital Audio Signals

Directory of Open Access Journals (Sweden)

Mourjopoulos John

2007-01-01

Full Text Available Although uniformly sampled pulse width modulation (UPWM represents a very efficient digital audio coding scheme for digital-to-analog conversion and full-digital amplification, it suffers from strong harmonic distortions, as opposed to benign non-harmonic artifacts present in analog PWM (naturally sampled PWM, NPWM. Complete elimination of these distortions usually requires excessive oversampling of the source PCM audio signal, which results to impractical realizations of digital PWM systems. In this paper, a description of digital PWM distortion generation mechanism is given and a novel principle for their minimization is proposed, based on a process having some similarity to the dithering principle employed in multibit signal quantization. This conditioning signal is termed "jither" and it can be applied either in the PCM amplitude or the PWM time domain. It is shown that the proposed method achieves significant decrement of the harmonic distortions, rendering digital PWM performance equivalent to that of source PCM audio, for mild oversampling (e.g., resulting to typical PWM clock rates of 90 MHz.
Extraction, Mapping, and Evaluation of Expressive Acoustic Features for Adaptive Digital Audio Effects

DEFF Research Database (Denmark)

Holfelt, Jonas; Csapo, Gergely; Andersson, Nikolaj Schwab

2017-01-01

This paper describes the design and implementation of a real-time adaptive digital audio effect with an emphasis on using expressive audio features that control effect param- eters. Research in adaptive digital audio effects is cov- ered along with studies about expressivity and important...
Audible Aliasing Distortion in Digital Audio Synthesis

Directory of Open Access Journals (Sweden)

J. Schimmel

2012-04-01

Full Text Available This paper deals with aliasing distortion in digital audio signal synthesis of classic periodic waveforms with infinite Fourier series, for electronic musical instruments. When these waveforms are generated in the digital domain then the aliasing appears due to its unlimited bandwidth. There are several techniques for the synthesis of these signals that have been designed to avoid or reduce the aliasing distortion. However, these techniques have high computing demands. One can say that today's computers have enough computing power to use these methods. However, we have to realize that today’s computer-aided music production requires tens of multi-timbre voices generated simultaneously by software synthesizers and the most of the computing power must be reserved for hard-disc recording subsystem and real-time audio processing of many audio channels with a lot of audio effects. Trivially generated classic analog synthesizer waveforms are therefore still effective for sound synthesis. We cannot avoid the aliasing distortion but spectral components produced by the aliasing can be masked with harmonic components and thus made inaudible if sufficient oversampling ratio is used. This paper deals with the assessment of audible aliasing distortion with the help of a psychoacoustic model of simultaneous masking and compares the computing demands of trivial generation using oversampling with those of other methods.
Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications

NARCIS (Netherlands)

Pocta, P.; Beerends, J.G.

2015-01-01

This paper investigates the impact of different audio codecs typically deployed in current digital audio broadcasting (DAB) systems and web-casting applications, which represent a main source of quality impairment in these systems and applications, on the quality perceived by the end user. Both
A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio

DEFF Research Database (Denmark)

Lund, Haakon; Skov, Mette; Larsen, Birger

2014-01-01

An increasing number of large digitized audio-visual collections within digital humanities have recently been made available for users. Often access to digitized audio-visual collections is hampered by little and inconsistent metadata. This paper presents the preliminary findings from a study of ...
Personalized Audio Systems - a Bayesian Approach

DEFF Research Database (Denmark)

Nielsen, Jens Brehm; Jensen, Bjørn Sand; Hansen, Toke Jansen

2013-01-01

Modern audio systems are typically equipped with several user-adjustable parameters unfamiliar to most users listening to the system. To obtain the best possible setting, the user is forced into multi-parameter optimization with respect to the users's own objective and preference. To address this......, the present paper presents a general inter-active framework for personalization of such audio systems. The framework builds on Bayesian Gaussian process regression in which a model of the users's objective function is updated sequentially. The parameter setting to be evaluated in a given trial is selected...
pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.

Science.gov (United States)

Giannakopoulos, Theodoros

2015-01-01

Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures including: feature extraction, classification of audio signals, supervised and unsupervised segmentation and content visualization. pyAudioAnalysis is licensed under the Apache License and is available at GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the wide range of the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has been already used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audio-visual features, music segmentation, multimodal content-based movie recommendation and health applications (e.g. monitoring eating habits). The feedback provided from all these particular audio applications has led to practical enhancement of the library.
Digital video and audio broadcasting technology a practical engineering guide

CERN Document Server

Fischer, Walter

2010-01-01

Digital Video and Audio Broadcasting Technology - A Practical Engineering Guide' deals with all the most important digital television, sound radio and multimedia standards such as MPEG, DVB, DVD, DAB, ATSC, T-DMB, DMB-T, DRM and ISDB-T. The book provides an in-depth look at these subjects in terms of practical experience. In addition it contains chapters on the basics of technologies such as analog television, digital modulation, COFDM or mathematical transformations between time and frequency domains. The attention in the respective field under discussion is focussed on aspects of measuring t
Perceived Audio Quality Analysis in Digital Audio Broadcasting Plus System Based on PEAQ

Directory of Open Access Journals (Sweden)

K. Ulovec

2018-04-01

Full Text Available Broadcasters need to decide on bitrates of the services in the multiplex transmitted via Digital Audio Broadcasting Plus system. The bitrate should be set as low as possible for maximal number of services, but with high quality, not lower than in conventional analog systems. In this paper, the objective method Perceptual Evaluation of Audio Quality is used to analyze the perceived audio quality for appropriate codecs --- MP2 and AAC offering three profiles. The main aim is to determine dependencies on the type of signal --- music and speech, the number of channels --- stereo and mono, and the bitrate. Results indicate that only MP2 codec and AAC Low Complexity profile reach imperceptible quality loss. The MP2 codec needs higher bitrate than AAC Low Complexity profile for the same quality. For the both versions of AAC High-Efficiency profiles, the limit bitrates are determined above which less complex profiles outperform the more complex ones and higher bitrates above these limits are not worth using. It is shown that stereo music has worse quality than stereo speech generally, whereas for mono, the dependencies vary upon the codec/profile. Furthermore, numbers of services satisfying various quality criteria are presented.

Audio Conferencing Enhancements

OpenAIRE

VESTERINEN, LEENA

2006-01-01

Audio conferencing allows multiple people in distant locations to interact in a single voice call. Whilst it can be very useful service it also has several key disadvantages. This thesis study investigated the options for improving the user experience of the mobile teleconferencing applications. In particular, the use of 3D, spatial audio and visualinteractive functionality was investigated as the means of improving the intelligibility and audio perception during the audio...
Migrating Home Computer Audio Waveforms to Digital Objects: A Case Study on Digital Archaeology

Directory of Open Access Journals (Sweden)

Mark Guttenbrunner

2011-03-01

Full Text Available Rescuing data from inaccessible or damaged storage media for the purpose of preserving the digital data for the long term is one of the dimensions of digital archaeology. With the current pace of technological development, any system can become obsolete in a matter of years and hence the data stored in a specific storage media might not be accessible anymore due to the unavailability of the system to access the media. In order to preserve digital records residing in such storage media, it is necessary to extract the data stored in those media by some means.One early storage medium for home computers in the 1980s was audio tape. The first home computer systems allowed the use of standard cassette players to record and replay data. Audio cassettes are more durable than old home computers when properly stored. Devices playing this medium (i.e. tape recorders can be found in working condition or can be repaired, as they are usually made out of standard components. By re-engineering the format of the waveform and the file formats, the data on such media can then be extracted from a digitised audio stream and migrated to a non-obsolete format.In this paper we present a case study on extracting the data stored on an audio tape by an early home computer system, namely the Philips Videopac+ G7400. The original data formats were re-engineered and an application was written to support the migration of the data stored on tapes without using the original system. This eliminates the necessity of keeping an obsolete system alive for enabling access to the data on the storage media meant for this system. Two different methods to interpret the data and eliminate possible errors in the tape were implemented and evaluated on original tapes, which were recorded 20 years ago. Results show that with some error correction methods, parts of the tapes are still readable even without the original system. It also implies that it is easier to build solutions while original
Subband coding of digital audio signals without loss of quality

NARCIS (Netherlands)

Veldhuis, Raymond N.J.; Breeuwer, Marcel; van de Waal, Robbert

1989-01-01

A subband coding system for high quality digital audio signals is described. To achieve low bit rates at a high quality level, it exploits the simultaneous masking effect of the human ear. It is shown how this effect can be used in an adaptive bit-allocation scheme. The proposed approach has been
Interactive 3D audio: Enhancing awareness of details in immersive soundscapes?

DEFF Research Database (Denmark)

Schmidt, Mikkel Nørgaard; Schwartz, Stephen; Larsen, Jan

2012-01-01

Spatial audio and the possibility of interacting with the audio environment is thought to increase listeners' attention to details in a soundscape. This work examines if interactive 3D audio enhances listeners' ability to recall details in a soundscape. Nine different soundscapes were constructed...
Haptic and Audio-visual Stimuli: Enhancing Experiences and Interaction

NARCIS (Netherlands)

Nijholt, Antinus; Dijk, Esko O.; Lemmens, Paul M.C.; Luitjens, S.B.

2010-01-01

The intention of the symposium on Haptic and Audio-visual stimuli at the EuroHaptics 2010 conference is to deepen the understanding of the effect of combined Haptic and Audio-visual stimuli. The knowledge gained will be used to enhance experiences and interactions in daily life. To this end, a
Digital audio recordings improve the outcomes of patient consultations

DEFF Research Database (Denmark)

Wolderslund, Maiken; Kofoed, Poul-Erik; Holst, René

2017-01-01

OBJECTIVES: To investigate the effects on patients' outcome of the consultations when provided with: a Digital Audio Recording (DAR) of the consultation and a Question Prompt List (QPL). METHODS: This is a three-armed randomised controlled cluster trial. One group of patients received standard care......, while the other two groups received either the QPL in combination with a recording of their consultation or only the recording. Patients from four outpatient clinics participated: Paediatric, Orthopaedic, Internal Medicine, and Urology. The effects were evaluated by patient-administered questionnaires...
Design and Implementation of a linear-phase equalizer in digital audio signal processing

NARCIS (Netherlands)

Slump, Cornelis H.; van Asma, C.G.M.; Barels, J.K.P.; Barels, J.K.P.; Brunink, W.J.A; Drenth, F.B.; Pol, J.V.; Schouten, D.S.; Samsom, M.M.; Samsom, M.M.; Herrmann, O.E.

1992-01-01

This contribution presents the four phases of a project aiming at the realization in VLSI of a digital audio equalizer with a linear phase characteristic. The first step includes the identification of the system requirements, based on experience and (psycho-acoustical) literature. Secondly, the
Synchronized personalized music audio-playlists to improve adherence to physical activity among patients participating in a structured exercise program: a proof-of-principle feasibility study.

Science.gov (United States)

Alter, David A; O'Sullivan, Mary; Oh, Paul I; Redelmeier, Donald A; Marzolini, Susan; Liu, Richard; Forhan, Mary; Silver, Michael; Goodman, Jack M; Bartel, Lee R

2015-01-01

Preference-based tempo-pace synchronized music has been shown to reduce perceived physical activity exertion and improve exercise performance. The extent to which such strategies can improve adherence to physical activity remains unknown. The objective of the study is to explore the feasibility and efficacy of tempo-pace synchronized preference-based music audio-playlists on adherence to physical activity among cardiovascular disease patients participating in a cardiac rehabilitation. Thirty-four cardiac rehabilitation patients were randomly allocated to one of two strategies: (1) no music usual-care control and (2) tempo-pace synchronized audio-devices with personalized music playlists + usual-care. All songs uploaded onto audio-playlist devices took into account patient personal music genre and artist preferences. However, actual song selection was restricted to music whose tempos approximated patients' prescribed exercise walking/running pace (steps per minute) to achieve tempo-pace synchrony. Patients allocated to audio-music playlists underwent further randomization in which half of the patients received songs that were sonically enhanced with rhythmic auditory stimulation (RAS) to accentuate tempo-pace synchrony, whereas the other half did not. RAS was achieved through blinded rhythmic sonic-enhancements undertaken manually to songs within individuals' music playlists. The primary outcome consisted of the weekly volume of physical activity undertaken over 3 months as determined by tri-axial accelerometers. Statistical methods employed an intention to treat and repeated-measures design. Patients randomized to personalized audio-playlists with tempo-pace synchrony achieved higher weekly volumes of physical activity than did their non-music usual-care comparators (475.6 min vs. 370.2 min, P music usual-care controls, respectively, P music with RAS utilized their audio-playlist devices more frequently than did non-RAS music counterparts ( P �
A New Principle for a High Efficiency Power Audio Amplifier for Use with a Digital Preamplifier

DEFF Research Database (Denmark)

Jensen, Jørgen Arendt

1986-01-01

The use of class-B and class-D amlifiers for converting digital audio signals to analog signals is discussed. It is shown that the class-D amplifier is unsuitable due to distortion. Therefore, a new principle involving a switch-mode power supply and a class-B amplifier is suggested. By regulating...... the supply voltage to the amplifier according to the amplitude of the audio signal, a higher efficiency than can be obtained by the current principles is achieved. The regulation can be done very efficiently by generating the control signal to the power supply in advance of the audio signal, made possible...
A new principle for a high-efficiency power audio amplifier for use with a digital preamplifier

DEFF Research Database (Denmark)

Jensen, Jørgen Arendt

1987-01-01

The use of class-B and class-D amplifiers for converting digital audio signals to analog signals is discussed. It is shown that the class-D amplifier is unsuitable due to distortion. Therefore a new principle involving a switch-mode power supply and a class-B amplifier is suggested. By regulating...... the supply voltage to the amplifier according to the amplitude of the audio signal, a higher efficiency than can be obtained by the usual principles is achieved. The regulation can be done very efficiently by generating the control signal to the power supply in advance of the audio signal, made possible...
Audio-visual speech timing sensitivity is enhanced in cluttered conditions.

Directory of Open Access Journals (Sweden)

Warrick Roseboom

2011-04-01

Full Text Available Events encoded in separate sensory modalities, such as audition and vision, can seem to be synchronous across a relatively broad range of physical timing differences. This may suggest that the precision of audio-visual timing judgments is inherently poor. Here we show that this is not necessarily true. We contrast timing sensitivity for isolated streams of audio and visual speech, and for streams of audio and visual speech accompanied by additional, temporally offset, visual speech streams. We find that the precision with which synchronous streams of audio and visual speech are identified is enhanced by the presence of additional streams of asynchronous visual speech. Our data suggest that timing perception is shaped by selective grouping processes, which can result in enhanced precision in temporally cluttered environments. The imprecision suggested by previous studies might therefore be a consequence of examining isolated pairs of audio and visual events. We argue that when an isolated pair of cross-modal events is presented, they tend to group perceptually and to seem synchronous as a consequence. We have revealed greater precision by providing multiple visual signals, possibly allowing a single auditory speech stream to group selectively with the most synchronous visual candidate. The grouping processes we have identified might be important in daily life, such as when we attempt to follow a conversation in a crowded room.
Digital audio watermarking fundamentals, techniques and challenges

CERN Document Server

Xiang, Yong; Yan, Bin

2017-01-01

This book offers comprehensive coverage on the most important aspects of audio watermarking, from classic techniques to the latest advances, from commonly investigated topics to emerging research subdomains, and from the research and development achievements to date, to current limitations, challenges, and future directions. It also addresses key topics such as reversible audio watermarking, audio watermarking with encryption, and imperceptibility control methods. The book sets itself apart from the existing literature in three main ways. Firstly, it not only reviews classical categories of audio watermarking techniques, but also provides detailed descriptions, analysis and experimental results of the latest work in each category. Secondly, it highlights the emerging research topic of reversible audio watermarking, including recent research trends, unique features, and the potentials of this subdomain. Lastly, the joint consideration of audio watermarking and encryption is also reviewed. With the help of this...
Open soundcard as a platform for practical, laboratory study of digital audio

DEFF Research Database (Denmark)

Dimitrov, Smilen; Serafin, Stefania

2014-01-01

This article investigates how lacking suitable platforms for laboratory exercises becomes a learning problem, limiting the practical experience students gain. In engineering education, laboratory demonstration difficulty of issues like real-time streaming in digital signal and audio processing...... afforded by such laboratories, and their open nature, could testably improve the diversity of demonstrated practical topics, while maintaining engineering students' motivation....
Detecting double compression of audio signal

Science.gov (United States)

Yang, Rui; Shi, Yun Q.; Huang, Jiwu

2010-01-01

MP3 is the most popular audio format nowadays in our daily life, for example music downloaded from the Internet and file saved in the digital recorder are often in MP3 format. However, low bitrate MP3s are often transcoded to high bitrate since high bitrate ones are of high commercial value. Also audio recording in digital recorder can be doctored easily by pervasive audio editing software. This paper presents two methods for the detection of double MP3 compression. The methods are essential for finding out fake-quality MP3 and audio forensics. The proposed methods use support vector machine classifiers with feature vectors formed by the distributions of the first digits of the quantized MDCT (modified discrete cosine transform) coefficients. Extensive experiments demonstrate the effectiveness of the proposed methods. To the best of our knowledge, this piece of work is the first one to detect double compression of audio signal.
Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data

NARCIS (Netherlands)

Carmichael, J.; Larson, M.; Marlow, J.; Newman, E.; Clough, P.; Oomen, J.; Sav, S.

2008-01-01

This paper describes a multimedia multimodal information access sub-system (MIAS) for digital audio-visual documents, typically presented in streaming media format. The system is designed to provide both professional and general users with entry points into video documents that are relevant to their
Speech and audio processing for coding, enhancement and recognition

CERN Document Server

Togneri, Roberto; Narasimha, Madihally

2015-01-01

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization are also presented, along with recent advances and new paradigms in these areas. · Offers readers a single-source reference on the significant applications of speech and audio processing to speech coding, speech enhancement and speech/speaker recognition. Enables readers involved in algorithm development and implementation issues for speech coding to understand the historical development and future challenges in speech coding research; · Discusses speech coding methods yielding bit-streams that are multi-rate and scalable for Voice-over-IP (VoIP) Networks; · �...
Music preferences based on audio features, and its relation to personality

OpenAIRE

Dunn, Greg

2009-01-01

Recent studies have summarized reported music preferences by genre into four broadly defined categories, which relate to various personality characteristics. Other research has indicated that genre classification is ambiguous and inconsistent. This ambiguity suggests that research relating personality to music preferences based on genre could benefit from a more objective definition of music. This problem is addressed by investigating how music preferences linked to objective audio features r...
A Method to Detect AAC Audio Forgery

Directory of Open Access Journals (Sweden)

Qingzhong Liu

2015-08-01

Full Text Available Advanced Audio Coding (AAC, a standardized lossy compression scheme for digital audio, which was designed to be the successor of the MP3 format, generally achieves better sound quality than MP3 at similar bit rates. While AAC is also the default or standard audio format for many devices and AAC audio files may be presented as important digital evidences, the authentication of the audio files is highly needed but relatively missing. In this paper, we propose a scheme to expose tampered AAC audio streams that are encoded at the same encoding bit-rate. Specifically, we design a shift-recompression based method to retrieve the differential features between the re-encoded audio stream at each shifting and original audio stream, learning classifier is employed to recognize different patterns of differential features of the doctored forgery files and original (untouched audio files. Experimental results show that our approach is very promising and effective to detect the forgery of the same encoding bit-rate on AAC audio streams. Our study also shows that shift recompression-based differential analysis is very effective for detection of the MP3 forgery at the same bit rate.
Paper-Based Textbooks with Audio Support for Print-Disabled Students.

Science.gov (United States)

Fujiyoshi, Akio; Ohsawa, Akiko; Takaira, Takuya; Tani, Yoshiaki; Fujiyoshi, Mamoru; Ota, Yuko

2015-01-01

Utilizing invisible 2-dimensional codes and digital audio players with a 2-dimensional code scanner, we developed paper-based textbooks with audio support for students with print disabilities, called "multimodal textbooks." Multimodal textbooks can be read with the combination of the two modes: "reading printed text" and "listening to the speech of the text from a digital audio player with a 2-dimensional code scanner." Since multimodal textbooks look the same as regular textbooks and the price of a digital audio player is reasonable (about 30 euro), we think multimodal textbooks are suitable for students with print disabilities in ordinary classrooms.
TC9447F, single-chip DSP (digital signal processor) for audio; 1 chip audio yo DSP LSI TC9447F

Energy Technology Data Exchange (ETDEWEB)

NONE

1999-03-01

TC9447F is a single-chip DSP for audio which builds in 2-channel AD converter/4-channel DA converter. It can build various application programs such as the sound field control like hall simulation, digital filter like equalizer, and dynamic range control, in the program memory (ROM). Further, it builds in {+-}10dB trim use electronic volume for two channels. It also builds data delay use RAM (64K-bit) in, so no RAM to be separately attached is necessary. (translated by NEDO)

47 CFR 25.144 - Licensing provisions for the 2.3 GHz satellite digital audio radio service.

Science.gov (United States)

2010-10-01

... 47 Telecommunication 2 2010-10-01 2010-10-01 false Licensing provisions for the 2.3 GHz satellite digital audio radio service. 25.144 Section 25.144 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES SATELLITE COMMUNICATIONS Applications and Licenses Space Stations § 25...
47 CFR 25.214 - Technical requirements for space stations in the satellite digital audio radio service and...

Science.gov (United States)

2010-10-01

... 47 Telecommunication 2 2010-10-01 2010-10-01 false Technical requirements for space stations in the satellite digital audio radio service and associated terrestrial repeaters. 25.214 Section 25.214 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES SATELLITE COMMUNICATIONS...
AUTOMATIC SEGMENTATION OF BROADCAST AUDIO SIGNALS USING AUTO ASSOCIATIVE NEURAL NETWORKS

Directory of Open Access Journals (Sweden)

P. Dhanalakshmi

2010-12-01

Full Text Available In this paper, we describe automatic segmentation methods for audio broadcast data. Today, digital audio applications are part of our everyday lives. Since there are more and more digital audio databases in place these days, the importance of effective management for audio databases have become prominent. Broadcast audio data is recorded from the Television which comprises of various categories of audio signals. Efficient algorithms for segmenting the audio broadcast data into predefined categories are proposed. Audio features namely Linear prediction coefficients (LPC, Linear prediction cepstral coefficients, and Mel frequency cepstral coefficients (MFCC are extracted to characterize the audio data. Auto Associative Neural Networks are used to segment the audio data into predefined categories using the extracted features. Experimental results indicate that the proposed algorithms can produce satisfactory results.
Audio- and TV-products. Power consumption reduction in audio- and TV-products. Final report; Audio- og TV-produkter. Effektminimering i audio- og TV-produkter: Afsluttende rapport

Energy Technology Data Exchange (ETDEWEB)

Kierkegaard, P.

1998-10-01

The project concerning the audio products resulted in energy savings of 90-97% at efficiencies of 91-96% with full effect and stand-by losses of 0.4-3 W. It is especially new epoch-making methods for pulse modulation (called Controlled Oscillation Modulator, COM and Phase Shifted Carrier Pulse Width Modulation, PSCPWM) and error for correction in the effect conversion (called Multivariable Enhanced Cascade Control, MECC and Pulse Edge Delay Error Correction, PEDEC), which has made the breakthrough. Two patents have been applied for, and new digital amplifiers will be introduced in all the relevant products. The project concerning TV products has shown that a loss reduction in deflecting circuits of ca.20 % may be obtained. (EHS)
Two-dimensional block-based reception for differentially encoded OFDM systems : a study on improved reception techniques for digital audio broadcasting systems

NARCIS (Netherlands)

Houtum, van W.J.

2012-01-01

Digital audio broadcast (DAB), DAB+ and Terrestrial-Digital Multimedia Broadcasting (T-DMB) systems use multi-carrier modulation (MCM). The principle of MCM in the DAB-family is based on orthogonal frequency division multiplexing (OFDM), for which every subcarrier is modulated by p 4 differentially
An Enhanced Data Integrity Model In Mobile Cloud Environment Using Digital Signature Algorithm And Robust Reversible Watermarking

Directory of Open Access Journals (Sweden)

Boukari Souley

2017-10-01

Full Text Available the increase use of hand held devices such as smart phones to access multimedia content in the cloud is increasing with rise and growth in information technology. Mobile cloud computing is increasingly used today because it allows users to have access to variety of resources in the cloud such as image video audio and software applications with minimal usage of their inbuilt resources such as storage memory by using the one available in the cloud. The major challenge faced with mobile cloud computing is security. Watermarking and digital signature are some techniques used to provide security and authentication on user data in the cloud. Watermarking is a technique used to embed digital data within a multimedia content such as image video or audio in order to prevent authorized access to those content by intruders whereas digital signature is used to identify and verify user data when accessed. In this work we implemented digital signature and robust reversible image watermarking in order enhance mobile cloud computing security and integrity of data by providing double authentication techniques. The results obtained show the effectiveness of combining the two techniques robust reversible watermarking and digital signature by providing strong authentication to ensures data integrity and extract the original content watermarked without changes.
Concierge: Personal database software for managing digital research resources

Directory of Open Access Journals (Sweden)

Hiroyuki Sakai

2007-11-01

Full Text Available This article introduces a desktop application, named Concierge, for managing personal digital research resources. Using simple operations, it enables storage of various types of files and indexes them based on content descriptions. A key feature of the software is a high level of extensibility. By installing optional plug-ins, users can customize and extend the usability of the software based on their needs. In this paper, we also introduce a few optional plug-ins: literaturemanagement, electronic laboratory notebook, and XooNlps client plug-ins. XooNIps is a content management system developed to share digital research resources among neuroscience communities. It has been adopted as the standard database system in Japanese neuroinformatics projects. Concierge, therefore, offers comprehensive support from management of personal digital research resources to their sharing in open-access neuroinformatics databases such as XooNIps. This interaction between personal and open-access neuroinformatics databases is expected to enhance the dissemination of digital research resources. Concierge is developed as an open source project; Mac OS X and Windows XP versions have been released at the official site (http://concierge.sourceforge.jp.
Ambiguity Function Analysis and Processing for Passive Radar Based on CDR Digital Audio Broadcasting

Directory of Open Access Journals (Sweden)

Zhang Qiang

2015-01-01

Full Text Available China Digital Radio (CDR broadcasting is a new standard of digital audio broadcasting of FM frequency (87–108 MHz based on our research and development efforts. It is compatible with the frequency spectrum in analog FM radio and satisfies the requirements for smooth transition from analog to digital signal in FM broadcasting in China. This paper focuses on the signal characteristics and processing methods of radio-based passive radar. The signal characteristics and ambiguity function of a passive radar illumination source are analyzed. The adverse effects on the target detection of the side peaks owing to cyclic prefix, the Doppler ambiguity strips because of signal synchronization, and the range of side peaks resulting from the signal discontinuous spectrum are then studied. Finally, methods for suppressing these side peaks are proposed and their effectiveness is verified by simulations.
Intelligent audio analysis

CERN Document Server

Schuller, Björn W

2013-01-01

This book provides the reader with the knowledge necessary for comprehension of the field of Intelligent Audio Analysis. It firstly introduces standard methods and discusses the typical Intelligent Audio Analysis chain going from audio data to audio features to audio recognition. Further, an introduction to audio source separation, and enhancement and robustness are given. After the introductory parts, the book shows several applications for the three types of audio: speech, music, and general sound. Each task is shortly introduced, followed by a description of the specific data and methods applied, experiments and results, and a conclusion for this specific task. The books provides benchmark results and standardized test-beds for a broader range of audio analysis tasks. The main focus thereby lies on the parallel advancement of realism in audio analysis, as too often today’s results are overly optimistic owing to idealized testing conditions, and it serves to stimulate synergies arising from transfer of ...
CREATING AUDIO VISUAL DIALOGUE TASK AS STUDENTS’ SELF ASSESSMENT TO ENHANCE THEIR SPEAKING ABILITY

Directory of Open Access Journals (Sweden)

Novia Trisanti

2017-04-01

Full Text Available The study is about giving overview of employing audio visual dialogue task as students creativity task and self assessment in EFL speaking class of tertiary education to enhance the students speaking ability. The qualitative research was done in one of the speaking classes at English Department, Semarang State University, Central Java, Indonesia. The results that can be seen from the rubric of self assessment show that the oral performance through audio visual recorded tasks done by the students as their self assessment gave positive evidences. The audio visual dialogue task can be very beneficial since it can motivate the students learning and increase their learning experiences. The self-assessment can be a valuable additional means to improve their speaking ability since it is one of the motives that drive self- evaluatioan, along with self- verification and self- enhancement.
Can We Afford These Affordances? GarageBand and the Double-Edged Sword of the Digital Audio Workstation

Science.gov (United States)

Bell, Adam Patrick

2015-01-01

The proliferation of computers, tablets, and smartphones has resulted in digital audio workstations (DAWs) such as GarageBand in being some of the most widely distributed musical instruments. Positing that software designers are dictating the music education of DAW-dependent music-makers, I examine the fallacy that music-making applications such…
Musical examination to bridge audio data and sheet music

Science.gov (United States)

Pan, Xunyu; Cross, Timothy J.; Xiao, Liangliang; Hei, Xiali

2015-03-01

The digitalization of audio is commonly implemented for the purpose of convenient storage and transmission of music and songs in today's digital age. Analyzing digital audio for an insightful look at a specific musical characteristic, however, can be quite challenging for various types of applications. Many existing musical analysis techniques can examine a particular piece of audio data. For example, the frequency of digital sound can be easily read and identified at a specific section in an audio file. Based on this information, we could determine the musical note being played at that instant, but what if you want to see a list of all the notes played in a song? While most existing methods help to provide information about a single piece of the audio data at a time, few of them can analyze the available audio file on a larger scale. The research conducted in this work considers how to further utilize the examination of audio data by storing more information from the original audio file. In practice, we develop a novel musical analysis system Musicians Aid to process musical representation and examination of audio data. Musicians Aid solves the previous problem by storing and analyzing the audio information as it reads it rather than tossing it aside. The system can provide professional musicians with an insightful look at the music they created and advance their understanding of their work. Amateur musicians could also benefit from using it solely for the purpose of obtaining feedback about a song they were attempting to play. By comparing our system's interpretation of traditional sheet music with their own playing, a musician could ensure what they played was correct. More specifically, the system could show them exactly where they went wrong and how to adjust their mistakes. In addition, the application could be extended over the Internet to allow users to play music with one another and then review the audio data they produced. This would be particularly
An introduction to audio content analysis applications in signal processing and music informatics

CERN Document Server

Lerch, Alexander

2012-01-01

"With the proliferation of digital audio distribution over digital media, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory, as well as downloadable MATLAB files are also included"--
System Level Power Optimization of Digital Audio Back End for Hearing Aids

DEFF Research Database (Denmark)

Pracny, Peter; Jørgensen, Ivan Harald Holger; Bruun, Erik

2017-01-01

This work deals with power optimization of the audio processing back end for hearing aids - the interpolation filter (IF), the sigma-delta (SD modulator and the Class D power amplifier (PA) as a whole. Specifications are derived and insight into the tradeoffs involved is used to optimize...... the interpolation filter and the SD modulator on the system level so that the switching frequency of the Class D PA - the main power consumer in the back end - is minimized. A figure-of-merit (FOM) which allows judging the power consumption of the digital part of the back end early in the design process is used...
WLAN Technologies for Audio Delivery

Directory of Open Access Journals (Sweden)

Nicolas-Alexander Tatlas

2007-01-01

Full Text Available Audio delivery and reproduction for home or professional applications may greatly benefit from the adoption of digital wireless local area network (WLAN technologies. The most challenging aspect of such integration relates the synchronized and robust real-time streaming of multiple audio channels to multipoint receivers, for example, wireless active speakers. Here, it is shown that current WLAN solutions are susceptible to transmission errors. A detailed study of the IEEE802.11e protocol (currently under ratification is also presented and all relevant distortions are assessed via an analytical and experimental methodology. A novel synchronization scheme is also introduced, allowing optimized playback for multiple receivers. The perceptual audio performance is assessed for both stereo and 5-channel applications based on either PCM or compressed audio signals.
Enhanced audio-visual interactions in the auditory cortex of elderly cochlear-implant users.

Science.gov (United States)

Schierholz, Irina; Finke, Mareike; Schulte, Svenja; Hauthal, Nadine; Kantzke, Christoph; Rach, Stefan; Büchner, Andreas; Dengler, Reinhard; Sandmann, Pascale

2015-10-01

Auditory deprivation and the restoration of hearing via a cochlear implant (CI) can induce functional plasticity in auditory cortical areas. How these plastic changes affect the ability to integrate combined auditory (A) and visual (V) information is not yet well understood. In the present study, we used electroencephalography (EEG) to examine whether age, temporary deafness and altered sensory experience with a CI can affect audio-visual (AV) interactions in post-lingually deafened CI users. Young and elderly CI users and age-matched NH listeners performed a speeded response task on basic auditory, visual and audio-visual stimuli. Regarding the behavioral results, a redundant signals effect, that is, faster response times to cross-modal (AV) than to both of the two modality-specific stimuli (A, V), was revealed for all groups of participants. Moreover, in all four groups, we found evidence for audio-visual integration. Regarding event-related responses (ERPs), we observed a more pronounced visual modulation of the cortical auditory response at N1 latency (approximately 100 ms after stimulus onset) in the elderly CI users when compared with young CI users and elderly NH listeners. Thus, elderly CI users showed enhanced audio-visual binding which may be a consequence of compensatory strategies developed due to temporary deafness and/or degraded sensory input after implantation. These results indicate that the combination of aging, sensory deprivation and CI facilitates the coupling between the auditory and the visual modality. We suggest that this enhancement in multisensory interactions could be used to optimize auditory rehabilitation, especially in elderly CI users, by the application of strong audio-visually based rehabilitation strategies after implant switch-on. Copyright © 2015 Elsevier B.V. All rights reserved.
Efficiency in audio processing : filter banks and transcoding

NARCIS (Netherlands)

Lee, Jun Wei

2007-01-01

Audio transcoding is the conversion of digital audio from one compressed form A to another compressed form B, where A and B have different compression properties, such as a different bit-rate, sampling frequency or compression method. This is typically achieved by decoding A to an intermediate
A digital input class-D audio amplifier with sixth-order PWM

International Nuclear Information System (INIS)

Luo Shumeng; Li Dongmei

2013-01-01

A digital input class-D audio amplifier with a sixth-order pulse-width modulation (PWM) modulator is presented. This modulator moves the PWM generator into the closed sigma—delta modulator loop. The noise and distortions generated at the PWM generator module are suppressed by the high gain of the forward loop of the sigma—delta modulator. Therefore, at the output of the modulator, a very clean PWM signal is acquired for driving the power stage of the class-D amplifier. A sixth-order modulator is designed to balance the performance and the system clock speed. Fabricated in standard 0.18 μm CMOS technology, this class-D amplifier achieves 110 dB dynamic range, 100 dB signal-to-noise rate, and 0.0056% total harmonic distortion plus noise. (semiconductor integrated circuits)
DISEÑO DE UN MICROSISTEMA PROGRAMABLE PARA EFECTOS DE AUDIO DIGITAL USANDO FPGAS

Directory of Open Access Journals (Sweden)

John Michael Espinosa Durán

Full Text Available Este artículo describe el diseño de un microsistema programable para el procesamiento de efectos de audio digital implementado en un FPGA. El microsistema es diseñado usando un procesador de propósito específico y reconfigurable, un banco de RAMs y una interfaz gráfica de usuario basada en una pantalla táctil LCD. El procesador es diseñado usando 15 efectos de audio basados en retardos y procesamiento en el dominio dinámico y de la frecuencia. Los efectos son diseñados usando Megafunciones y el compilador FIR de Quartus II, son simulados en Simulink5 usando DSP Builder6, y son configurados utilizando una interfaz gráfica de usuario. El microsistema programable es implementado en el sistema de desarrollo DE2-70, y su funcionamiento es verificado usando un reproductor MP3 y un parlante. Adicionalmente, el microsistema permite la generación de efectos con alta fidelidad usando una tasa de muestreo máxima de 195.62 MSPS, y puede ser embebido en un SoC.
A conceptual framework for the design and analysis of first-person shooter audio and its potential use for game engines

DEFF Research Database (Denmark)

Grimshaw, Mark Nicholas; Schott, Gareth

2007-01-01

We introduce and describe a new conceptual framework for the design and analysis of audio for immersive first-person shooter games, and discuss its potential implications for the development of the audio component of game engines. The framework was created in order to illustrate and acknowledge...... the direct role of in-game audio in shaping player-player interactions and in creating a sense of immersion in the game world. Furthermore, it is argued that the relationship between player and sound is best conceptualized theoretically as an acoustic ecology. Current game engines are capable of game world...... spatiality through acoustic shading, but the ideas presented here provide a framework to explore other immersive possibilities for game audio through realtime synthesis....

Digital watermarking techniques and trends

CERN Document Server

Nematollahi, Mohammad Ali; Rosales, Hamurabi Gamboa

2017-01-01

This book presents the state-of-the-arts application of digital watermarking in audio, speech, image, video, 3D mesh graph, text, software, natural language, ontology, network stream, relational database, XML, and hardware IPs. It also presents new and recent algorithms in digital watermarking for copyright protection and discusses future trends in the field. Today, the illegal manipulation of genuine digital objects and products represents a considerable problem in the digital world. Offering an effective solution, digital watermarking can be applied to protect intellectual property, as well as fingerprinting, enhance the security and proof-of-authentication through unsecured channels.
Nurses' satisfaction with use of a personal digital assistants with a mobile nursing information system in China.

Science.gov (United States)

Shen, Li-Qiong; Zang, Xiao-Ying; Cong, Ji-Yan

2018-04-01

Personal digital assistants, technology with various functions, have been applied in international clinical practice. Great benefits in reducing medical errors and enhancing the efficiency of clinical work have been achieved, but little research has investigated nurses' satisfaction with the use of personal digital assistants. To investigate nurses' satisfaction with use of personal digital assistants, and to explore the predictors of this. This is a cross-sectional descriptive study. We conducted a cross-sectional survey targeting nurses who used personal digital assistants in a comprehensive tertiary hospital in Beijing. A total of 383 nurses were recruited in this survey in 2015. The total score of nurses' satisfaction with use of personal digital assistants was 238.91 (SD 39.25). Nurses were less satisfied with the function of documentation, compared with the function of administering medical orders. The time length of using personal digital assistants, academic degree, and different departments predicted nurses' satisfaction towards personal digital assistant use (all P < 0.05). Nurses were satisfied with the accuracy of administering medical orders and the safety of recording data. The stability of the wireless network and efficiency related to nursing work were less promising. To some extent, nurses with higher education and longer working time with personal digital assistants were more satisfied with them. © 2018 John Wiley & Sons Australia, Ltd.
Unimodal Learning Enhances Crossmodal Learning in Robotic Audio-Visual Tracking

DEFF Research Database (Denmark)

Shaikh, Danish; Bodenhagen, Leon; Manoonpong, Poramate

2017-01-01

Crossmodal sensory integration is a fundamental feature of the brain that aids in forming an coherent and unified representation of observed events in the world. Spatiotemporally correlated sensory stimuli brought about by rich sensorimotor experiences drive the development of crossmodal integrat...... a non-holonomic robotic agent towards a moving audio-visual target. Simulation results demonstrate that unimodal learning enhances crossmodal learning and improves both the overall accuracy and precision of multisensory orientation response....
Unimodal Learning Enhances Crossmodal Learning in Robotic Audio-Visual Tracking

DEFF Research Database (Denmark)

Shaikh, Danish; Bodenhagen, Leon; Manoonpong, Poramate

2018-01-01

Crossmodal sensory integration is a fundamental feature of the brain that aids in forming an coherent and unified representation of observed events in the world. Spatiotemporally correlated sensory stimuli brought about by rich sensorimotor experiences drive the development of crossmodal integrat...... a non-holonomic robotic agent towards a moving audio-visual target. Simulation results demonstrate that unimodal learning enhances crossmodal learning and improves both the overall accuracy and precision of multisensory orientation response....
Exploration of a digital audio processing platform using a compositional system level performance estimation framework

DEFF Research Database (Denmark)

Tranberg-Hansen, Anders Sejer; Madsen, Jan

2009-01-01

This paper presents the application of a compositional simulation based system-level performance estimation framework on a non-trivial industrial case study. The case study is provided by the Danish company Bang & Olufsen ICEpower a/s and focuses on the exploration of a digital mobile audio...... processing platform. A short overview of the compositional performance estimation framework used is given followed by a presentation of how it is used for performance estimation using an iterative refinement process towards the final implementation. Finally, an evaluation in terms of accuracy and speed...
Digital communication communication, multimedia, security

CERN Document Server

Meinel, Christoph

2014-01-01

The authors give a detailed summary about the fundamentals and the historical background of digital communication. This includes an overview of the encoding principles and algorithms of textual information, audio information, as well as images, graphics, and video in the Internet. Furthermore the fundamentals of computer networking, digital security and cryptography are covered. Thus, the book provides a well-founded access to communication technology of computer networks, the internet and the WWW. Numerous pictures and images, a subject-index and a detailed list of historical personalities in
Temporal digital subtraction radiography with a personal computer digital workstation

International Nuclear Information System (INIS)

Kircos, L.; Holt, W.; Khademi, J.

1990-01-01

Technique have been developed and implemented on a personal computer (PC)-based digital workstation to accomplish temporal digital subtraction radiography (TDSR). TDSR is useful in recording radiologic change over time. Thus, this technique is useful not only for monitoring chronic disease processes but also for monitoring the temporal course of interventional therapies. A PC-based digital workstation was developed on a PC386 platform with add-in hardware and software. Image acquisition, storage, and processing was accomplished using 512 x 512 x 8- or 12-bit frame grabber. Software and hardware were developed to accomplish image orientation, registration, gray scale compensation, subtraction, and enhancement. Temporal radiographs of the jaws were made in a fixed and reproducible orientation between the x-ray source and image receptor enabling TDSR. Temporal changes secondary to chronic periodontal disease, osseointegration of endosseous implants, and wound healing were demonstrated. Use of TDSR for chest imaging was also demonstrated with identification of small, subtle focal masses that were not apparent with routine viewing. The large amount of radiologic information in images of the jaws and chest may obfuscate subtle changes that TDSR seems to identify. TDSR appears to be useful as a tool to record temporal and subtle changes in radiologic images
Design and Implementation of a Video-Zoom Driven Digital Audio-Zoom System for Portable Digital Imaging Devices

Science.gov (United States)

Park, Nam In; Kim, Seon Man; Kim, Hong Kook; Kim, Ji Woon; Kim, Myeong Bo; Yun, Su Won

In this paper, we propose a video-zoom driven audio-zoom algorithm in order to provide audio zooming effects in accordance with the degree of video-zoom. The proposed algorithm is designed based on a super-directive beamformer operating with a 4-channel microphone system, in conjunction with a soft masking process that considers the phase differences between microphones. Thus, the audio-zoom processed signal is obtained by multiplying an audio gain derived from a video-zoom level by the masked signal. After all, a real-time audio-zoom system is implemented on an ARM-CORETEX-A8 having a clock speed of 600 MHz after different levels of optimization are performed such as algorithmic level, C-code, and memory optimizations. To evaluate the complexity of the proposed real-time audio-zoom system, test data whose length is 21.3 seconds long is sampled at 48 kHz. As a result, it is shown from the experiments that the processing time for the proposed audio-zoom system occupies 14.6% or less of the ARM clock cycles. It is also shown from the experimental results performed in a semi-anechoic chamber that the signal with the front direction can be amplified by approximately 10 dB compared to the other directions.
Audio watermarking robust against D/A and A/D conversions

Directory of Open Access Journals (Sweden)

Xiang Shijun

2011-01-01

Full Text Available Abstract Digital audio watermarking robust against digital-to-analog (D/A and analog-to-digital (A/D conversions is an important issue. In a number of watermark application scenarios, D/A and A/D conversions are involved. In this article, we first investigate the degradation due to DA/AD conversions via sound cards, which can be decomposed into volume change, additional noise, and time-scale modification (TSM. Then, we propose a solution for DA/AD conversions by considering the effect of the volume change, additional noise and TSM. For the volume change, we introduce relation-based watermarking method by modifying groups of the energy relation of three adjacent DWT coefficient sections. For the additional noise, we pick up the lowest-frequency coefficients for watermarking. For the TSM, the synchronization technique (with synchronization codes and an interpolation processing operation is exploited. Simulation tests show the proposed audio watermarking algorithm provides a satisfactory performance to DA/AD conversions and those common audio processing manipulations.
Los cinco grados de la comunicación en educación Oral-gestural, writing, audio, audiovisual and… digital? The five degrees of communication in education

Directory of Open Access Journals (Sweden)

José María Perceval Verde

2008-03-01

Full Text Available En el marco del presente artículo se presenta un recorrido a través de cinco grados de la comunicación en educación: el oral-gestual, la escritura, el audio, el audiovisual y el digital. El texto destaca los cambios que el escenario on-line introduce en el proceso educativo reflexionando sobre la figura del estudiante, del docente y de las relaciones entre ambos. This article portrays an overview of the five degrees of comunication in education: oralgestural, writing, audio, audiovisual and digital. It highlights the changes introduced by the on-line scenario in the educational process, reflecting on the character of the student, the teacher and the relationship between them.
Harmonic Enhancement in Low Bitrate Audio Coding Using an Efficient Long-Term Predictor

Directory of Open Access Journals (Sweden)

Song Jeongook

2010-01-01

Full Text Available This paper proposes audio coding using an efficient long-term prediction method to enhance the perceptual quality of audio codecs to speech input signals at low bit-rates. The MPEG-4 AAC-LTP exploited a similar concept, but its improvement was not significant because of small prediction gain due to long prediction lags and aliased components caused by the transformation with a time-domain aliasing cancelation (TDAC technique. The proposed algorithm increases the prediction gain by employing a deharmonizing predictor and a long-term compensation filter. The look-back memory elements are first constructed by applying the de-harmonizing predictor to the input signal, then the prediction residual is encoded and decoded by transform audio coding. Finally, the long-term compensation filter is applied to the updated look-back memory of the decoded prediction residual to obtain synthesized signals. Experimental results show that the proposed algorithm has much lower spectral distortion and higher perceptual quality than conventional approaches especially for harmonic signals, such as voiced speech.
Realtime Audio with Garbage Collection

OpenAIRE

Matheussen, Kjetil Svalastog

2010-01-01

Two non-moving concurrent garbage collectors tailored for realtime audio processing are described. Both collectors work on copies of the heap to avoid cache misses and audio-disruptive synchronizations. Both collectors are targeted at multiprocessor personal computers. The first garbage collector works in uncooperative environments, and can replace Hans Boehm's conservative garbage collector for C and C++. The collector does not access the virtual memory system. Neither doe...
Mixed-Signal Architectures for High-Efficiency and Low-Distortion Digital Audio Processing and Power Amplification

Directory of Open Access Journals (Sweden)

Pierangelo Terreni

2010-01-01

Full Text Available The paper addresses the algorithmic and architectural design of digital input power audio amplifiers. A modelling platform, based on a meet-in-the-middle approach between top-down and bottom-up design strategies, allows a fast but still accurate exploration of the mixed-signal design space. Different amplifier architectures are configured and compared to find optimal trade-offs among different cost-functions: low distortion, high efficiency, low circuit complexity and low sensitivity to parameter changes. A novel amplifier architecture is derived; its prototype implements digital processing IP macrocells (oversampler, interpolating filter, PWM cross-point deriver, noise shaper, multilevel PWM modulator, dead time compensator on a single low-complexity FPGA while off-chip components are used only for the power output stage (LC filter and power MOS bridge; no heatsink is required. The resulting digital input amplifier features a power efficiency higher than 90% and a total harmonic distortion down to 0.13% at power levels of tens of Watts. Discussions towards the full-silicon integration of the mixed-signal amplifier in embedded devices, using BCD technology and targeting power levels of few Watts, are also reported.
Can audio recording improve patients' recall of outpatient consultations?

DEFF Research Database (Denmark)

Wolderslund, Maiken; Kofoed, Poul-Erik; Axboe, Mette

Introduction In order to give patients possibility to listen to their consultation again, we have designed a system which gives the patients access to digital audio recordings of their consultations. An Interactive Voice Response platform enables the audio recording and gives the patients access...... and those who have not (control).The audio recordings and the interviews are coded according to six themes: Test results, Treatment, Risks, Future tests, Advice and Plan. Afterwards the extent of patients recall is assessed by comparing the accuracy of the patient’s statements (interview...
Design of batch audio/video conversion platform based on JavaEE

Science.gov (United States)

Cui, Yansong; Jiang, Lianpin

2018-03-01

With the rapid development of digital publishing industry, the direction of audio / video publishing shows the diversity of coding standards for audio and video files, massive data and other significant features. Faced with massive and diverse data, how to quickly and efficiently convert to a unified code format has brought great difficulties to the digital publishing organization. In view of this demand and present situation in this paper, basing on the development architecture of Sptring+SpringMVC+Mybatis, and combined with the open source FFMPEG format conversion tool, a distributed online audio and video format conversion platform with a B/S structure is proposed. Based on the Java language, the key technologies and strategies designed in the design of platform architecture are analyzed emphatically in this paper, designing and developing a efficient audio and video format conversion system, which is composed of “Front display system”, "core scheduling server " and " conversion server ". The test results show that, compared with the ordinary audio and video conversion scheme, the use of batch audio and video format conversion platform can effectively improve the conversion efficiency of audio and video files, and reduce the complexity of the work. Practice has proved that the key technology discussed in this paper can be applied in the field of large batch file processing, and has certain practical application value.
Exploring the Role of In-Person Components for Online Health Behavior Change Interventions: Can a Digital Person-to-Person Component Suffice?

Science.gov (United States)

Santarossa, Sara; Kane, Deborah; Senn, Charlene Y; Woodruff, Sarah J

2018-04-11

The growth of the digital environment provides tremendous opportunities to revolutionize health behavior change efforts. This paper explores the use of Web-based, mobile, and social media health behavior change interventions and determines whether there is a need for a face-to-face or an in-person component. It is further argued that that although in-person components can be beneficial for online interventions, a digital person-to-person component can foster similar results while dealing with challenges faced by traditional intervention approaches. Using a digital person-to-person component is rooted in social and behavioral theories such as the theory of reasoned action, and the social cognitive theory, and further justified by the human support constructs of the model of supportive accountability. Overall, face-to-face and online behavior change interventions have their respective advantages and disadvantages and functions, yet both serve important roles. It appears that it is in fact human support that is the most important component in the effectiveness and adherence of both face-to-face and online behavior change interventions, and thoughtfully introducing a digital person-to-person component, to replace face-to-face interactions, can provide the needed human support while diminishing the barriers of in-person meetings. The digital person-to-person component must create accountability, generate opportunities for tailored feedback, and create social support to successfully create health behavior change. As the popularity of the online world grows, and the interest in using the digital environment for health behavior change interventions continues to be embraced, further research into not only the use of online interventions, but the use of a digital person-to-person component, must be explored. ©Sara Santarossa, Deborah Kane, Charlene Y Senn, Sarah J Woodruff. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.04.2018.
Investigating the Effectiveness of Audio Input Enhancement on EFL Learners' Retention of Intensifiers

Science.gov (United States)

Negari, Giti Mousapour; Azizi, Aliye; Arani, Davood Khedmatkar

2018-01-01

The present study attempted to investigate the effects of audio input enhancement on EFL learners' retention of intensifiers. To this end, two research questions were formulated. In order to address these research questions, this study attempted to reject two null hypotheses. Pretest-posttest control group quasi-experimental design was employed to…
Digital signal processing methods and algorithms for audio conferencing systems

OpenAIRE

Lindström, Fredric

2007-01-01

Today, we are interconnected almost all over the planet. Large multinational companies operate worldwide, but also an increasing number of small and medium sized companies do business overseas. As people travel to meet and do businesses, the already exposed earth is subject to even more strain. Audio conferencing is an attractive alternative to travel, which is becoming more and more appreciated. Audio conferences can of course not replace all types of meetings, but can help companies to cut ...
Sistema de adquisición y procesamiento de audio

OpenAIRE

Pérez Segurado, Rubén

2015-01-01

El objetivo de este proyecto es el diseño y la implementación de una plataforma para un sistema de procesamiento de audio. El sistema recibirá una señal de audio analógica desde una fuente de audio, permitirá realizar un tratamiento digital de dicha señal y generará una señal procesada que se enviará a unos altavoces externos. Para la realización del sistema de procesamiento se empleará: - Un dispositivo FPGA de Lattice, modelo MachX02-7000-HE, en la cual estarán todas la...
Car audio using DSP for active sound control. DSP ni yoru active seigyo wo mochiita audio

Energy Technology Data Exchange (ETDEWEB)

Yamada, K.; Asano, S.; Furukawa, N. (Mitsubishi Motor Corp., Tokyo (Japan))

1993-06-01

In the automobile cabin, there are some unique problems which spoil the quality of sound reproduction from audio equipment, such as the narrow space and/or the background noise. The audio signal processing by using DSP (digital signal processor) makes enable a solution to these problems. A car audio with a high amenity has been successfully made by the active sound control using DSP. The DSP consists of an adder, coefficient multiplier, delay unit, and connections. For the actual processing by DSP, are used functions, such as sound field correction, response and processing of noises during driving, surround reproduction, graphic equalizer processing, etc. High effectiveness of the method was confirmed through the actual driving evaluation test. The present paper describes the actual method of sound control technology using DSP. Especially, the dynamic processing of the noise during driving is discussed in detail. 1 ref., 12 figs., 1 tab.

A 240W Monolithic Class-D Audio Amplifier Output Stage

DEFF Research Database (Denmark)

Nyboe, Flemming; Kaya, Cetin; Risbo, Lars

2006-01-01

A single-channel class-D audio amplifier output stage outputs 240W undipped into 4Omega 0.1% open-loop THD+N allows using the device in a fully-digital audio signal path with no feedback. The output current capability is plusmn18A and the part is fabricated in a 0.4mum/1.8mum high-voltage Bi...
Personal Sports Branding in the Digital Age: The Case of Zlatan Ibrahimovic

OpenAIRE

Samoylina, Ekaterina

2015-01-01

The rise of digital media has caused transformations and new phenomena in different fields. In the digital age such branches as personal sports branding and nation branding has acquired new opportunities for development. The research focuses on representation of the personal sports brand of Zlatan Ibrahimovic on digital media platforms and its connection to the nation brand of Sweden. Previous research deals with existing studies on personal branding, personal sports branding in digital media...
Gateway of Sound: Reassessing the Role of Audio Mastering in the Art of Record Production

Directory of Open Access Journals (Sweden)

Carlo Nardi

2014-06-01

Full Text Available Audio mastering, notwithstanding an apparent lack of scholarly attention, is a crucial gateway between production and consumption and, as such, is worth further scrutiny, especially in music genres like house or techno, which place great emphasis on sound production qualities. In this article, drawing on personal interviews with mastering engineers and field research in mastering studios in Italy and Germany, I investigate the practice of mastering engineering, paying close attention to the negotiation of techniques and sound aesthetics in relation to changes in the industry formats and, in particular, to the growing shift among DJs from vinyl to compressed digital formats. I then discuss the specificity of audio mastering in relation to EDM, insofar as DJs and controllerists conceive of the master, rather than as a finished product destined to listening, as raw material that can be reworked in performance.
[The digital reprocessing of under- and overexposed x-ray films with a personal computer].

Science.gov (United States)

Fuhrmann, R; Diedrich, P

1993-02-01

An image processing work station for digitalizing and interactively manipulating under- and overexposed X-rays was set up by adding modules to an IBM compatible personal computer. Overexposed X-rays can be qualitatively enhanced by means of controlled manipulation of contrast and brightness and by means of the use of various digital filtering techniques. With underexposed X-rays an equalized grey scale can be achieved by means of regulating contrast and brightness. Digital filtering is not required. To assure a high degree of anatomical detail (periodontal ligament) in the digitalized image a maximum pixel of 0.1 mm was defined as a qualitative norm. Since in every digitalization process resolution is diminished, it proved best to select for interactive manipulation out of the total image only the section of interest.
Detection Of Alterations In Audio Files Using Spectrograph Analysis

Directory of Open Access Journals (Sweden)

Anandha Krishnan G

2015-08-01

Full Text Available The corresponding study was carried out to detect changes in audio file using spectrograph. An audio file format is a file format for storing digital audio data on a computer system. A sound spectrograph is a laboratory instrument that displays a graphical representation of the strengths of the various component frequencies of a sound as time passes. The objectives of the study were to find the changes in spectrograph of audio after altering them to compare altering changes with spectrograph of original files and to check for similarity and difference in mp3 and wav. Five different alterations were carried out on each audio file to analyze the differences between the original and the altered file. For altering the audio file MP3 or WAV by cutcopy the file was opened in Audacity. A different audio was then pasted to the audio file. This new file was analyzed to view the differences. By adjusting the necessary parameters the noise was reduced. The differences between the new file and the original file were analyzed. By adjusting the parameters from the dialog box the necessary changes were made. The edited audio file was opened in the software named spek where after analyzing a graph is obtained of that particular file which is saved for further analysis. The original audio graph received was combined with the edited audio file graph to see the alterations.
Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.

Science.gov (United States)

Gebru, Israel D; Ba, Sileye; Li, Xiaofei; Horaud, Radu

2018-05-01

Speaker diarization consists of assigning speech signals to people engaged in a dialogue. An audio-visual spatiotemporal diarization model is proposed. The model is well suited for challenging scenarios that consist of several participants engaged in multi-party interaction while they move around and turn their heads towards the other participants rather than facing the cameras and the microphones. Multiple-person visual tracking is combined with multiple speech-source localization in order to tackle the speech-to-person association problem. The latter is solved within a novel audio-visual fusion method on the following grounds: binaural spectral features are first extracted from a microphone pair, then a supervised audio-visual alignment technique maps these features onto an image, and finally a semi-supervised clustering method assigns binaural spectral features to visible persons. The main advantage of this method over previous work is that it processes in a principled way speech signals uttered simultaneously by multiple persons. The diarization itself is cast into a latent-variable temporal graphical model that infers speaker identities and speech turns, based on the output of an audio-visual association process, executed at each time slice, and on the dynamics of the diarization variable itself. The proposed formulation yields an efficient exact inference procedure. A novel dataset, that contains audio-visual training data as well as a number of scenarios involving several participants engaged in formal and informal dialogue, is introduced. The proposed method is thoroughly tested and benchmarked with respect to several state-of-the art diarization algorithms.
Personal Digital Branding as a Professional Asset in the Digital Age.

Science.gov (United States)

Kleppinger, Courtney A; Cain, Jeff

2015-08-25

In recent years, society's rapid adoption of social media has made the boundary between professional and private life nearly indistinguishable. The literature provides guidance on how to demonstrate professionalism via social media platforms. Social media policies within health professions education tend to be legalistic in nature, serving primarily to highlight behaviors students should avoid. One missing element in social media literature is the concept of online invisibility. In this paper, we define personal digital branding, discuss the professional implications of choosing to abstain from social media use, and urge educators to recognize that the personal digital branding may be an emerging asset for young professionals in the twenty-first century.
Automated processing of massive audio/video content using FFmpeg

Directory of Open Access Journals (Sweden)

Kia Siang Hock

2014-01-01

Full Text Available Audio and video content forms an integral, important and expanding part of the digital collections in libraries and archives world-wide. While these memory institutions are familiar and well-versed in the management of more conventional materials such as books, periodicals, ephemera and images, the handling of audio (e.g., oral history recordings and video content (e.g., audio-visual recordings, broadcast content requires additional toolkits. In particular, a robust and comprehensive tool that provides a programmable interface is indispensable when dealing with tens of thousands of hours of audio and video content. FFmpeg is comprehensive and well-established open source software that is capable of the full-range of audio/video processing tasks (such as encode, decode, transcode, mux, demux, stream and filter. It is also capable of handling a wide-range of audio and video formats, a unique challenge in memory institutions. It comes with a command line interface, as well as a set of developer libraries that can be incorporated into applications.
Coexistence issues for a 2.4 GHz wireless audio streaming in presence of bluetooth paging and WLAN

Science.gov (United States)

Pfeiffer, F.; Rashwan, M.; Biebl, E.; Napholz, B.

2015-11-01

Nowadays, customers expect to integrate their mobile electronic devices (smartphones and laptops) in a vehicle to form a wireless network. Typically, IEEE 802.11 is used to provide a high-speed wireless local area network (WLAN) and Bluetooth is used for cable replacement applications in a wireless personal area network (PAN). In addition, Daimler uses KLEER as third wireless technology in the unlicensed (UL) 2.4 GHz-ISM-band to transmit full CD-quality digital audio. As Bluetooth, IEEE 802.11 and KLEER are operating in the same frequency band, it has to be ensured that all three technologies can be used simultaneously without interference. In this paper, we focus on the impact of Bluetooth and IEEE 802.11 as interferer in presence of a KLEER audio transmission.
Hot for Teacher: Using Digital Music to Enhance Students' Experience in Online Courses

Science.gov (United States)

Dunlap, Joanna C.; Lowenthal, Patrick R.

2010-01-01

This article provides a review of the instructional potential of digital music to enhance postsecondary students' experience in online courses by involving them in music-driven instructional activities. The authors describe how music-driven instructional activities, when used appropriately, can (a) humanize, personalize, and energize online…
A 240W Monolithic Class-D Audio Amplifier Output Stage

OpenAIRE

Nyboe, Flemming; Kaya, Cetin; Risbo, Lars; Andreani, Pietro

2006-01-01

A single-channel class-D audio amplifier output stage outputs 240W undipped into 4Omega 0.1% open-loop THD+N allows using the device in a fully-digital audio signal path with no feedback. The output current capability is plusmn18A and the part is fabricated in a 0.4mum/1.8mum high-voltage BiCMOS process. Over-current sensing protects the output from short circuits.
An Interactive Concert Program Based on Infrared Watermark and Audio Synthesis

Science.gov (United States)

Wang, Hsi-Chun; Lee, Wen-Pin Hope; Liang, Feng-Ju

The objective of this research is to propose a video/audio system which allows the user to listen the typical music notes in the concert program under infrared detection. The system synthesizes audio with different pitches and tempi in accordance with the encoded data in a 2-D barcode embedded in the infrared watermark. The digital halftoning technique has been used to fabricate the infrared watermark composed of halftone dots by both amplitude modulation (AM) and frequency modulation (FM). The results show that this interactive system successfully recognizes the barcode and synthesizes audio under infrared detection of a concert program which is also valid for human observation of the contents. This interactive video/audio system has greatly expanded the capability of the printout paper to audio display and also has many potential value-added applications.
Personal Digital Branding as a Professional Asset in the Digital Age

Science.gov (United States)

Kleppinger, Courtney A.

2015-01-01

In recent years, society’s rapid adoption of social media has made the boundary between professional and private life nearly indistinguishable. The literature provides guidance on how to demonstrate professionalism via social media platforms. Social media policies within health professions education tend to be legalistic in nature, serving primarily to highlight behaviors students should avoid. One missing element in social media literature is the concept of online invisibility. In this paper, we define personal digital branding, discuss the professional implications of choosing to abstain from social media use, and urge educators to recognize that the personal digital branding may be an emerging asset for young professionals in the twenty-first century. PMID:26430266
Streaming Audio and Video: New Challenges and Opportunities for Museums.

Science.gov (United States)

Spadaccini, Jim

Streaming audio and video present new challenges and opportunities for museums. Streaming media is easier to author and deliver to Internet audiences than ever before; digital video editing is commonplace now that the tools--computers, digital video cameras, and hard drives--are so affordable; the cost of serving video files across the Internet…
Aurally Aided Visual Search Performance Comparing Virtual Audio Systems

DEFF Research Database (Denmark)

Larsen, Camilla Horne; Lauritsen, David Skødt; Larsen, Jacob Junker

2014-01-01

Due to increased computational power, reproducing binaural hearing in real-time applications, through usage of head-related transfer functions (HRTFs), is now possible. This paper addresses the differences in aurally-aided visual search performance between a HRTF enhanced audio system (3D) and an...... with white dots. The results indicate that 3D audio yields faster search latencies than panning audio, especially with larger amounts of distractors. The applications of this research could fit virtual environments such as video games or virtual simulations.......Due to increased computational power, reproducing binaural hearing in real-time applications, through usage of head-related transfer functions (HRTFs), is now possible. This paper addresses the differences in aurally-aided visual search performance between a HRTF enhanced audio system (3D...
Aurally Aided Visual Search Performance Comparing Virtual Audio Systems

DEFF Research Database (Denmark)

Larsen, Camilla Horne; Lauritsen, David Skødt; Larsen, Jacob Junker

2014-01-01

Due to increased computational power reproducing binaural hearing in real-time applications, through usage of head-related transfer functions (HRTFs), is now possible. This paper addresses the differences in aurally-aided visual search performance between an HRTF enhanced audio system (3D) and an...... with white dots. The results indicate that 3D audio yields faster search latencies than panning audio, especially with larger amounts of distractors. The applications of this research could fit virtual environments such as video games or virtual simulations.......Due to increased computational power reproducing binaural hearing in real-time applications, through usage of head-related transfer functions (HRTFs), is now possible. This paper addresses the differences in aurally-aided visual search performance between an HRTF enhanced audio system (3D...
Personal Identity in Enhancement

Directory of Open Access Journals (Sweden)

Jana Podroužková

2015-09-01

Full Text Available The aim of this paper is to introduce the concept of human enhancement, its methods and its relation to personal identity. Also several approaches to personal identity will be described. Transhumanism is a special think tank supporting human enhancement through modern technologies and some of its representatives claim, that even great changes to human organisms will not affect their personal identity. I will briefly describe the most important means of human enhancment and consider the problem of personal identity for each of them separately.
MyLibrary: A Web Personalized Digital Library.

Science.gov (United States)

Rocha, Catarina; Xexeo, Geraldo; da Rocha, Ana Regina C.

With the increasing availability of information on Internet information providers, like search engines, digital libraries and online databases, it becomes more important to have personalized systems that help users to find relevant information. One type of personalization that is growing in use is recommender systems. This paper presents…
Acoustic Heritage and Audio Creativity: the Creative Application of Sound in the Representation, Understanding and Experience of Past Environments

Directory of Open Access Journals (Sweden)

Damian Murphy

2017-06-01

Full Text Available Acoustic Heritage is one aspect of archaeoacoustics, and refers more specifically to the quantifiable acoustic properties of buildings, sites and landscapes from our architectural and archaeological past, forming an important aspect of our intangible cultural heritage. Auralisation, the audio equivalent of 3D visualisation, enables these acoustic properties, captured via the process of measurement and survey, or computer-based modelling, to form the basis of an audio reconstruction and presentation of the studied space. This article examines the application of auralisation and audio creativity as a means to explore our acoustic heritage, thereby diversifying and enhancing the toolset available to the digital heritage or humanities researcher. The Open Acoustic Impulse Response (OpenAIR library is an online repository for acoustic impulse response and auralisation data, with a significant part having been gathered from a broad range of heritage sites. The methodology used to gather this acoustic data is discussed, together with the processes used in generating and calibrating a comparable computer model, and how the data generated might be analysed and presented. The creative use of this acoustic data is also considered, in the context of music production, mixed media artwork and audio for gaming. More relevant to digital heritage is how these data can be used to create new experiences of past environments, as information, interpretation, guide or artwork and ultimately help to articulate new research questions and explorations of our acoustic heritage.
Soldier Flexible Personal Digital Assistant Program

National Research Council Canada - National Science Library

Price, Mark; Woytowich, Jason; Carlson, Marc

2008-01-01

The main goal of the Soldier Flexible Personal Digital Assistant Program was to develop prototypes of a novel flexible display technology device for demonstration in a laboratory setting and use in Future Force Warrior (FFW) demonstrations...

Application of Modified Digital Halftoning Techniques to Data Hiding in Personalized Stamps

Institute of Scientific and Technical Information of China (English)

Hsi-Chun Wang; Chi-Ming Lian; Pei-Chi Hsiao

2004-01-01

The objective of this research is to embed information in personalized stamps by modified digital halftoning techniques. The displaced and deformed halftone dots are used to encode data in the personalized stamps. Hidden information can be retrieved by either an optical decoder or digital image processing techniques.The results show that personalized stamps with value-added features like data hiding or digital watermarking can be successfully implemented.
Rehabilitation strategies enhancing participation in shopping malls for persons living with a disability.

Science.gov (United States)

Alary Gauvreau, Christine; Kairy, Dahlia; Mazer, Barbara; Guindon, Andréanne; Le Dorze, Guylaine

2018-04-01

After rehabilitation, it is not clear the extent to which persons living with a disability return to their former activities in the community, such as going to shopping malls. Rehabilitation professionals are faced with the challenge to adequately prepare their clients to resume community participation. The purpose of this study was to identify rehabilitation strategies aimed at preparing clients to engage in activities in shopping malls. Twenty-two participants including 16 rehabilitation clinicians and 6 persons living with a disability participated in four nominal group sessions. Participants were questioned on current or potential rehabilitation strategies carried out to enhance participation in shopping malls for persons living with a disability. Discussions were audio-recorded and qualitative content analysis was conducted. Participants mentioned strategies that were either carried out by the clinician, or in collaboration with other parties. The latter type of strategies was either carried out with the collaboration of the client, the interdisciplinary team, the relatives, or community organizations. Rehabilitation clinicians have a role to play in preparing persons living with a disability to resume activities in a shopping mall. Additionally, therapeutic interventions in community settings may enhance the participation of rehabilitation clients in their everyday activities. Implications for rehabilitation Many strategies are currently used in rehabilitation to prepare persons living with a disability to resume shopping activities. Clinicians could implement shopping-oriented rehabilitation strategies with the client and/or with other rehabilitation partners. Involving clients in activities related to shopping might enhance their participation in shopping malls after rehabilitation. Rehabilitation clinicians can be facilitators for people living with a disability to reach optimal participation.
Wavelet-based audio embedding and audio/video compression

Science.gov (United States)

Mendenhall, Michael J.; Claypoole, Roger L., Jr.

2001-12-01

Watermarking, traditionally used for copyright protection, is used in a new and exciting way. An efficient wavelet-based watermarking technique embeds audio information into a video signal. Several effective compression techniques are applied to compress the resulting audio/video signal in an embedded fashion. This wavelet-based compression algorithm incorporates bit-plane coding, index coding, and Huffman coding. To demonstrate the potential of this audio embedding and audio/video compression algorithm, we embed an audio signal into a video signal and then compress. Results show that overall compression rates of 15:1 can be achieved. The video signal is reconstructed with a median PSNR of nearly 33 dB. Finally, the audio signal is extracted from the compressed audio/video signal without error.
Integration of Bass Enhancement and Active Noise Control System in Automobile Cabin

Directory of Open Access Journals (Sweden)

Liang Wang

2008-01-01

Full Text Available With the advancement of digital signal processing technologies, consumers are more concerned with the quality of multimedia entertainment in automobiles. In order to meet this demand, an audio enhancement system is needed to improve bass reproduction and cancel engine noise in the cabins. This paper presents an integrated active noise control system that is based on frequency-sampling filters to track and extract the bass information from the audio signal, and a multifrequency active noise equalizer to tune the low-frequency engine harmonics to enhance the bass reproduction. In the noise cancellation mode, a maximum of 3 dB bass enhancement can be achieved with significant noise suppression, while higher bass enhancement can be achieved in the bass enhance mode. The results show that the proposed system is effective for solving both the bass audio reproduction and the noise control problems in automobile cabins.
Perceptual Coding of Audio Signals Using Adaptive Time-Frequency Transform

Directory of Open Access Journals (Sweden)

Umapathy Karthikeyan

2007-01-01

Full Text Available Wide band digital audio signals have a very high data-rate associated with them due to their complex nature and demand for high-quality reproduction. Although recent technological advancements have significantly reduced the cost of bandwidth and miniaturized storage facilities, the rapid increase in the volume of digital audio content constantly compels the need for better compression algorithms. Over the years various perceptually lossless compression techniques have been introduced, and transform-based compression techniques have made a significant impact in recent years. In this paper, we propose one such transform-based compression technique, where the joint time-frequency (TF properties of the nonstationary nature of the audio signals were exploited in creating a compact energy representation of the signal in fewer coefficients. The decomposition coefficients were processed and perceptually filtered to retain only the relevant coefficients. Perceptual filtering (psychoacoustics was applied in a novel way by analyzing and performing TF specific psychoacoustics experiments. An added advantage of the proposed technique is that, due to its signal adaptive nature, it does not need predetermined segmentation of audio signals for processing. Eight stereo audio signal samples of different varieties were used in the study. Subjective (mean opinion score—MOS listening tests were performed and the subjective difference grades (SDG were used to compare the performance of the proposed coder with MP3, AAC, and HE-AAC encoders. Compression ratios in the range of 8 to 40 were achieved by the proposed technique with subjective difference grades (SDG ranging from –0.53 to –2.27.
Advances in audio watermarking based on singular value decomposition

CERN Document Server

Dhar, Pranab Kumar

2015-01-01

This book introduces audio watermarking methods for copyright protection, which has drawn extensive attention for securing digital data from unauthorized copying. The book is divided into two parts. First, an audio watermarking method in discrete wavelet transform (DWT) and discrete cosine transform (DCT) domains using singular value decomposition (SVD) and quantization is introduced. This method is robust against various attacks and provides good imperceptible watermarked sounds. Then, an audio watermarking method in fast Fourier transform (FFT) domain using SVD and Cartesian-polar transformation (CPT) is presented. This method has high imperceptibility and high data payload and it provides good robustness against various attacks. These techniques allow media owners to protect copyright and to show authenticity and ownership of their material in a variety of applications. · Features new methods of audio watermarking for copyright protection and ownership protection · Outl...
The Personal Hearing System—A Software Hearing Aid for a Personal Communication System

Directory of Open Access Journals (Sweden)

Giso Grimm

2009-01-01

Full Text Available A concept and architecture of a personal communication system (PCS is introduced that integrates audio communication and hearing support for the elderly and hearing-impaired through a personal hearing system (PHS. The concept envisions a central processor connected to audio headsets via a wireless body area network (WBAN. To demonstrate the concept, a prototype PCS is presented that is implemented on a netbook computer with a dedicated audio interface in combination with a mobile phone. The prototype can be used for field-testing possible applications and to reveal possibilities and limitations of the concept of integrating hearing support in consumer audio communication devices. It is shown that the prototype PCS can integrate hearing aid functionality, telephony, public announcement systems, and home entertainment. An exemplary binaural speech enhancement scheme that represents a large class of possible PHS processing schemes is shown to be compatible with the general concept. However, an analysis of hardware and software architectures shows that the implementation of a PCS on future advanced cell phone-like devices is challenging. Because of limitations in processing power, recoding of prototype implementations into fixed point arithmetic will be required and WBAN performance is still a limiting factor in terms of data rate and delay.
The Personal Hearing System—A Software Hearing Aid for a Personal Communication System

Science.gov (United States)

Grimm, Giso; Guilmin, Gwénaël; Poppen, Frank; Vlaming, Marcel S. M. G.; Hohmann, Volker

2009-12-01

A concept and architecture of a personal communication system (PCS) is introduced that integrates audio communication and hearing support for the elderly and hearing-impaired through a personal hearing system (PHS). The concept envisions a central processor connected to audio headsets via a wireless body area network (WBAN). To demonstrate the concept, a prototype PCS is presented that is implemented on a netbook computer with a dedicated audio interface in combination with a mobile phone. The prototype can be used for field-testing possible applications and to reveal possibilities and limitations of the concept of integrating hearing support in consumer audio communication devices. It is shown that the prototype PCS can integrate hearing aid functionality, telephony, public announcement systems, and home entertainment. An exemplary binaural speech enhancement scheme that represents a large class of possible PHS processing schemes is shown to be compatible with the general concept. However, an analysis of hardware and software architectures shows that the implementation of a PCS on future advanced cell phone-like devices is challenging. Because of limitations in processing power, recoding of prototype implementations into fixed point arithmetic will be required and WBAN performance is still a limiting factor in terms of data rate and delay.
The Profiles in Science Digital Library: Behind the Scenes.

Science.gov (United States)

Gallagher, Marie E; Moffatt, Christie

2012-01-01

This demonstration shows the Profiles in Science ® digital library. Profiles in Science contains digitized selections from the personal manuscript collections of prominent biomedical researchers, medical practitioners, and those fostering science and health. The Profiles in Science Web site is the delivery mechanism for content derived from the digital library system. The system is designed according to our basic principles for digital library development [1]. The digital library includes the rules and software used for digitizing items, creating and editing database records and performing quality control as well as serving the digital content to the public. Among the types of data managed by the digital library are detailed item-level, collection-level and cross-collection metadata, digitized photographs, papers, audio clips, movies, born-digital electronic files, optical character recognized (OCR) text, and annotations (see Figure 1). The digital library also tracks the status of each item, including digitization quality, sensitivity of content, and copyright. Only items satisfying all required criteria are released to the public through the World Wide Web. External factors have influenced all aspects of the digital library's infrastructure.
Perceptual Coding of Audio Signals Using Adaptive Time-Frequency Transform

Directory of Open Access Journals (Sweden)

Karthikeyan Umapathy

2007-08-01

Full Text Available Wide band digital audio signals have a very high data-rate associated with them due to their complex nature and demand for high-quality reproduction. Although recent technological advancements have significantly reduced the cost of bandwidth and miniaturized storage facilities, the rapid increase in the volume of digital audio content constantly compels the need for better compression algorithms. Over the years various perceptually lossless compression techniques have been introduced, and transform-based compression techniques have made a significant impact in recent years. In this paper, we propose one such transform-based compression technique, where the joint time-frequency (TF properties of the nonstationary nature of the audio signals were exploited in creating a compact energy representation of the signal in fewer coefficients. The decomposition coefficients were processed and perceptually filtered to retain only the relevant coefficients. Perceptual filtering (psychoacoustics was applied in a novel way by analyzing and performing TF specific psychoacoustics experiments. An added advantage of the proposed technique is that, due to its signal adaptive nature, it does not need predetermined segmentation of audio signals for processing. Eight stereo audio signal samples of different varieties were used in the study. Subjective (mean opinion scoreÃ¢Â€Â”MOS listening tests were performed and the subjective difference grades (SDG were used to compare the performance of the proposed coder with MP3, AAC, and HE-AAC encoders. Compression ratios in the range of 8 to 40 were achieved by the proposed technique with subjective difference grades (SDG ranging from Ã¢Â€Â“0.53 to Ã¢Â€Â“2.27.
Spatial audio reproduction with primary ambient extraction

CERN Document Server

He, JianJun

2017-01-01

This book first introduces the background of spatial audio reproduction, with different types of audio content and for different types of playback systems. A literature study on the classical and emerging Primary Ambient Extraction (PAE) techniques is presented. The emerging techniques aim to improve the extraction performance and also enhance the robustness of PAE approaches in dealing with more complex signals encountered in practice. The in-depth theoretical study helps readers to understand the rationales behind these approaches. Extensive objective and subjective experiments validate the feasibility of applying PAE in spatial audio reproduction systems. These experimental results, together with some representative audio examples and MATLAB codes of the key algorithms, illustrate clearly the differences among various approaches and also help readers gain insights on selecting different approaches for different applications.
Controlled sharing of personal content using digital rights management

NARCIS (Netherlands)

Conrado, C.; Petkovic, M.; Veen, van der M.; Velde, van der W.H.

2006-01-01

This paper describes a system which allows controlled distribution of personal digital content by users. The system extends an existing Digital Rights Management system for the protection of commercial copyrighted content by essentially allowing users to become content providers. This fact, however,
Value of wireless personal digital assistants for practice: perceptions of advanced practice nurses.

Science.gov (United States)

Garrett, Bernard; Klein, Gerri

2008-08-01

The aims were to explore advanced practice nurses' perceptions on wireless Personal Digital Assistant technologies, to establish the type and range of tools that would be useful to support their practice and to identify any requirements and limitations that may impact the implementation of wireless Personal Digital Assistants in practice. The wireless Personal Digital Assistant is becoming established as a hand-held computing tool for healthcare professionals. The reflections of advanced practice nurses' about the value of wireless Personal Digital Assistants and its potential to contribute to improved patient care has not been investigated. A qualitative interpretivist design was used to explore advanced practice nurses' perceptions on the value of wireless Personal Digital Assistant technologies to support their practice. The data were collected using survey questionnaires and individual and focus group interviews with nurse practitioners, clinical nurse specialists and information technology managers based in British Columbia, Canada. An open-coding content analysis was performed using qualitative data analysis software. Wireless Personal Digital Assistant's use supports the principles of pervasivity and is a technology rapidly being adopted by advanced practice nurses. Some nurses indicated a reluctance to integrate wireless Personal Digital Assistant technologies into their practices because of the cost and the short technological life cycle of these devices. Many of the barriers which precluded the use of wireless networks within facilities are being removed. Nurses demonstrated a complex understanding of wireless Personal Digital Assistant technologies and gave good rationales for its integration in their practice. Nurses identified improved client care as the major benefit of this technology in practice and the type and range of tools they identified included clinical reference tools such as drug and diagnostic/laboratory reference applications and wireless
Audio Papers

DEFF Research Database (Denmark)

Groth, Sanne Krogh; Samson, Kristine

2016-01-01

With this special issue of Seismograf we are happy to present a new format of articles: Audio Papers. Audio papers resemble the regular essay or the academic text in that they deal with a certain topic of interest, but presented in the form of an audio production. The audio paper is an extension...
All About Audio Equalization: Solutions and Frontiers

Directory of Open Access Journals (Sweden)

Vesa Välimäki

2016-05-01

Full Text Available Audio equalization is a vast and active research area. The extent of research means that one often cannot identify the preferred technique for a particular problem. This review paper bridges those gaps, systemically providing a deep understanding of the problems and approaches in audio equalization, their relative merits and applications. Digital signal processing techniques for modifying the spectral balance in audio signals and applications of these techniques are reviewed, ranging from classic equalizers to emerging designs based on new advances in signal processing and machine learning. Emphasis is placed on putting the range of approaches within a common mathematical and conceptual framework. The application areas discussed herein are diverse, and include well-defined, solvable problems of filter design subject to constraints, as well as newly emerging challenges that touch on problems in semantics, perception and human computer interaction. Case studies are given in order to illustrate key concepts and how they are applied in practice. We also recommend preferred signal processing approaches for important audio equalization problems. Finally, we discuss current challenges and the uncharted frontiers in this field. The source code for methods discussed in this paper is made available at https://code.soundsoftware.ac.uk/projects/allaboutaudioeq.
Audio engineering 101 a beginner's guide to music production

CERN Document Server

Dittmar, Tim

2013-01-01

Audio Engineering 101 is a real world guide for starting out in the recording industry. If you have the dream, the ideas, the music and the creativity but don't know where to start, then this book is for you!Filled with practical advice on how to navigate the recording world, from an author with first-hand, real-life experience, Audio Engineering 101 will help you succeed in the exciting, but tough and confusing, music industry. Covering all you need to know about the recording process, from the characteristics of sound to a guide to microphones to analog versus digital
Audio Restoration

Science.gov (United States)

Esquef, Paulo A. A.

The first reproducible recording of human voice was made in 1877 on a tinfoil cylinder phonograph devised by Thomas A. Edison. Since then, much effort has been expended to find better ways to record and reproduce sounds. By the mid-1920s, the first electrical recordings appeared and gradually took over purely acoustic recordings. The development of electronic computers, in conjunction with the ability to record data onto magnetic or optical media, culminated in the standardization of compact disc format in 1980. Nowadays, digital technology is applied to several audio applications, not only to improve the quality of modern and old recording/reproduction techniques, but also to trade off sound quality for less storage space and less taxing transmission capacity requirements.
System-Level Optimization of a DAC for Hearing-Aid Audio Class D Output Stage

DEFF Research Database (Denmark)

Pracný, Peter; Jørgensen, Ivan Harald Holger; Bruun, Erik

2013-01-01

This paper deals with system-level optimization of a digital-to-analog converter (DAC) for hearing-aid audio Class D output stage. We discuss the ΣΔ modulator system-level design parameters – the order, the oversampling ratio (OSR) and the number of bits in the quantizer. We show that combining...... by comparing two ΣΔ modulator designs. The proposed optimization has impact on the whole hearing-aid audio back-end system including less hardware in the interpolation filter and half the switching rate in the digital-pulse-width-modulation (DPWM) block and Class D output stage...... a reduction of the OSR with an increase of the order results in considerable power savings while the audio quality is kept. For further savings in the ΣΔ modulator, overdesign and subsequent coarse coefficient quantization are used. A figure of merit (FOM) is introduced to confirm this optimization approach...
Automated Speech and Audio Analysis for Semantic Access to Multimedia

NARCIS (Netherlands)

Jong, F.M.G. de; Ordelman, R.; Huijbregts, M.

2006-01-01

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to
Automated speech and audio analysis for semantic access to multimedia

NARCIS (Netherlands)

de Jong, Franciska M.G.; Ordelman, Roeland J.F.; Huijbregts, M.A.H.; Avrithis, Y.; Kompatsiaris, Y.; Staab, S.; O' Connor, N.E.

2006-01-01

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to

Download Your Doctor: Implementation of a Digitally Mediated Personal Physician Presence to Enhance Patient Engagement With a Health-Promoting Internet Application.

Science.gov (United States)

Lygidakis, Charilaos; Wallace, Paul; Tersar, Costanza; Marcatto, Francesco; Ferrante, Donatella; Della Vedova, Roberto; Scafuri, Francesca; Scafato, Emanuele; Struzzo, Pierluigi

2016-03-04

Brief interventions delivered in primary health care are effective in reducing excessive drinking; online behavior-changing technique interventions may be helpful. Physicians may actively encourage the use of such interventions by helping patients access selected websites (a process known as "facilitated access"). Although the therapeutic working alliance plays a significant role in the achievement of positive outcomes in face-to-face psychotherapy and its development has been shown to be feasible online, little research has been done on its impact on brief interventions. Strengthening patients' perception of their physician's endorsement of a website could facilitate the development of an effective alliance between the patient and the app. We describe the implementation of a digitally mediated personal physician presence to enhance patient engagement with an alcohol-reduction website as part of the experimental online intervention in a noninferiority randomized controlled trial. We also report the feedback of the users on the module. The Download Your Doctor module was created to simulate the personal physician presence for an alcohol-reduction website that was developed for the EFAR-FVG trial conducted in the Italian region of Friuli-Venezia-Giulia. The module was designed to enhance therapeutic alliance and thus improve outcomes in the intervention group (facilitated access to the website). Participating general and family practitioners could customize messages and visual elements and upload a personal photo, signature, and video recordings. To assess the perceptions and attitudes of the physicians, a semistructured interview was carried out 3 months after the start of the trial. Participating patients were invited to respond to a short online questionnaire 12 months following recruitment to investigate their evaluation of their online experiences. Nearly three-quarters (23/32, 72%) of the physicians interviewed chose to customize the contents of the interaction
Audio-Visual Perception System for a Humanoid Robotic Head

Directory of Open Access Journals (Sweden)

Raquel Viciana-Abad

2014-05-01

Full Text Available One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework.
An ESL Audio-Script Writing Workshop

Science.gov (United States)

Miller, Carla

2012-01-01

The roles of dialogue, collaborative writing, and authentic communication have been explored as effective strategies in second language writing classrooms. In this article, the stages of an innovative, multi-skill writing method, which embeds students' personal voices into the writing process, are explored. A 10-step ESL Audio Script Writing Model…
Audio-Visual and Meaningful Semantic Context Enhancements in Older and Younger Adults.

Directory of Open Access Journals (Sweden)

Kirsten E Smayda

Full Text Available Speech perception is critical to everyday life. Oftentimes noise can degrade a speech signal; however, because of the cues available to the listener, such as visual and semantic cues, noise rarely prevents conversations from continuing. The interaction of visual and semantic cues in aiding speech perception has been studied in young adults, but the extent to which these two cues interact for older adults has not been studied. To investigate the effect of visual and semantic cues on speech perception in older and younger adults, we recruited forty-five young adults (ages 18-35 and thirty-three older adults (ages 60-90 to participate in a speech perception task. Participants were presented with semantically meaningful and anomalous sentences in audio-only and audio-visual conditions. We hypothesized that young adults would outperform older adults across SNRs, modalities, and semantic contexts. In addition, we hypothesized that both young and older adults would receive a greater benefit from a semantically meaningful context in the audio-visual relative to audio-only modality. We predicted that young adults would receive greater visual benefit in semantically meaningful contexts relative to anomalous contexts. However, we predicted that older adults could receive a greater visual benefit in either semantically meaningful or anomalous contexts. Results suggested that in the most supportive context, that is, semantically meaningful sentences presented in the audiovisual modality, older adults performed similarly to young adults. In addition, both groups received the same amount of visual and meaningful benefit. Lastly, across groups, a semantically meaningful context provided more benefit in the audio-visual modality relative to the audio-only modality, and the presence of visual cues provided more benefit in semantically meaningful contexts relative to anomalous contexts. These results suggest that older adults can perceive speech as well as younger
An audio FIR-DAC in a BCD process for high power Class-D amplifiers

NARCIS (Netherlands)

Doorn, T.S.; van Tuijl, Adrianus Johannes Maria; Schinkel, Daniel; Annema, Anne J.; Berkhout, M.; Berkhout, M.; Nauta, Bram

A 322 coefficient semi-digital FIR-DAC using a 1-bit PWM input signal was designed and implemented in a high voltage, audio power bipolar CMOS DMOS (BCD) process. This facilitates digital input signals for an analog class-D amplifier in BCD. The FIR-DAC performance depends on the ISI-resistant
Semantic Web, Reusable Learning Objects, Personal Learning Networks in Health: Key Pieces for Digital Health Literacy.

Science.gov (United States)

Konstantinidis, Stathis Th; Wharrad, Heather; Windle, Richard; Bamidis, Panagiotis D

2017-01-01

The knowledge existing in the World Wide Web is exponentially expanding, while continuous advancements in health sciences contribute to the creation of new knowledge. There are a lot of efforts trying to identify how the social connectivity can endorse patients' empowerment, while other studies look at the identification and the quality of online materials. However, emphasis has not been put on the big picture of connecting the existing resources with the patients "new habits" of learning through their own Personal Learning Networks. In this paper we propose a framework for empowering patients' digital health literacy adjusted to patients' currents needs by utilizing the contemporary way of learning through Personal Learning Networks, existing high quality learning resources and semantics technologies for interconnecting knowledge pieces. The framework based on the concept of knowledge maps for health as defined in this paper. Health Digital Literacy needs definitely further enhancement and the use of the proposed concept might lead to useful tools which enable use of understandable health trusted resources tailored to each person needs.
Visual cues for person-centered communication.

Science.gov (United States)

Williams, Kristine; Harris, Brynn; Lueger, Amy; Ward, Kathleen; Wassmer, Rebecca; Weber, Amy

2011-11-01

Nursing home communication is frequently limited and task-focused and fails to affirm resident personhood. We tested the feasibility and effects of automated digital displays of resident photographs to remind staff (N = 11) of resident (n = 6) personhood. Historical photographs were displayed in digital photo frames mounted in each resident's room. To evaluate the intervention's effects, staff-resident conversations were audio-recorded prior to displaying the frames and repeated 2 weeks and 3 months later. Conversations were transcribed and statements were topic coded (task-focused vs. interpersonal). Staff person-centered talk increased from 11% to 32% (z = 2.37, p = .02) after the intervention and task-talk decreased from 64% to 40%. Resident interpersonal topics increased from 20% to 37%. Staff statements increased from 29 at baseline, to 37 postintervention, and 41 at 3-month follow-up and resident engagement and reminiscence also increased. Effects were reduced after 3 months. Automated photo displays are an easily implemented, low-cost intervention to promote person-centered communication.
The audio expert everything you need to know about audio

CERN Document Server

Winer, Ethan

2012-01-01

The Audio Expert is a comprehensive reference that covers all aspects of audio, with many practical, as well as theoretical, explanations. Providing in-depth descriptions of how audio really works, using common sense plain-English explanations and mechanical analogies with minimal math, the book is written for people who want to understand audio at the deepest, most technical level, without needing an engineering degree. It's presented in an easy-to-read, conversational tone, and includes more than 400 figures and photos augmenting the text.The Audio Expert takes th
Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images

Directory of Open Access Journals (Sweden)

Koji Iwano

2007-03-01

Full Text Available This paper proposes an audio-visual speech recognition method using lip information extracted from side-face images as an attempt to increase noise robustness in mobile environments. Our proposed method assumes that lip images can be captured using a small camera installed in a handset. Two different kinds of lip features, lip-contour geometric features and lip-motion velocity features, are used individually or jointly, in combination with audio features. Phoneme HMMs modeling the audio and visual features are built based on the multistream HMM technique. Experiments conducted using Japanese connected digit speech contaminated with white noise in various SNR conditions show effectiveness of the proposed method. Recognition accuracy is improved by using the visual information in all SNR conditions. These visual features were confirmed to be effective even when the audio HMM was adapted to noise by the MLLR method.
Audio-Visual Fusion for Sound Source Localization and Improved Attention

Energy Technology Data Exchange (ETDEWEB)

Lee, Byoung Gi; Choi, Jong Suk; Yoon, Sang Suk; Choi, Mun Taek; Kim, Mun Sang [Korea Institute of Science and Technology, Daejeon (Korea, Republic of); Kim, Dai Jin [Pohang University of Science and Technology, Pohang (Korea, Republic of)

2011-07-15

Service robots are equipped with various sensors such as vision camera, sonar sensor, laser scanner, and microphones. Although these sensors have their own functions, some of them can be made to work together and perform more complicated functions. AudioFvisual fusion is a typical and powerful combination of audio and video sensors, because audio information is complementary to visual information and vice versa. Human beings also mainly depend on visual and auditory information in their daily life. In this paper, we conduct two studies using audioFvision fusion: one is on enhancing the performance of sound localization, and the other is on improving robot attention through sound localization and face detection.
Audio-Visual Fusion for Sound Source Localization and Improved Attention

International Nuclear Information System (INIS)

Lee, Byoung Gi; Choi, Jong Suk; Yoon, Sang Suk; Choi, Mun Taek; Kim, Mun Sang; Kim, Dai Jin

2011-01-01

Service robots are equipped with various sensors such as vision camera, sonar sensor, laser scanner, and microphones. Although these sensors have their own functions, some of them can be made to work together and perform more complicated functions. AudioFvisual fusion is a typical and powerful combination of audio and video sensors, because audio information is complementary to visual information and vice versa. Human beings also mainly depend on visual and auditory information in their daily life. In this paper, we conduct two studies using audioFvision fusion: one is on enhancing the performance of sound localization, and the other is on improving robot attention through sound localization and face detection
Improvements of ModalMax High-Fidelity Piezoelectric Audio Device

Science.gov (United States)

Woodard, Stanley E.

2005-01-01

ModalMax audio speakers have been enhanced by innovative means of tailoring the vibration response of thin piezoelectric plates to produce a high-fidelity audio response. The ModalMax audio speakers are 1 mm in thickness. The device completely supplants the need to have a separate driver and speaker cone. ModalMax speakers can perform the same applications of cone speakers, but unlike cone speakers, ModalMax speakers can function in harsh environments such as high humidity or extreme wetness. New design features allow the speakers to be completely submersed in salt water, making them well suited for maritime applications. The sound produced from the ModalMax audio speakers has sound spatial resolution that is readily discernable for headset users.
Toward Personal and Emotional Connectivity in Mobile Higher Education through Asynchronous Formative Audio Feedback

Science.gov (United States)

Rasi, Päivi; Vuojärvi, Hanna

2018-01-01

This study aims to develop asynchronous formative audio feedback practices for mobile learning in higher education settings. The development was conducted in keeping with the principles of design-based research. The research activities focused on an inter-university online course, within which the use of instructor audio feedback was tested,…
Digital Rights Management

NARCIS (Netherlands)

Koster, P.; Jonker, Willem; Blanken, Henk; de Vries, A.P.; Blok, H.E.; Feng, L.

2007-01-01

Digital Rights Management, or DRM for short, is a much-discussed topic nowadays. The main reason for this is that DRM technology is often mentioned in the context of protection of digital audio and video content, for example to avoid large scale copying of CDs and DVDs via peer-to-peer networks in
Integration of top-down and bottom-up information for audio organization and retrieval

DEFF Research Database (Denmark)

Jensen, Bjørn Sand

The increasing availability of digital audio and music calls for methods and systems to analyse and organize these digital objects. This thesis investigates three elements related to such systems focusing on the ability to represent and elicit the user's view on the multimedia object and the system...... output. The aim is to provide organization and processing, which aligns with the understanding and needs of the users. Audio and music is often characterized by the large amount of heterogenous information. The rst aspect investigated is the integration of such multi-variate and multi-modal information...... (indirect scaling). Inference is performed by analytical and simulation based methods, including the Laplace approximation and expectation propagation. In order to minimize the cost of the often expensive and lengthly experimentation, sequential experiment design or active learning is supported. The setup...
Personalized direct marketing using digital publishing

Science.gov (United States)

Kutty, Cheeniyil L.; Prabhakaran, Jayasree K.

2006-02-01

In today's cost-conscious business climate, marketing and customer service decision makers are increasingly concerned with how to increase customer response and retention rates. Companies spend large amounts of money on Customer Relationship Management (CRM) solutions and data acquisition but they don't know how to use the information stored in these systems to improve the effectiveness of their direct marketing campaigns. By leveraging the customer information they already have, companies can create personalized, printed direct mail programs that generate high response rates, greater returns, and stronger customer loyalty, while gaining a significant edge over their competitors. To reach the promised land of one-to-one direct marketing (personalized direct marketing - PDM), companies need an end-to-end solution for creating, managing, printing, and distributing personalized direct mail "on demand." Having access to digital printing is just one piece of the solution. A more complete approach includes leveraging personalization technology into a useful direct marketing tool that provides true one-to-one marketing, allowing variable images and text in a personalized direct mail. This paper discusses integration of CRM with a Print-on-Demand solution so as to create truly personalized printed marketing campaigns for one or many individuals based on the profile information, preferences and purchase history stored in the CRM.
SCORE DIGITAL TECHNOLOGY: THE CONVERGENCE

Directory of Open Access Journals (Sweden)

Chernyshov Alexander V.

2013-12-01

Full Text Available Explores the role of digital scorewriters in today's culture, education, and music industry and media environment. The main principle of the development of software is not only publishing innovation (relating to the sheet music, and integration into the area of composition, arrangement, education, creative process for works based on digital technology (films, television and radio broadcasting, Internet, audio and video art. Therefore the own convergence of musically-computer technology is a total phenomenon: notation program combined with means MIDI-sequencer, audio and video editor. The article contains the unique interview with the creator of music notation processors.
Semantic Analysis of Multimedial Information Usign Both Audio and Visual Clues

Directory of Open Access Journals (Sweden)

Andrej Lukac

2008-01-01

Full Text Available Nowadays, there is a lot of information in databases (text, audio/video form, etc.. It is important to be able to describe this data for better orientation in them. It is necessary to apply audio/video properties, which are used for metadata management, segmenting the document into semantically meaningful units, classifying each unit into a predefined scene type, indexing, summarizing the document for efficient retrieval and browsing. Data can be used for system that automatically searches for a specific person in a sequence also for special video sequences. Audio/video properties are presented by descriptors and description schemes. There are many features that can be used to characterize multimedial signals. We can analyze audio and video sequences jointly or considered them completely separately. Our aim is oriented to possibilities of combining multimedial features. Focus is direct into discussion programs, because there are more decisions how to combine audio features with video sequences.
Tomosynthesis and contrast-enhanced digital mammography: recent advances in digital mammography

International Nuclear Information System (INIS)

Diekmann, Felix; Bick, Ulrich

2007-01-01

Digital mammography is more and more replacing conventional mammography. Initial concerns about an inferior image quality of digital mammography have been largely overcome and recent studies even show digital mammography to be superior in women with dense breasts, while at the same time reducing radiation exposure. Nevertheless, an important limitation of digital mammography remains: namely, the fact that summation may obscure lesions in dense breast tissue. However, digital mammography offers the option of so-called advanced applications, and two of these, contrast-enhanced mammography and tomosynthesis, are promising candidates for improving the detection of breast lesions otherwise obscured by the summation of dense tissue. Two techniques of contrast-enhanced mammography are available: temporal subtraction of images acquired before and after contrast administration and the so-called dual-energy technique, which means that pairs of low/high-energy images acquired after contrast administration are subtracted. Tomosynthesis on the other hand provides three-dimensional information on the breast. The images are acquired with different angulations of the X-ray tube while the object or detector is static. Various reconstruction algorithms can then be applied to the set of typically nine to 28 source images to reconstruct 1-mm slices with a reduced risk of obscuring pathology. Combinations of both advanced applications have only been investigated in individual experimental studies; more advanced software algorithms and CAD systems are still in their infancy and have only undergone preliminary clinical evaluation. (orig.)
Efficiently Synchronized Spread-Spectrum Audio Watermarking with Improved Psychoacoustic Model

Directory of Open Access Journals (Sweden)

Xing He

2008-01-01

Full Text Available This paper presents an audio watermarking scheme which is based on an efficiently synchronized spread-spectrum technique and a new psychoacoustic model computed using the discrete wavelet packet transform. The psychoacoustic model takes advantage of the multiresolution analysis of a wavelet transform, which closely approximates the standard critical band partition. The goal of this model is to include an accurate time-frequency analysis and to calculate both the frequency and temporal masking thresholds directly in the wavelet domain. Experimental results show that this watermarking scheme can successfully embed watermarks into digital audio without introducing audible distortion. Several common watermark attacks were applied and the results indicate that the method is very robust to those attacks.

Personal Digital Information Archiving among Students of Social Sciences and Humanities

Science.gov (United States)

Krtalic, Maja; Marcetic, Hana; Micunovic, Milijana

2016-01-01

Introduction: As both academic citizens and active participants in information society who use information, students produce huge amounts of personal digital data and documents. It is therefore important to raise questions about their awareness, responsibility, tendencies and activities they undertake to preserve their collective digital heritage.…
Realisierung eines verzerrungsarmen Open-Loop Klasse-D Audio-Verstärkers mit SB-ZePoC

Directory of Open Access Journals (Sweden)

O. Schnick

2007-06-01

Full Text Available In den letzten Jahren hat die Entwicklung von Klasse-D Verstärkern für Audio-Anwendungen ein vermehrtes Interesse auf sich gezogen. Eine Motivation hierfür liegt in der mit dieser Technik extrem hohen erzielbaren Effizienz von über 90%. Die Signale, die Klasse-D Verstärker steuern, sind binär. Immer mehr Audio-Signale werden entweder digital gespeichert (CD, DVD, MP3 oder digital übermittelt (Internet, DRM, DAB, DVB-T, DVB-S, GMS, UMTS, weshalb eine direkte Umsetzung dieser Daten in ein binäres Steuersignal ohne vorherige konventionelle D/A-Wandlung erstrebenswert erscheint.

Die klassischen Pulsweitenmodulationsverfahren führen zu Aliasing-Komponenten im Audio-Basisband. Diese Verzerrungen können nur durch eine sehr hohe Schaltfrequenz auf ein akzeptables Maß reduziert werden. Durch das von der Forschungsgruppe um Prof. Mathis vorgestellte SB-ZePoC Verfahren (Zero Position Coding with Separated Baseband wird diese Art der Signalverzerrung durch Generierung eines separierten Basisbands verhindert. Deshalb können auch niedrige Schaltfrequenzen gewählt werden. Dadurch werden nicht nur die Schaltverluste, sondern auch Timing-Verzerrungen verringert, die durch die nichtideale Schaltendstufe verursacht werden. Diese tragen einen großen Anteil zu den gesamten Verzerrungen eines Klasse-D Verstärkers bei. Mit dem SB-ZePoC Verfahren lassen sich verzerrungsarme Open-Loop Klasse-D Audio-Verstärker realisieren, die ohne aufwändige Gegenkopplungsschleifen auskommen.

Class-D amplifiers are suiteble for amplification of audio signals. One argument is their high efficiency of 90% and more. Today most of the audio signals are stored or transmitted in digital form. A digitally controlled Class-D amplifier can be directly driven with coded (modulated data. No separate D/A conversion is needed. Classical modulation schemes like Pulse-Width-Modulation (PWM cause aliasing. So a very high switching rate is required to minimize the
Speaker detection for conversational robots using synchrony between audio and video

NARCIS (Netherlands)

Noulas, A.; Englebienne, G.; Terwijn, B.; Kröse, B.; Hanheide, M.; Zender, H.

2010-01-01

This paper compares different methods for detecting the speaking person when multiple persons are interacting with a robot. We evaluate the state-of-the-art speaker detection methods on the iCat robot. These methods use the synchrony between audio and video to locate the most probable speaker. We
Analysis of Personal Digital Library and MyLibrary%"Personal Digital Library"与"MyLibrary"辨析

Institute of Scientific and Technical Information of China (English)

秦飞飞

2011-01-01

学术界一些研究者认为"Personal Digital Library"与"MyLibrary"均可指个人数字图书馆.然而,两者是不同概念、特征及功能的事物.论文对两者的概念、研究现状及趋势作了详细的论述,旨在揭示这两种事物,为后续研究者提供借鉴.
Interpolation Filter Design for Hearing-Aid Audio Class-D Output Stage Application

DEFF Research Database (Denmark)

Pracný, Peter; Bruun, Erik; Llimos Muntal, Pere

2012-01-01

This paper deals with a design of a digital interpolation filter for a 3rd order multi-bit ΣΔ modulator with over-sampling ratio OSR = 64. The interpolation filter and the ΣΔ modulator are part of the back-end of an audio signal processing system in a hearing-aid application. The aim in this paper...... is to compare this design to designs presented in other state-of-the-art works ranging from hi-fi audio to hearing-aids. By performing comparison, trends and tradeoffs in interpolation filter design are indentified and hearing-aid specifications are derived. The possibilities for hardware reduction...... in the interpolation filter are investigated. Proposed design simplifications presented here result in the least hardware demanding combination of oversampling ratio, number of stages and number of filter taps among a number of filters reported for audio applications....
Technical Evaluation Report 31: Internet Audio Products (3/ 3

Directory of Open Access Journals (Sweden)

Jim Rudolph

2004-08-01

Full Text Available Two contrasting additions to the online audio market are reviewed: iVocalize, a browser-based audio-conferencing software, and Skype, a PC-to-PC Internet telephone tool. These products are selected for review on the basis of their success in gaining rapid popular attention and usage during 2003-04. The iVocalize review emphasizes the product’s role in the development of a series of successful online audio communities – notably several serving visually impaired users. The Skype review stresses the ease with which the product may be used for simultaneous PC-to-PC communication among up to five users. Editor’s Note: This paper serves as an introduction to reports about online community building, and reviews of online products for disabled persons, in the next ten reports in this series. JPB, Series Ed.
The audio and visual communication systems for suited engineering activities on JET

International Nuclear Information System (INIS)

Pearce, R.J.H.; Bruce, J.; Callaghan, C.; Hart, M.; Martin, P.; Middleton, R.; Tait, J.

2001-01-01

The beryllium and/or tritium contamination of the JET tokamak and auxiliary systems necessitates that many activities are carried out in air line fed pressurised suits. To enable often complex engineering activities to be performed, a number of novel audio and visual and communications systems have been designed. The paper describes these systems which give freedom of visual and audio communication between suited personnel, supervisors, operators and engineers. The system enhances the safety of the working environment as well as helping to minimise the radiation dose to personnel. It is concluded, from a number of years experience of using the audio and visual communications systems for suited operations, that safety and the progress of complex engineering tasks have been significantly enhanced
The audio and visual communication systems for suited engineering activities on JET

Energy Technology Data Exchange (ETDEWEB)

Pearce, R.J.H. E-mail: robert.pearce@jet.uk; Bruce, J.; Callaghan, C.; Hart, M.; Martin, P.; Middleton, R.; Tait, J

2001-11-01

The beryllium and/or tritium contamination of the JET tokamak and auxiliary systems necessitates that many activities are carried out in air line fed pressurised suits. To enable often complex engineering activities to be performed, a number of novel audio and visual and communications systems have been designed. The paper describes these systems which give freedom of visual and audio communication between suited personnel, supervisors, operators and engineers. The system enhances the safety of the working environment as well as helping to minimise the radiation dose to personnel. It is concluded, from a number of years experience of using the audio and visual communications systems for suited operations, that safety and the progress of complex engineering tasks have been significantly enhanced.
Conflicting audio-haptic feedback in physically based simulation of walking sounds

DEFF Research Database (Denmark)

Turchet, Luca; Serafin, Stefania; Dimitrov, Smilen

2010-01-01

We describe an audio-haptic experiment conducted using a system which simulates in real-time the auditory and haptic sensation of walking on different surfaces. The system is based on physical models, that drive both the haptic and audio synthesizers, and a pair of shoes enhanced with sensors...... and actuators. Such experiment was run to examine the ability of subjects to recognize the different surfaces with both coherent and incoherent audio-haptic stimuli. Results show that in this kind of tasks the auditory modality is dominant on the haptic one....
Precision Scaling of Neural Networks for Efficient Audio Processing

OpenAIRE

Ko, Jong Hwan; Fromm, Josh; Philipose, Matthai; Tashev, Ivan; Zarar, Shuayb

2017-01-01

While deep neural networks have shown powerful performance in many audio applications, their large computation and memory demand has been a challenge for real-time processing. In this paper, we study the impact of scaling the precision of neural networks on the performance of two common audio processing tasks, namely, voice-activity detection and single-channel speech enhancement. We determine the optimal pair of weight/neuron bit precision by exploring its impact on both the performance and ...
Tooteko: a Case Study of Augmented Reality for AN Accessible Cultural Heritage. Digitization, 3d Printing and Sensors for AN Audio-Tactile Experience

Science.gov (United States)

D'Agnano, F.; Balletti, C.; Guerra, F.; Vernier, P.

2015-02-01

Tooteko is a smart ring that allows to navigate any 3D surface with your finger tips and get in return an audio content that is relevant in relation to the part of the surface you are touching in that moment. Tooteko can be applied to any tactile surface, object or sheet. However, in a more specific domain, it wants to make traditional art venues accessible to the blind, while providing support to the reading of the work for all through the recovery of the tactile dimension in order to facilitate the experience of contact with art that is not only "under glass." The system is made of three elements: a high-tech ring, a tactile surface tagged with NFC sensors, and an app for tablet or smartphone. The ring detects and reads the NFC tags and, thanks to the Tooteko app, communicates in wireless mode with the smart device. During the tactile navigation of the surface, when the finger reaches a hotspot, the ring identifies the NFC tag and activates, through the app, the audio track that is related to that specific hotspot. Thus a relevant audio content relates to each hotspot. The production process of the tactile surfaces involves scanning, digitization of data and 3D printing. The first experiment was modelled on the facade of the church of San Michele in Isola, made by Mauro Codussi in the late fifteenth century, and which marks the beginning of the Renaissance in Venice. Due to the absence of recent documentation on the church, the Correr Museum asked the Laboratorio di Fotogrammetria to provide it with the aim of setting up an exhibition about the order of the Camaldolesi, owners of the San Michele island and church. The Laboratorio has made the survey of the facade through laser scanning and UAV photogrammetry. The point clouds were the starting point for prototypation and 3D printing on different supports. The idea of the integration between a 3D printed tactile surface and sensors was born as a final thesis project at the Postgraduate Mastercourse in Digital
Digital Technologies Supporting Person-Centered Integrated Care - A Perspective.

Science.gov (United States)

Øvretveit, John

2017-09-25

Shared electronic health and social care records in some service systems are already showing some of the benefits of digital technology and digital data for integrating health and social care. These records are one example of the beginning "digitalisation" of services that gives a glimpse of the potential of digital technology and systems for building coordinated and individualized integrated care. Yet the promise has been greater than the benefits, and progress has been slow compared to other industries. This paper describes for non-technical readers how information technology was used to support integrated care schemes in six EU services, and suggests practical ways forward to use the new opportunities to build person-centered integrated care.
Audio Twister

DEFF Research Database (Denmark)

Cermak, Daniel; Moreno Garcia, Rodrigo; Monastiridis, Stefanos

2015-01-01

Daniel Cermak-Sassenrath, Rodrigo Moreno Garcia, Stefanos Monastiridis. Audio Twister. Installation. P-Hack Copenhagen 2015, Copenhagen, DK, Apr 24, 2015.......Daniel Cermak-Sassenrath, Rodrigo Moreno Garcia, Stefanos Monastiridis. Audio Twister. Installation. P-Hack Copenhagen 2015, Copenhagen, DK, Apr 24, 2015....
Back to basics audio

CERN Document Server

Nathan, Julian

1998-01-01

Back to Basics Audio is a thorough, yet approachable handbook on audio electronics theory and equipment. The first part of the book discusses electrical and audio principles. Those principles form a basis for understanding the operation of equipment and systems, covered in the second section. Finally, the author addresses planning and installation of a home audio system.Julian Nathan joined the audio service and manufacturing industry in 1954 and moved into motion picture engineering and production in 1960. He installed and operated recording theaters in Sydney, Austra
Digitální audio zesilovač

OpenAIRE

Tiller, Jakub

2010-01-01

Tématem bakalářské práce jsou výkonové audio zesilovače pracující ve třídě D. Jejich velké rozšíření je způsobeno hlavně vysokou účinností a dobrými parametry. Tato práce je zaměřena na rozbor jednotlivých částí těchto zesilovačů a na rozbor možností měření jejich parametrů. Následně je v práci uveden návrh zesilovače jako laboratorního přípravku s možností číslicového řízení zesílení a navržena automatizovaná měření parametrů tohoto zesilovače v prostředí VEE Pro. Dále je v této práci navrže...
Local Control of Audio Environment: A Review of Methods and Applications

Directory of Open Access Journals (Sweden)

Jussi Kuutti

2014-02-01

Full Text Available The concept of a local audio environment is to have sound playback locally restricted such that, ideally, adjacent regions of an indoor or outdoor space could exhibit their own individual audio content without interfering with each other. This would enable people to listen to their content of choice without disturbing others next to them, yet, without any headphones to block conversation. In practice, perfect sound containment in free air cannot be attained, but a local audio environment can still be satisfactorily approximated using directional speakers. Directional speakers may be based on regular audible frequencies or they may employ modulated ultrasound. Planar, parabolic, and array form factors are commonly used. The directivity of a speaker improves as its surface area and sound frequency increases, making these the main design factors for directional audio systems. Even directional speakers radiate some sound outside the main beam, and sound can also reflect from objects. Therefore, directional speaker systems perform best when there is enough ambient noise to mask the leaking sound. Possible areas of application for local audio include information and advertisement audio feed in commercial facilities, guiding and narration in museums and exhibitions, office space personalization, control room messaging, rehabilitation environments, and entertainment audio systems.
Open source platform Digital Personal Assistant

OpenAIRE

Usachev, Denis; Khusnutdinov, Azat; Mazzara, Manuel; Khan, Adil; Panchenko, Ivan

2018-01-01

Nowadays Digital Personal Assistants (DPA) become more and more popular. DPAs help to increase quality of life especially for elderly or disabled people. In this paper we develop an open source DPA and smart home system as a 3-rd party extension to show the functionality of the assistant. The system is designed to use the DPA as a learning platform for engineers to provide them with the opportunity to create and test their own hypothesis. The DPA is able to recognize users' commands in natura...
Collusion-resistant audio fingerprinting system in the modulated complex lapped transform domain.

Directory of Open Access Journals (Sweden)

Jose Juan Garcia-Hernandez

Full Text Available Collusion-resistant fingerprinting paradigm seems to be a practical solution to the piracy problem as it allows media owners to detect any unauthorized copy and trace it back to the dishonest users. Despite the billionaire losses in the music industry, most of the collusion-resistant fingerprinting systems are devoted to digital images and very few to audio signals. In this paper, state-of-the-art collusion-resistant fingerprinting ideas are extended to audio signals and the corresponding parameters and operation conditions are proposed. Moreover, in order to carry out fingerprint detection using just a fraction of the pirate audio clip, block-based embedding and its corresponding detector is proposed. Extensive simulations show the robustness of the proposed system against average collusion attack. Moreover, by using an efficient Fast Fourier Transform core and standard computer machines it is shown that the proposed system is suitable for real-world scenarios.
Audio Haptic Videogaming for Developing Wayfinding Skills in Learners Who are Blind.

Science.gov (United States)

Sánchez, Jaime; de Borba Campos, Marcia; Espinoza, Matías; Merabet, Lotfi B

2014-01-01

Interactive digital technologies are currently being developed as a novel tool for education and skill development. Audiopolis is an audio and haptic based videogame designed for developing orientation and mobility (O&M) skills in people who are blind. We have evaluated the cognitive impact of videogame play on O&M skills by assessing performance on a series of behavioral tasks carried out in both indoor and outdoor virtual spaces. Our results demonstrate that the use of Audiopolis had a positive impact on the development and use of O&M skills in school-aged learners who are blind. The impact of audio and haptic information on learning is also discussed.
Economic and legal aspects of introducing novel ICT instruments: integrating sound into social media marketing - from audio branding to soundscaping

Directory of Open Access Journals (Sweden)

Daj, A.

2013-12-01

Full Text Available The pervasive expansion and implementation of ICT based marketing instruments imposes a new economic investigation of business models and regulatory solutions. Moreover, the current status of Social Media research indicates that the use of social networking and collaboration technologies is deeply changing the way people communicate, consume and cooperate with each other. Against the backdrop of widespread availability of digital audio-video content and the growing number of “smart” mobile devices, business professionals have developed new strategies for achieving customer involvement and retention through digitally linking audio stimuli to the powerful networking environment of Social Media.

Evaluating a Personal Learning Environment for Digital Storytelling

Directory of Open Access Journals (Sweden)

Nikolaos Marianos

2011-10-01

Full Text Available The evaluation of flexible and personal learning environments is extremely challenging. It should not be limited to the assessment of products, but should address the quality of educative experience with close monitoring. The evaluation of a PLE using digital storytelling is even more complicated, due to the unpredictability of the usage scenarios. This paper presents an evaluation methodology for PLEs using digital storytelling, using a participatory design approach. The results from an open validation trial indicate that this methodology is able to incorporate all necessary factors and that the selected evaluation tools are appropriate for addressing the quality of educative experience.
Digital Music Lab: A Framework for Analysing Big Music Data

OpenAIRE

Abdallah, S.; Benetos, E.; Gold, N. E.; Hargreaves, S.; Weyde, T.; Wolff, D.

2016-01-01

In the transition from traditional to digital musicology, large scale music data are increasingly becoming available which require research methods that work on the collection level and at scale. In the Digital Music Lab (DML) project, a software system has been developed that provides large-scale analysis of music audio with an interactive interface. The DML system includes distributed processing of audio and other music data, remote analysis of copyright-restricted data, logical inference o...
Noise-Canceling Helmet Audio System

Science.gov (United States)

Seibert, Marc A.; Culotta, Anthony J.

2007-01-01

A prototype helmet audio system has been developed to improve voice communication for the wearer in a noisy environment. The system was originally intended to be used in a space suit, wherein noise generated by airflow of the spacesuit life-support system can make it difficult for remote listeners to understand the astronaut s speech and can interfere with the astronaut s attempt to issue vocal commands to a voice-controlled robot. The system could be adapted to terrestrial use in helmets of protective suits that are typically worn in noisy settings: examples include biohazard, fire, rescue, and diving suits. The system (see figure) includes an array of microphones and small loudspeakers mounted at fixed positions in a helmet, amplifiers and signal-routing circuitry, and a commercial digital signal processor (DSP). Notwithstanding the fixed positions of the microphones and loudspeakers, the system can accommodate itself to any normal motion of the wearer s head within the helmet. The system operates in conjunction with a radio transceiver. An audio signal arriving via the transceiver intended to be heard by the wearer is adjusted in volume and otherwise conditioned and sent to the loudspeakers. The wearer s speech is collected by the microphones, the outputs of which are logically combined (phased) so as to form a microphone- array directional sensitivity pattern that discriminates in favor of sounds coming from vicinity of the wearer s mouth and against sounds coming from elsewhere. In the DSP, digitized samples of the microphone outputs are processed to filter out airflow noise and to eliminate feedback from the loudspeakers to the microphones. The resulting conditioned version of the wearer s speech signal is sent to the transceiver.
User Evaluation of a Soldier Flexible Display Personal Digital Assistant

National Research Council Canada - National Science Library

Sampson, James B; Boynton, Angela C; Mitchell, K. B; Magnifico, Dennis S; DuPont, Frederick J

2008-01-01

The U.S. Army Natick Soldier Research, Development and Engineering Center and U.S. Army Research Laboratory, Human Research and Engineering Directorate conducted an evaluation of a Soldier Flexible Display Personal Digital Assistant...
Enhancing a Core Journal Collection for Digital Libraries

Science.gov (United States)

Kovacevic, Ana; Devedzic, Vladan; Pocajt, Viktor

2010-01-01

Purpose: This paper aims to address the problem of enhancing the selection of titles offered by a digital library, by analysing the differences in these titles when they are cited by local authors in their publications and when they are listed in the digital library offer. Design/methodology/approach: Text mining techniques were used to identify…
Effects of image enhancement on reliability of landmark identification in digital cephalometry

Directory of Open Access Journals (Sweden)

M Oshagh

2013-01-01

Full Text Available Introduction: Although digital cephalometric radiography is gaining popularity in orthodontic practice, the most important source of error in its tracing is uncertainty in landmark identification. Therefore, efforts to improve accuracy in landmark identification were directed primarily toward the improvement in image quality. One of the more useful techniques of this process involves digital image enhancement which can increase overall visual quality of image, but this does not necessarily mean a better identification of landmarks. The purpose of this study was to evaluate the effectiveness of digital image enhancements on reliability of landmark identification. Materials and Methods: Fifteen common landmarks including 10 skeletal and 5 soft tissues were selected on the cephalograms of 20 randomly selected patients, prepared in Natural Head Position (NHP. Two observers (orthodontists identified landmarks on the 20 original photostimulable phosphor (PSP digital cephalogram images and 20 enhanced digital images twice with an intervening time interval of at least 4 weeks. The x and y coordinates were further analyzed to evaluate the pattern of recording differences in horizontal and vertical directions. Reliability of landmarks identification was analyzed by paired t test. Results: There was a significant difference between original and enhanced digital images in terms of reliability of points Ar and N in vertical and horizontal dimensions, and enhanced images were significantly more reliable than original images. Identification of A point, Pogonion and Pronasal points, in vertical dimension of enhanced images was significantly more reliable than original ones. Reliability of Menton point identification in horizontal dimension was significantly more in enhanced images than original ones. Conclusion: Direct digital image enhancement by altering brightness and contrast can increase reliability of some landmark identification and this may lead to more
Semantic congruency but not temporal synchrony enhances long-term memory performance for audio-visual scenes.

Science.gov (United States)

Meyerhoff, Hauke S; Huff, Markus

2016-04-01

Human long-term memory for visual objects and scenes is tremendous. Here, we test how auditory information contributes to long-term memory performance for realistic scenes. In a total of six experiments, we manipulated the presentation modality (auditory, visual, audio-visual) as well as semantic congruency and temporal synchrony between auditory and visual information of brief filmic clips. Our results show that audio-visual clips generally elicit more accurate memory performance than unimodal clips. This advantage even increases with congruent visual and auditory information. However, violations of audio-visual synchrony hardly have any influence on memory performance. Memory performance remained intact even with a sequential presentation of auditory and visual information, but finally declined when the matching tracks of one scene were presented separately with intervening tracks during learning. With respect to memory performance, our results therefore show that audio-visual integration is sensitive to semantic congruency but remarkably robust against asymmetries between different modalities.
enhanced digital library system that supports sustainable knowledge

African Journals Online (AJOL)

Digital libraries are well known for sharing resources all over the world. Several ... The person with more information will guide a group or society and he ..... librarians should work out among themselves a co-operative means of tracking the.
Tablets enhance play by taking toddlers on a digital adventure

DEFF Research Database (Denmark)

Johansen, Stine Liv

2017-01-01

Tablets do not necessarily make children passive or sedentary. In a Danish nursery, a group of scientists, educators, and education consultants have studied how digital technologies can be used to enhance playful activities.......Tablets do not necessarily make children passive or sedentary. In a Danish nursery, a group of scientists, educators, and education consultants have studied how digital technologies can be used to enhance playful activities....
Real-time Loudspeaker Distance Estimation with Stereo Audio

DEFF Research Database (Denmark)

Nielsen, Jesper Kjær; Gaubitch, Nikolay; Heusdens, Richard

2015-01-01

Knowledge on how a number of loudspeakers are positioned relative to a listening position can be used to enhance the listening experience. Usually, these loudspeaker positions are estimated using calibration signals, either audible or psycho-acoustically hidden inside the desired audio signal...
Audio-visual biofeedback for respiratory-gated radiotherapy: Impact of audio instruction and audio-visual biofeedback on respiratory-gated radiotherapy

International Nuclear Information System (INIS)

George, Rohini; Chung, Theodore D.; Vedam, Sastry S.; Ramakrishnan, Viswanathan; Mohan, Radhe; Weiss, Elisabeth; Keall, Paul J.

2006-01-01

Purpose: Respiratory gating is a commercially available technology for reducing the deleterious effects of motion during imaging and treatment. The efficacy of gating is dependent on the reproducibility within and between respiratory cycles during imaging and treatment. The aim of this study was to determine whether audio-visual biofeedback can improve respiratory reproducibility by decreasing residual motion and therefore increasing the accuracy of gated radiotherapy. Methods and Materials: A total of 331 respiratory traces were collected from 24 lung cancer patients. The protocol consisted of five breathing training sessions spaced about a week apart. Within each session the patients initially breathed without any instruction (free breathing), with audio instructions and with audio-visual biofeedback. Residual motion was quantified by the standard deviation of the respiratory signal within the gating window. Results: Audio-visual biofeedback significantly reduced residual motion compared with free breathing and audio instruction. Displacement-based gating has lower residual motion than phase-based gating. Little reduction in residual motion was found for duty cycles less than 30%; for duty cycles above 50% there was a sharp increase in residual motion. Conclusions: The efficiency and reproducibility of gating can be improved by: incorporating audio-visual biofeedback, using a 30-50% duty cycle, gating during exhalation, and using displacement-based gating
N1 enhancement in synesthesia during visual and audio-visual perception in semantic cross-modal conflict situations: an ERP study

Directory of Open Access Journals (Sweden)

Christopher eSinke

2014-01-01

Full Text Available Synesthesia entails a special kind of sensory perception, where stimulation in one sensory modality leads to an internally generated perceptual experience of another, not stimulated sensory modality. This phenomenon can be viewed as an abnormal multisensory integration process as here the synesthetic percept is aberrantly fused with the stimulated modality. Indeed, recent synesthesia research has focused on multimodal processing even outside of the specific synesthesia-inducing context and has revealed changed multimodal integration, thus suggesting perceptual alterations at a global level. Here, we focused on audio-visual processing in synesthesia using a semantic classification task in combination with visually or auditory-visually presented animated and inanimated objects in an audio-visual congruent and incongruent manner. Fourteen subjects with auditory-visual and/or grapheme-color synesthesia and 14 control subjects participated in the experiment. During presentation of the stimuli, event-related potentials were recorded from 32 electrodes. The analysis of reaction times and error rates revealed no group differences with best performance for audio-visually congruent stimulation indicating the well-known multimodal facilitation effect. We found an enhanced amplitude of the N1 component over occipital electrode sites for synesthetes compared to controls. The differences occurred irrespective of the experimental condition and therefore suggest a global influence on early sensory processing in synesthetes.
Enhancing Literacy and Curriculum Using Digitalized Collections and Approaches

Science.gov (United States)

Lukenbill, Bill

2010-01-01

Digitized collections offer a wealth of resources for improving a wide variety of literacies that promote critical thinking skills, instruction and curriculum enhancements. Digitized collections and processes are increasing rapidly in their development and availability and as such introduce issues such as public access, copyright laws, limitations…
Digital holographic-based cancellable biometric for personal authentication

International Nuclear Information System (INIS)

Verma, Gaurav; Sinha, Aloka

2016-01-01

In this paper, we propose a new digital holographic-based cancellable biometric scheme for personal authentication and verification. The realization of cancellable biometric is presented by using an optoelectronic experimental approach, in which an optically recorded hologram of the fingerprint of a person is numerically reconstructed. Each reconstructed feature has its own perspective, which is utilized to generate user-specific fingerprint features by using a feature-extraction process. New representations of the user-specific fingerprint features can be obtained from the same hologram, by changing the reconstruction distance (d) by an amount Δd between the recording plane and the reconstruction plane. This parameter is the key to make the cancellable user-specific fingerprint features using a digital holographic technique, which allows us to choose different reconstruction distances when reissuing the user-specific fingerprint features in the event of compromise. We have shown theoretically that each user-specific fingerprint feature has a unique identity with a high discrimination ability, and the chances of a match between them are minimal. In this aspect, a recognition system has also been demonstrated using the fingerprint biometric of the enrolled person at a particular reconstruction distance. For the performance evaluation of a fingerprint recognition system—the false acceptance ratio, the false rejection ratio and the equal error rate are calculated using correlation. The obtained results show good discrimination ability between the genuine and the impostor populations with the highest recognition rate of 98.23%. (paper)
Involving a young person in the development of a digital resource in nurse education.

Science.gov (United States)

Fenton, Gaynor

2014-01-01

Health policies across western societies have embedded the need for service user and carer perspectives in service design and delivery of educational programmes. There is a growing recognition of the need to include the perspectives of children and young people as service users in the design and delivery of child focused educational programmes. Digital storytelling provides a strategy for student nurses to gain insight into the lived experiences of children and young people. Engaging with these stories enables students to develop an understanding of a young persons' experience of healthcare. This paper outlines a project that developed a digital learning object based upon a young person's experience of cancer and student evaluations of the digital learning object as a teaching and learning strategy. Over 80% of students rated the digital learning object as interesting and were motivated to explore its content. In addition, the evaluation highlighted that listening to the young person's experiences of her treatment regimes was informative and assisted understanding of a patients' perspective of care delivery. Copyright © 2013 Elsevier Ltd. All rights reserved.
Digital Technologies Supporting Person-Centered Integrated Care – A Perspective

Directory of Open Access Journals (Sweden)

John Øvretveit

2017-09-01

Full Text Available Shared electronic health and social care records in some service systems are already showing some of the benefits of digital technology and digital data for integrating health and social care. These records are one example of the beginning “digitalisation” of services that gives a glimpse of the potential of digital technology and systems for building coordinated and individualized integrated care. Yet the promise has been greater than the benefits, and progress has been slow compared to other industries. This paper describes for non-technical readers how information technology was used to support integrated care schemes in six EU services, and suggests practical ways forward to use the new opportunities to build person-centered integrated care.
Digital Technologies Supporting Person-Centered Integrated Care – A Perspective

Science.gov (United States)

2017-01-01

Shared electronic health and social care records in some service systems are already showing some of the benefits of digital technology and digital data for integrating health and social care. These records are one example of the beginning “digitalisation” of services that gives a glimpse of the potential of digital technology and systems for building coordinated and individualized integrated care. Yet the promise has been greater than the benefits, and progress has been slow compared to other industries. This paper describes for non-technical readers how information technology was used to support integrated care schemes in six EU services, and suggests practical ways forward to use the new opportunities to build person-centered integrated care. PMID:29588629
Issues and prospects of digitizing liberation movements' archives ...

African Journals Online (AJOL)

User

collective memory. Keywords: digitization,. NAHECS, liberation archives, digitally born, audio- visual. Introduction and background to liberation archives. Archives are generally records of .... long term preservation and access to selected archival materials ..... by International Library of African Music. (ILAM) in. Rhodes.
Digital circuit for the introduction and later removal of dither from an analog signal

Science.gov (United States)

Borgen, Gary S.

1994-05-01

An electronics circuit is presented for accurately digitizing an analog audio or like data signal into a digital equivalent signal by introducing dither into the analog signal and then subsequently removing the dither from the digitized signal prior to its conversion to an analog signal which is a substantial replica of the incoming analog audio or like data signal. The electronics circuit of the present invention is characterized by a first pseudo-random number generator which generates digital random noise signals or dither for addition to the digital equivalent signal and a second pseudo-random number generator which generates subtractive digital random noise signals for the subsequent removal of dither from the digital equivalent signal prior its conversion to the analog replica signal.
Guided Expectations: A Case Study of a Sound Collage Audio Guide

DEFF Research Database (Denmark)

Laursen, Ditte

This paper is a user evaluation of a mobile phone audio guide developed for visitors to use at the National Gallery of Denmark. The audio guide is offered as a downloadable MP3 file to every incoming visitor who is carrying a mobile phone with an open Bluetooth connection. The guide itself...... according to personal interest, and a conflict between the expectation of a learning experience rather than an aesthetic experience. Results indicate that most visitors are able to make sense of the guide and to use it successfully, in different ways, to enrich their visit. Evaluation also shows...... that visitors are fond of using their own mobile phones - but they have several problems with their phones in downloading the MP3 file. Read more: Guided Expectations: A Case Study of a Sound Collage Audio Guide | conference.archimuse.com...

GaN Power Stage for Switch-mode Audio Amplification

DEFF Research Database (Denmark)

Ploug, Rasmus Overgaard; Knott, Arnold; Poulsen, Søren Bang

2015-01-01

Gallium Nitride (GaN) based power transistors are gaining more and more attention since the introduction of the enhancement mode eGaN Field Effect Transistor (FET) which makes an adaptation from Metal-Oxide Semiconductor (MOSFET) to eGaN based technology less complex than by using depletion mode Ga......N FETs. This project seeks to investigate the possibilities of using eGaN FETs as the power switching device in a full bridge power stage intended for switch mode audio amplification. A 50 W 1 MHz power stage was built and provided promising audio performance. Future work includes optimization of dead...
The effect of digital storytelling in improving the third graders' writing skills

Directory of Open Access Journals (Sweden)

Ahmet Yamaç

2016-09-01

Full Text Available The aim of this action research was to investigate the effects of digital storytelling in improving the writing skills of third grade students enrolled in rural primary schools. The writing performances of the students were measured before and after the teaching procedures of digital storytelling. Then, the process of narrative writing with digital storytelling was profoundly and carefully explored through observation and field notes, interviews, audio and video records, student diaries and documents, and student products. The results indicated that digital storytelling enhanced students’ ideas, organization, word choice, sentence fluency, and conventions in terms of writing quality. Similarly, the digital storytelling improved story elements and word counts in stories. In terms of the quality of students’ digital stories, the results demonstrated a steady progress in the elements of digital stories, and the technology literacy and competency of students throughout the process. Besides, the digital storytelling modified the process of narrative writing, and emerged as a beneficial tool to overcome the digital divide by developing students’ new literacy perception, competency, and skills. The digital storytelling also created learning community by improving interactions among students in the classroom, and increased their motivation to write.
The Effect of Digital Storytelling in Improving the Third Graders' Writing Skills

Directory of Open Access Journals (Sweden)

Ahmet YAMAÇ

2016-09-01

Full Text Available The aim of this action research was to investigate the effects of digital storytelling in improving the writing skills of third grade students enrolled in rural primary schools. The writing performances of the students were measured before and after the teaching procedures of digital storytelling. Then, the process of narrative writing with digital storytelling was profoundly and carefully explored through observation and field notes, interviews, audio and video records, student diaries and documents, and student products. The results indicated that digital storytelling enhanced students’ ideas, organization, word choice, sentence fluency, and conventions in terms of writing quality. Similarly, the digital storytelling improved story elements and word counts in stories. In terms of the quality of students’ digital stories, the results demonstrated a steady progress in the elements of digital stories, and the technology literacy and competency of students throughout the process. Besides, the digital storytelling modified the process of narrative writing, and emerged as a beneficial tool to overcome the digital divide by developing students’ new literacy perception, competency, and skills. The digital storytelling also created learning community by improving interactions among students in the classroom, and increased their motivation to write.
Digital enhancement of computerized axial tomograms

Science.gov (United States)

Roberts, E., Jr.

1978-01-01

A systematic evaluation has been conducted of certain digital image enhancement techniques performed in image space. Three types of images have been used, computer generated phantoms, tomograms of a synthetic phantom, and axial tomograms of human anatomy containing images of lesions, artificially introduced into the tomograms. Several types of smoothing, sharpening, and histogram modification have been explored. It has been concluded that the most useful enhancement techniques are a selective smoothing of singular picture elements, combined with contrast manipulation. The most useful tool in applying these techniques is the gray-scale histogram.
3D interactive augmented reality-enhanced digital learning systems for mobile devices

Science.gov (United States)

Feng, Kai-Ten; Tseng, Po-Hsuan; Chiu, Pei-Shuan; Yang, Jia-Lin; Chiu, Chun-Jie

2013-03-01

With enhanced processing capability of mobile platforms, augmented reality (AR) has been considered a promising technology for achieving enhanced user experiences (UX). Augmented reality is to impose virtual information, e.g., videos and images, onto a live-view digital display. UX on real-world environment via the display can be e ectively enhanced with the adoption of interactive AR technology. Enhancement on UX can be bene cial for digital learning systems. There are existing research works based on AR targeting for the design of e-learning systems. However, none of these work focuses on providing three-dimensional (3-D) object modeling for en- hanced UX based on interactive AR techniques. In this paper, the 3-D interactive augmented reality-enhanced learning (IARL) systems will be proposed to provide enhanced UX for digital learning. The proposed IARL systems consist of two major components, including the markerless pattern recognition (MPR) for 3-D models and velocity-based object tracking (VOT) algorithms. Realistic implementation of proposed IARL system is conducted on Android-based mobile platforms. UX on digital learning can be greatly improved with the adoption of proposed IARL systems.
Sound for digital video

CERN Document Server

Holman, Tomlinson

2013-01-01

Achieve professional quality sound on a limited budget! Harness all new, Hollywood style audio techniques to bring your independent film and video productions to the next level.In Sound for Digital Video, Second Edition industry experts Tomlinson Holman and Arthur Baum give you the tools and knowledge to apply recent advances in audio capture, video recording, editing workflow, and mixing to your own film or video with stunning results. This fresh edition is chockfull of techniques, tricks, and workflow secrets that you can apply to your own projects from preproduction
Audio-Visual Aid in Teaching "Fatty Liver"

Science.gov (United States)

Dash, Sambit; Kamath, Ullas; Rao, Guruprasad; Prakash, Jay; Mishra, Snigdha

2016-01-01

Use of audio visual tools to aid in medical education is ever on a rise. Our study intends to find the efficacy of a video prepared on "fatty liver," a topic that is often a challenge for pre-clinical teachers, in enhancing cognitive processing and ultimately learning. We prepared a video presentation of 11:36 min, incorporating various…
Attention to affective audio-visual information: Comparison between musicians and non-musicians

NARCIS (Netherlands)

Weijkamp, J.; Sadakata, M.

2017-01-01

Individuals with more musical training repeatedly demonstrate enhanced auditory perception abilities. The current study examined how these enhanced auditory skills interact with attention to affective audio-visual stimuli. A total of 16 participants with more than 5 years of musical training
Comparison of In-Person vs. Digital Climate Education Program

Science.gov (United States)

Anderson, R. K.; Flora, J. A.; Saphir, M.

2017-12-01

In 2014, ACE (Alliance for Climate Education) evaluated the impact of its 45-minute live climate edutainment education program on the knowledge, attitudes and behavior of high school students with respect to climate change. The results showed gains in knowledge, increased engagement, as well as increased communication about climate change with number of students reporting talking about climate change with friends and family more than doubling. In 2016, ACE launched a digital version of its in-person edutainment education program, a 40-minute video version of the live program. This digital version, Our Climate Our Future (OCOF), has now been used by nearly 4,000 teachers nationwide and viewed by over 150,000 students. We experimentally tested the impact of the digital program (OCOF) compared to the live program and a control group. The experiment was conducted with 709 students in 27 classes at two North Carolina public high schools. Classes were assigned to one of three conditions: digital, live and control. In the digital version, students watched the 40-minute OCOF video featuring the same educator that presented the live program. In the live version, students received an identical 40-minute live presentation by an ACE staff educator The control group received neither treatment. When compared to controls, both programs were effective in positively increasing climate change knowledge, climate justice knowledge, perceived self-efficacy to make climate-friendly behavior changes, and beliefs about climate change (all statistically significant at or above P<.01). In the areas of hope that people can solve climate change and intent to change behavior, only the live program showed significant increases. In these two areas, it may be that an in-person experience is key to affecting change. In light of these positive results, ACE plans to increase the use of OCOF in schools across the country to assist teachers in their efforts to teach about climate change.
Reduction in time-to-sleep through EEG based brain state detection and audio stimulation.

Science.gov (United States)

Zhuo Zhang; Cuntai Guan; Ti Eu Chan; Juanhong Yu; Aung Aung Phyo Wai; Chuanchu Wang; Haihong Zhang

2015-08-01

We developed an EEG- and audio-based sleep sensing and enhancing system, called iSleep (interactive Sleep enhancement apparatus). The system adopts a closed-loop approach which optimizes the audio recording selection based on user's sleep status detected through our online EEG computing algorithm. The iSleep prototype comprises two major parts: 1) a sleeping mask integrated with a single channel EEG electrode and amplifier, a pair of stereo earphones and a microcontroller with wireless circuit for control and data streaming; 2) a mobile app to receive EEG signals for online sleep monitoring and audio playback control. In this study we attempt to validate our hypothesis that appropriate audio stimulation in relation to brain state can induce faster onset of sleep and improve the quality of a nap. We conduct experiments on 28 healthy subjects, each undergoing two nap sessions - one with a quiet background and one with our audio-stimulation. We compare the time-to-sleep in both sessions between two groups of subjects, e.g., fast and slow sleep onset groups. The p-value obtained from Wilcoxon Signed Rank Test is 1.22e-04 for slow onset group, which demonstrates that iSleep can significantly reduce the time-to-sleep for people with difficulty in falling sleep.
∑∆ Modulator System-Level Considerations for Hearing-Aid Audio Class-D Output Stage Application

DEFF Research Database (Denmark)

Pracný, Peter; Bruun, Erik

2012-01-01

This paper deals with a system-level design of a digital sigma-delta (∑∆) modulator for hearing-aid audio Class D output stage application. The aim of this paper is to provide a thorough discussion on various possibilities and tradeoffs of ∑∆ modulator system-level design parameter combinations...... - order, oversampling ratio (OSR) and number of bits in the quantizer - including their impact on interpolation filter design as well. The system is kept in digital domain up to the input of the Class D power stage including the digital pulse width modulation (DPWM) block. Notes on the impact of the DPWM...
Categorizing Video Game Audio

DEFF Research Database (Denmark)

Westerberg, Andreas Rytter; Schoenau-Fog, Henrik

2015-01-01

they can use audio in video games. The conclusion of this study is that the current models' view of the diegetic spaces, used to categorize video game audio, is not t to categorize all sounds. This can however possibly be changed though a rethinking of how the player interprets audio.......This paper dives into the subject of video game audio and how it can be categorized in order to deliver a message to a player in the most precise way. A new categorization, with a new take on the diegetic spaces, can be used a tool of inspiration for sound- and game-designers to rethink how...
Core architectures for digital media and the associated compilation techniques

NARCIS (Netherlands)

Jess, J.A.G.; Reis, R.; Lubaszewski, M.; Jess, J.A.G.

2006-01-01

The new generation of multimedia systems will be fully digital. This includes real time digital TV transmission via cable, satellite and terrestrial channels as well as digital audio broadcasting. A number of standards have been developed such as those of the ‘‘Moving Picture Experts Group’’ (MPEG).
High-Fidelity Piezoelectric Audio Device

Science.gov (United States)

Woodward, Stanley E.; Fox, Robert L.; Bryant, Robert G.

2003-01-01

ModalMax is a very innovative means of harnessing the vibration of a piezoelectric actuator to produce an energy efficient low-profile device with high-bandwidth high-fidelity audio response. The piezoelectric audio device outperforms many commercially available speakers made using speaker cones. The piezoelectric device weighs substantially less (4 g) than the speaker cones which use magnets (10 g). ModalMax devices have extreme fabrication simplicity. The entire audio device is fabricated by lamination. The simplicity of the design lends itself to lower cost. The piezoelectric audio device can be used without its acoustic chambers and thereby resulting in a very low thickness of 0.023 in. (0.58 mm). The piezoelectric audio device can be completely encapsulated, which makes it very attractive for use in wet environments. Encapsulation does not significantly alter the audio response. Its small size (see Figure 1) is applicable to many consumer electronic products, such as pagers, portable radios, headphones, laptop computers, computer monitors, toys, and electronic games. The audio device can also be used in automobile or aircraft sound systems.
Digital knowledge in the coat pocket - hand-held personal digital assistants in radiology

International Nuclear Information System (INIS)

Niehues, S.M.; Froehlich, M.; Felix, R.; Lemke, A.J.

2004-01-01

The personal digital assistant (PDA) enables the independent access to large data in a pocket-sized format. The applications for hand-held computers are growing steadily and can support almost any kind of problem. An overview of the available hardware and software is provided and evaluated. Furthermore, the use of the PDA in the clinical daily routine is described. In view of the numerous software programs available in radiology, the range of software solutions for radiologists is presented. Despite the high acquisition cost, the PDA has already become the digital assistant for the radiologist. After a short time of getting used to the PDA, nobody wants to miss it at work or at home. New technical features and available software programs will continuously increase the integration of the PDA into the medical workflow in the near future. (orig.)
The MPEG Representation of Digital Media

CERN Document Server

2012-01-01

More and more information, audio and video but also a range of other information type, is generated, processed and used by machines today, even though the end user may be a human. The result over the past 15 years has been a substantial increase in the type of information and change in the way humans generate, classify, store, search, access and consume information. Conversion of information to digital form is a prerequisite for this enhanced machine role, but must be done having in mind requirements such as compactness, fidelity, interpretability etc. This book provides an overview of the basic technology and mechanisms underpinning the operation of MPEG standards. It is a valuable reference for those making decisions in products and services based on digital media, those with general background, engaged in studies or developments of MPEG-related implementations, and those curious about MPEG and its role in the development of successful, standard technologies. Offers an overview of what’s behind MP3, dig...
Potential Cost Savings of Contrast-Enhanced Digital Mammography.

Science.gov (United States)

Patel, Bhavika K; Gray, Richard J; Pockaj, Barbara A

2017-06-01

The purpose of this article is to discuss whether the sensitivity and specificity of contrast-enhanced digital mammography (CEDM) render it a viable diagnostic alternative to breast MRI. That CEDM couples low-energy images (comparable to the diagnostic quality of standard mammography) and subtracted contrast-enhanced mammograms make it a cost-effective modality and a realistic substitute for the more costly breast MRI.
A Joint Audio-Visual Approach to Audio Localization

DEFF Research Database (Denmark)

Jensen, Jesper Rindom; Christensen, Mads Græsbøll

2015-01-01

Localization of audio sources is an important research problem, e.g., to facilitate noise reduction. In the recent years, the problem has been tackled using distributed microphone arrays (DMA). A common approach is to apply direction-of-arrival (DOA) estimation on each array (denoted as nodes), a...... time-of-flight cameras. Moreover, we propose an optimal method for weighting such DOA and range information for audio localization. Our experiments on both synthetic and real data show that there is a clear, potential advantage of using the joint audiovisual localization framework....
Issues and prospects of digitizing liberation movements' archives ...

African Journals Online (AJOL)

The major findings were that all the ANC audio material has been successfully digitized through the Multichoice funded digitization project though a lot of work has to ... history and the critically endangered audiovisual heritage whose pace of deterioration is increasingly leading to the extinction of this vital collective memory.
Renegotiating the pedagogic contract: Teaching in digitally enhanced secondary science classrooms

Science.gov (United States)

Ajayi, Ajibola Oluneye

This qualitative case study explores the effects of emerging digital technology as a teaching and learning tool in secondary school science classrooms. The study examines three teachers' perspectives on how the use of technology affects the teacher-student pedagogic relationship. The "pedagogic contract" is used as a construct to analyze the changes that took place in these teachers' classrooms amid the use of this new technology. The overarching question for this research is: How was the pedagogic contract renegotiated in three secondary science teachers' classrooms through the use of digitally enhanced science instruction. To answer this question, data was collected via semi-structured teacher interviews, classroom observations, and analysis of classroom documents such as student assignments, tests and Study Guides. This study reveals that the everyday use of digital technologies in these classrooms resulted in a re-negotiated pedagogic contract across three major dimensions: content of learning, method and management of learning activities, and assessment of learning. The extent to which the pedagogic contract was renegotiated varied with each of the teachers studied. Yet in each case, the content of learning was extended to include new topics, and greater depth of learning within the mandated curriculum. The management of learning was reshaped around metacognitive strategies, personal goal-setting, individual pacing, and small-group learning activities. With the assessment of learning, there was increased emphasis on self-directed interactive testing as a formative assessment tool. This study highlights the aspects of science classrooms that are most directly affected by the introduction of digital technologies and demonstrates how those changes are best understood as a renegotiation of the teacher-student pedagogic contract.

Modeling Audio Fingerprints : Structure, Distortion, Capacity

NARCIS (Netherlands)

Doets, P.J.O.

2010-01-01

An audio fingerprint is a compact low-level representation of a multimedia signal. An audio fingerprint can be used to identify audio files or fragments in a reliable way. The use of audio fingerprints for identification consists of two phases. In the enrollment phase known content is fingerprinted,
Digital signal processing

CERN Document Server

O'Shea, Peter; Hussain, Zahir M

2011-01-01

In three parts, this book contributes to the advancement of engineering education and that serves as a general reference on digital signal processing. Part I presents the basics of analog and digital signals and systems in the time and frequency domain. It covers the core topics: convolution, transforms, filters, and random signal analysis. It also treats important applications including signal detection in noise, radar range estimation for airborne targets, binary communication systems, channel estimation, banking and financial applications, and audio effects production. Part II considers sel
The newest digital signal processing

International Nuclear Information System (INIS)

Lee, Chae Uk

2002-08-01

This book deal with the newest digital signal processing, which contains introduction on conception of digital signal processing, constitution and purpose, signal and system such as signal, continuos signal, discrete signal and discrete system, I/O expression on impress response, convolution, mutual connection of system and frequency character,z transform of definition, range, application of z transform and relationship with laplace transform, Discrete fourier, Fast fourier transform on IDFT algorithm and FFT application, foundation of digital filter of notion, expression, types, frequency characteristic of digital filter and design order of filter, Design order of filter, Design of FIR digital filter, Design of IIR digital filter, Adaptive signal processing, Audio signal processing, video signal processing and application of digital signal processing.
Introduction to audio analysis a MATLAB approach

CERN Document Server

Giannakopoulos, Theodoros

2014-01-01

Introduction to Audio Analysis serves as a standalone introduction to audio analysis, providing theoretical background to many state-of-the-art techniques. It covers the essential theory necessary to develop audio engineering applications, but also uses programming techniques, notably MATLAB®, to take a more applied approach to the topic. Basic theory and reproducible experiments are combined to demonstrate theoretical concepts from a practical point of view and provide a solid foundation in the field of audio analysis. Audio feature extraction, audio classification, audio segmentation, au
Advances in audio source seperation and multisource audio content retrieval

Science.gov (United States)

Vincent, Emmanuel

2012-06-01

Audio source separation aims to extract the signals of individual sound sources from a given recording. In this paper, we review three recent advances which improve the robustness of source separation in real-world challenging scenarios and enable its use for multisource content retrieval tasks, such as automatic speech recognition (ASR) or acoustic event detection (AED) in noisy environments. We present a Flexible Audio Source Separation Toolkit (FASST) and discuss its advantages compared to earlier approaches such as independent component analysis (ICA) and sparse component analysis (SCA). We explain how cues as diverse as harmonicity, spectral envelope, temporal fine structure or spatial location can be jointly exploited by this toolkit. We subsequently present the uncertainty decoding (UD) framework for the integration of audio source separation and audio content retrieval. We show how the uncertainty about the separated source signals can be accurately estimated and propagated to the features. Finally, we explain how this uncertainty can be efficiently exploited by a classifier, both at the training and the decoding stage. We illustrate the resulting performance improvements in terms of speech separation quality and speaker recognition accuracy.
Enhanced digital library system that supports sustainable knowledge

African Journals Online (AJOL)

Enhanced digital library system that supports sustainable knowledge: A focus ... This research work provides a Web-Based University library, ability to access the ... and generates pins to authorize bonafide students and staff of the University.
PULSE MODULATION POWER AMPLIFIER WITH ENHANCED CASCADE CONTROL METHOD

DEFF Research Database (Denmark)

1998-01-01

a single local feedback path A (7) with a lowpass characteristic and local forward blocks B¿1? or B (3, 4). The leads to a much improved system with a very low sensitivity to errors in the switching power stage. In the second preferred embodiment of the invention the control structure is extended...... and feedback path A to determine stable self-oscillating conditions. An implemented 250W example MECC digital power amplifier has proven superior performance in terms of audio performance (0.005 % distortion, 115 dB dynamic range) and efficiency (92 %).......A digital switching power amplifier with Multivariable Enhanced Cascade Controlled (MECC) includes a modulator, a switching power stage and a low pass filter. In the first preferred embodiment an enhanced cascade control structure local to the switching power stage is added, characterised by having...
Roundtable Audio Discussion

Directory of Open Access Journals (Sweden)

Chris Bigum

2007-01-01

Full Text Available RoundTable on Technology, Teaching and Tools. This is a roundtable audio interview conducted by James Farmer, founder of Edublogs, with Anne Bartlett-Bragg (University of Technology Sydney and Chris Bigum (Deakin University. Skype was used to make and record the audio conference and the resulting sound file was edited by Andrew McLauchlan.
Using Audio-Derived Affective Offset to Enhance TV Recommendation

DEFF Research Database (Denmark)

Shepstone, Sven Ewan; Tan, Zheng-Hua; Jensen, Søren Holdt

2014-01-01

. First a user's mood profile is determined using 12-class audio-based emotion classifications . An initial TV content item is then displayed to the user based on the extracted mood profile. The user has the option to either accept the recommendation, or to critique the item once or several times......, by navigating the emotion space to request an alternative match. The final match is then compared to the initial match, in terms of the difference in the items' affective parameterization . This offset is then utilized in future recommendation sessions. The system was evaluated by eliciting three different...
Building Personal Brands with Digital Storytelling ePortfolios

Science.gov (United States)

Jones, Beata; Leverenz, Carrie

2017-01-01

Antoine de Saint-Exupery said, "If you want to build a ship, don't drum up people to collect wood and don't assign them tasks and work, but rather teach them to long for the endless immensity of the sea." This article presents a pedagogical approach for framing a digital-identity-enhancing ePortfolio that maximizes student engagement and…
Automatic processing of CERN video, audio and photo archives

International Nuclear Information System (INIS)

Kwiatek, M

2008-01-01

The digitalization of CERN audio-visual archives, a major task currently in progress, will generate over 40 TB of video, audio and photo files. Storing these files is one issue, but a far more important challenge is to provide long-time coherence of the archive and to make these files available on-line with minimum manpower investment. An infrastructure, based on standard CERN services, has been implemented, whereby master files, stored in the CERN Distributed File System (DFS), are discovered and scheduled for encoding into lightweight web formats based on predefined profiles. Changes in master files, conversion profiles or in the metadata database (read from CDS, the CERN Document Server) are automatically detected and the media re-encoded whenever necessary. The encoding processes are run on virtual servers provided on-demand by the CERN Server Self Service Centre, so that new servers can be easily configured to adapt to higher load. Finally, the generated files are made available from the CERN standard web servers with streaming implemented using Windows Media Services
Automatic processing of CERN video, audio and photo archives

Energy Technology Data Exchange (ETDEWEB)

Kwiatek, M [CERN, Geneva (Switzerland)], E-mail: Michal.Kwiatek@cem.ch

2008-07-15

The digitalization of CERN audio-visual archives, a major task currently in progress, will generate over 40 TB of video, audio and photo files. Storing these files is one issue, but a far more important challenge is to provide long-time coherence of the archive and to make these files available on-line with minimum manpower investment. An infrastructure, based on standard CERN services, has been implemented, whereby master files, stored in the CERN Distributed File System (DFS), are discovered and scheduled for encoding into lightweight web formats based on predefined profiles. Changes in master files, conversion profiles or in the metadata database (read from CDS, the CERN Document Server) are automatically detected and the media re-encoded whenever necessary. The encoding processes are run on virtual servers provided on-demand by the CERN Server Self Service Centre, so that new servers can be easily configured to adapt to higher load. Finally, the generated files are made available from the CERN standard web servers with streaming implemented using Windows Media Services.
A Bit Stream Scalable Speech/Audio Coder Combining Enhanced Regular Pulse Excitation and Parametric Coding

Directory of Open Access Journals (Sweden)

Albertus C. den Brinker

2007-01-01

Full Text Available This paper introduces a new audio and speech broadband coding technique based on the combination of a pulse excitation coder and a standardized parametric coder, namely, MPEG-4 high-quality parametric coder. After presenting a series of enhancements to regular pulse excitation (RPE to make it suitable for the modeling of broadband signals, it is shown how pulse and parametric codings complement each other and how they can be merged to yield a layered bit stream scalable coder able to operate at different points in the quality bit rate plane. The performance of the proposed coder is evaluated in a listening test. The major result is that the extra functionality of the bit stream scalability does not come at the price of a reduced performance since the coder is competitive with standardized coders (MP3, AAC, SSC.
A Bit Stream Scalable Speech/Audio Coder Combining Enhanced Regular Pulse Excitation and Parametric Coding

Science.gov (United States)

Riera-Palou, Felip; den Brinker, Albertus C.

2007-12-01

This paper introduces a new audio and speech broadband coding technique based on the combination of a pulse excitation coder and a standardized parametric coder, namely, MPEG-4 high-quality parametric coder. After presenting a series of enhancements to regular pulse excitation (RPE) to make it suitable for the modeling of broadband signals, it is shown how pulse and parametric codings complement each other and how they can be merged to yield a layered bit stream scalable coder able to operate at different points in the quality bit rate plane. The performance of the proposed coder is evaluated in a listening test. The major result is that the extra functionality of the bit stream scalability does not come at the price of a reduced performance since the coder is competitive with standardized coders (MP3, AAC, SSC).
Personal Fabrication Systems: From Bits to Atoms

Science.gov (United States)

Bull, Glen; Garofalo, Joe

2009-01-01

Media--text, images, audio, and video--underwent a transformation from analog to digital formats during the transition from the 20th to the 21st century. Digital media can easily be replicated, downloaded, revised, edited, and reposted, and the implications of this are affecting education, government, entertainment, culture, and society. The…
Digital Audiovisual Archives: Unlocking our Audio and Audiovisual ...

African Journals Online (AJOL)

... entered an exciting phase in managing our assets – the new archiving. We need to embrace these opportunities to save our collections and need to respond enthusiastically to the changing broadcast environment. We need to overcome the inefficiency of analogue and already obsolete digital content management, and
Could Audio-Described Films Benefit from Audio Introductions? An Audience Response Study

Science.gov (United States)

Romero-Fresco, Pablo; Fryer, Louise

2013-01-01

Introduction: Time constraints limit the quantity and type of information conveyed in audio description (AD) for films, in particular the cinematic aspects. Inspired by introductory notes for theatre AD, this study developed audio introductions (AIs) for "Slumdog Millionaire" and "Man on Wire." Each AI comprised 10 minutes of…
Location audio simplified capturing your audio and your audience

CERN Document Server

Miles, Dean

2014-01-01

From the basics of using camera, handheld, lavalier, and shotgun microphones to camera calibration and mixer set-ups, Location Audio Simplified unlocks the secrets to clean and clear broadcast quality audio no matter what challenges you face. Author Dean Miles applies his twenty-plus years of experience as a professional location operator to teach the skills, techniques, tips, and secrets needed to produce high-quality production sound on location. Humorous and thoroughly practical, the book covers a wide array of topics, such as:* location selection* field mixing* boo
Low-Light Image Enhancement Using Adaptive Digital Pixel Binning

Directory of Open Access Journals (Sweden)

Yoonjong Yoo

2015-06-01

Full Text Available This paper presents an image enhancement algorithm for low-light scenes in an environment with insufficient illumination. Simple amplification of intensity exhibits various undesired artifacts: noise amplification, intensity saturation, and loss of resolution. In order to enhance low-light images without undesired artifacts, a novel digital binning algorithm is proposed that considers brightness, context, noise level, and anti-saturation of a local region in the image. The proposed algorithm does not require any modification of the image sensor or additional frame-memory; it needs only two line-memories in the image signal processor (ISP. Since the proposed algorithm does not use an iterative computation, it can be easily embedded in an existing digital camera ISP pipeline containing a high-resolution image sensor.
Smartphone audio port data collection cookbook

Directory of Open Access Journals (Sweden)

Kyle Forinash

2018-06-01

Full Text Available The audio port of a smartphone is designed to send and receive audio but can be harnessed for portable, economical, and accurate data collection from a variety of sources. While smartphones have internal sensors to measure a number of physical phenomena such as acceleration, magnetism and illumination levels, measurement of other phenomena such as voltage, external temperature, or accurate timing of moving objects are excluded. The audio port cannot be only employed to sense external phenomena. It has the additional advantage of timing precision; because audio is recorded or played at a controlled rate separated from other smartphone activities, timings based on audio can be highly accurate. The following outlines unpublished details of the audio port technical elements for data collection, a general data collection recipe and an example timing application for Android devices.

Audio teleconferencing: creative use of a forgotten innovation.

Science.gov (United States)

Mather, Carey; Marlow, Annette

2012-06-01

As part of a regional School of Nursing and Midwifery's commitment to addressing recruitment and retention issues, approximately 90% of second year undergraduate student nurses undertake clinical placements at: multipurpose centres; regional or district hospitals; aged care; or community centres based in rural and remote regions within the State. The remaining 10% undertake professional experience placement in urban areas only. This placement of a large cohort of students, in low numbers in a variety of clinical settings, initiated the need to provide consistent support to both students and staff at these facilities. Subsequently the development of an audio teleconferencing model of clinical facilitation to guide student teaching and learning and to provide support to registered nurse preceptors in clinical practice was developed. This paper draws on Weimer's 'Personal Accounts of Change' approach to describe, discuss and evaluate the modifications that have occurred since the inception of this audio teleconferencing model (Weimer, 2006).
Structure Learning in Audio

DEFF Research Database (Denmark)

Nielsen, Andreas Brinch

By having information about the setting a user is in, a computer is able to make decisions proactively to facilitate tasks for the user. Two approaches are taken in this thesis to achieve more information about an audio environment. One approach is that of classifying audio, and a new approach...... investigated. A fast and computationally simple approach that compares recordings and classifies if they are from the same audio environment have been developed, and shows very high accuracy and the ability to synchronize recordings in the case of recording devices which are not connected. A more general model...
Digital-Networked Images as Personal Acts of Political Expression: New Categories for Meaning Formation

Directory of Open Access Journals (Sweden)

Mona Kasra

2017-12-01

Full Text Available This article examines the growing use of digital-networked images, specifically online self-portraits or “selfies”, as deliberate and personal acts of political expression and the ways in which meaning evolves and expands from their presence on the Internet. To understand the role of digital-networked images as a site for engaging in a personal and connective “visual” action that leads to formation of transient communities, the author analyzes the nude self-portrait of the young Egyptian woman Aliaa Magda Elmahdy, which during the Egyptian uprisings in 2011 drew attention across social media. As an object of analysis this image is a prime example of the use of digital-networked images in temporally intentional distribution, and as an instance of political enactment unique to this era. This article also explains the concept of participatory narratives as an ongoing process of meaning formation in the digital-networked image, shaped by the fluidity of the multiple and immediate textual narratives, visual derivatives, re-appropriation, and remixes contributed by other interested viewers. The online circulation of digital-networked images in fact culminates in a flow of ever-changing and overarching narratives, broadening the contextual scope around which images are traditionally viewed.
Digital Learning As Enhanced Learning Processing? Cognitive Evidence for New insight of Smart Learning.

Science.gov (United States)

Di Giacomo, Dina; Ranieri, Jessica; Lacasa, Pilar

2017-01-01

Large use of technology improved quality of life across aging and favoring the development of digital skills. Digital skills can be considered an enhancing to human cognitive activities. New research trend is about the impact of the technology in the elaboration information processing of the children. We wanted to analyze the influence of technology in early age evaluating the impact on cognition. We investigated the performance of a sample composed of n. 191 children in school age distributed in two groups as users: high digital users and low digital users. We measured the verbal and visuoperceptual cognitive performance of children by n. 8 standardized psychological tests and ad hoc self-report questionnaire. Results have evidenced the influence of digital exposition on cognitive development: the cognitive performance is looked enhanced and better developed: high digital users performed better in naming, semantic, visual memory and logical reasoning tasks. Our finding confirms the data present in literature and suggests the strong impact of the technology using not only in the social, educational and quality of life of the people, but also it outlines the functionality and the effect of the digital exposition in early age; increased cognitive abilities of the children tailor digital skilled generation with enhanced cognitive processing toward to smart learning.
“What Goes Around Comes Around”: Lessons Learned from Economic Evaluations of Personalized Medicine Applied to Digital Medicine

Science.gov (United States)

Phillips, Kathryn A.; Douglas, Michael P.; Trosman, Julia R.; Marshall, Deborah A.

2016-01-01

Two key trends that emerge from the growth of “Big Data” and the emphasis on patient-centered healthcare are the increasing use of personalized medicine and digital medicine. In order for these technologies to move into mainstream health care and be reimbursed by insurers, it will be essential to have evidence that their benefits provide reasonable value relative to their costs. However, these technologies have complex characteristics that present challenges to assessment of their economic value. Previous work has identified these challenges for personalized medicine and thus this work can inform the more nascent topic of digital medicine. Our objective is to examine the methodological challenges and future opportunities for assessing the economic value of digital medicine, using personalized medicine as a comparison. We focus specifically on “digital biomarker technologies” and “multigene tests”. We identified similarities in these technologies that can present challenges to economic evaluation: multiple results, results with different types of utilities, secondary findings, downstream impact (including on family members), and interactive effects. Using a structured review, we found that there are few economic evaluations of digital biomarker technologies, with limited results. We conclude that more evidence on effectiveness of digital medicine will be needed but that the experiences with personalized medicine can inform what data will be needed and how such analyses can be conducted. Our study points out the critical need for typologies and terminology for digital medicine technologies that would enable them to be classified in ways that will facilitate research on their effectiveness and value. PMID:28212968
Digital Twins in Health Care: Ethical Implications of an Emerging Engineering Paradigm.

Science.gov (United States)

Bruynseels, Koen; Santoni de Sio, Filippo; van den Hoven, Jeroen

2018-01-01

Personalized medicine uses fine grained information on individual persons, to pinpoint deviations from the normal. 'Digital Twins' in engineering provide a conceptual framework to analyze these emerging data-driven health care practices, as well as their conceptual and ethical implications for therapy, preventative care and human enhancement. Digital Twins stand for a specific engineering paradigm, where individual physical artifacts are paired with digital models that dynamically reflects the status of those artifacts. When applied to persons, Digital Twins are an emerging technology that builds on in silico representations of an individual that dynamically reflect molecular status, physiological status and life style over time. We use Digital Twins as the hypothesis that one would be in the possession of very detailed bio-physical and lifestyle information of a person over time. This perspective redefines the concept of 'normality' or 'health,' as a set of patterns that are regular for a particular individual , against the backdrop of patterns observed in the population. This perspective also will impact what is considered therapy and what is enhancement, as can be illustrated with the cases of the 'asymptomatic ill' and life extension via anti-aging medicine. These changes are the consequence of how meaning is derived, in case measurement data is available. Moral distinctions namely may be based on patterns found in these data and the meanings that are grafted on these patterns. Ethical and societal implications of Digital Twins are explored. Digital Twins imply a data-driven approach to health care. This approach has the potential to deliver significant societal benefits, and can function as a social equalizer, by allowing for effective equalizing enhancement interventions. It can as well though be a driver for inequality, given the fact that a Digital Twin might not be an accessible technology for everyone, and given the fact that patterns identified across a
ADC testing using digital stimuli

NARCIS (Netherlands)

Sheng, Xiaoqin

2014-01-01

The Analogue-to-Digital Converter (ADC) is one of the most typical and widely used mixed-signal circuits. They are applied in video, audio, high-speed communications systems and so on. Many ADCs are integrated into platform-based designs, the architecture which normally contains of standard blocks
A centralized audio presentation manager

Energy Technology Data Exchange (ETDEWEB)

Papp, A.L. III; Blattner, M.M.

1994-05-16

The centralized audio presentation manager addresses the problems which occur when multiple programs running simultaneously attempt to use the audio output of a computer system. Time dependence of sound means that certain auditory messages must be scheduled simultaneously, which can lead to perceptual problems due to psychoacoustic phenomena. Furthermore, the combination of speech and nonspeech audio is examined; each presents its own problems of perceptibility in an acoustic environment composed of multiple auditory streams. The centralized audio presentation manager receives abstract parameterized message requests from the currently running programs, and attempts to create and present a sonic representation in the most perceptible manner through the use of a theoretically and empirically designed rule set.
Instrumental Landing Using Audio Indication

Science.gov (United States)

Burlak, E. A.; Nabatchikov, A. M.; Korsun, O. N.

2018-02-01

The paper proposes an audio indication method for presenting to a pilot the information regarding the relative positions of an aircraft in the tasks of precision piloting. The implementation of the method is presented, the use of such parameters of audio signal as loudness, frequency and modulation are discussed. To confirm the operability of the audio indication channel the experiments using modern aircraft simulation facility were carried out. The simulated performed the instrument landing using the proposed audio method to indicate the aircraft deviations in relation to the slide path. The results proved compatible with the simulated instrumental landings using the traditional glidescope pointers. It inspires to develop the method in order to solve other precision piloting tasks.
Bit rates in audio source coding

NARCIS (Netherlands)

Veldhuis, Raymond N.J.

1992-01-01

The goal is to introduce and solve the audio coding optimization problem. Psychoacoustic results such as masking and excitation pattern models are combined with results from rate distortion theory to formulate the audio coding optimization problem. The solution of the audio optimization problem is a
Extraction Of Electronic Evidence From VoIP: Identification & Analysis Of Digital Speech

Directory of Open Access Journals (Sweden)

David Irwin

2012-09-01

Full Text Available The Voice over Internet Protocol (VoIP is increasing in popularity as a cost effective and efficient means of making telephone calls via the Internet. However, VoIP may also be an attractive method of communication to criminals as their true identity may be hidden and voice and video communications are encrypted as they are deployed across the Internet. This produces in a new set of challenges for forensic analysts compared with traditional wire-tapping of the Public Switched Telephone Network (PSTN infrastructure, which is not applicable to VoIP. Therefore, other methods of recovering electronic evidence from VoIP are required.Â This research investigates the analysis and recovery of digitised human, which persists in computer memory after a VoIP call.This paper proposes a proof of concept how remnants of digitised human speech from a VoIP call may be identified within a forensic memory capture based on how the human voice is detected via a microphone and encoded to a digital format using the sound card of your personal computer. This digital format is unencrypted whist processed in Random Access Memory (RAM before it is passed to the VoIP application for encryption and Â transmission over the Internet. Similarly, an incoming encrypted VoIP call is decrypted by the VoIP application and passes through RAM unencrypted in order to be played via the speaker output.A series of controlled tests were undertaken whereby RAM captures were analysed for remnants of digital speech after a VoIP audio call with known conversation. The identification and analysis of digital speech from RAM attempts to construct an automatic process for the identification and subsequent reconstruction of the audio content of a VoIP call.
Implementing Audio-CASI on Windows’ Platforms

Science.gov (United States)

Cooley, Philip C.; Turner, Charles F.

2011-01-01

Audio computer-assisted self interviewing (Audio-CASI) technologies have recently been shown to provide important and sometimes dramatic improvements in the quality of survey measurements. This is particularly true for measurements requiring respondents to divulge highly sensitive information such as their sexual, drug use, or other sensitive behaviors. However, DOS-based Audio-CASI systems that were designed and adopted in the early 1990s have important limitations. Most salient is the poor control they provide for manipulating the video presentation of survey questions. This article reports our experiences adapting Audio-CASI to Microsoft Windows 3.1 and Windows 95 platforms. Overall, our Windows-based system provided the desired control over video presentation and afforded other advantages including compatibility with a much wider array of audio devices than our DOS-based Audio-CASI technologies. These advantages came at the cost of increased system requirements --including the need for both more RAM and larger hard disks. While these costs will be an issue for organizations converting large inventories of PCS to Windows Audio-CASI today, this will not be a serious constraint for organizations and individuals with small inventories of machines to upgrade or those purchasing new machines today. PMID:22081743
Meshing the Personal with the Professional: Digital Storytelling in Higher Education

Directory of Open Access Journals (Sweden)

Mary F. Wright

2010-11-01

Full Text Available This paper chronicles a yearlong journey of learning about digital storytelling and leading the creation of five digital stories within a higher education community. We bring two complementary perspectives to guide this inquiry: as a faculty member in teacher education and as the University of Wisconsin system representative for the Learning Technology Development Council as well as director of our educational technology center. Our passion for the arts, aesthetics and education bring us to extend an inquiry into teacher identity and reflection by connecting our colleagues’ stories with the art of digital storytelling. We see its place and value in an academic environment; although not always currently clear, the roots of personal insight permeate the lives of professionals within the academy. Digital storytelling spans the artificial divide between the experiences of the past and our professional identities. The myriad uses of digital storytelling in higher education are explored as a reflective tool for practice, to highlight academic projects, interests or initiatives, and most importantly, to simply reflect on how we are shaped by the stories we live and how we in turn share our diverse identities.
Automatic Detection and Classification of Audio Events for Road Surveillance Applications

Directory of Open Access Journals (Sweden)

Noor Almaadeed

2018-06-01

Full Text Available This work investigates the problem of detecting hazardous events on roads by designing an audio surveillance system that automatically detects perilous situations such as car crashes and tire skidding. In recent years, research has shown several visual surveillance systems that have been proposed for road monitoring to detect accidents with an aim to improve safety procedures in emergency cases. However, the visual information alone cannot detect certain events such as car crashes and tire skidding, especially under adverse and visually cluttered weather conditions such as snowfall, rain, and fog. Consequently, the incorporation of microphones and audio event detectors based on audio processing can significantly enhance the detection accuracy of such surveillance systems. This paper proposes to combine time-domain, frequency-domain, and joint time-frequency features extracted from a class of quadratic time-frequency distributions (QTFDs to detect events on roads through audio analysis and processing. Experiments were carried out using a publicly available dataset. The experimental results conform the effectiveness of the proposed approach for detecting hazardous events on roads as demonstrated by 7% improvement of accuracy rate when compared against methods that use individual temporal and spectral features.
Professional Caregivers' Perceptions on how Persons with Mild Dementia Might Experience the Usage of a Digital Photo Diary.

Science.gov (United States)

Harrefors, Christina; Sävenstedt, Stefan; Lundquist, Anders; Lundquist, Bengt; Axelsson, Karin

2012-01-01

Cognitive impairments influence the possibility of persons with dementia to remember daily events and maintain a sense of self. In order to address these problems a digital photo diary was developed to capture information about events in daily life. The device consisted of a wearable digital camera, smart phone with Global Positioning System (GPS) and a home memory station with computer for uploading the photographs and touch screen. The aim of this study was to describe professional caregiver's perceptions on how persons with mild dementia might experience the usage of this digital photo diary from both a situation when wearing the camera and a situation when viewing the uploaded photos, through a questionnaire with 408 respondents. In order to catch the professional caregivers' perceptions a questionnaire with the semantic differential technique was used and the main question was "How do you think Hilda (the fictive person in the questionnaire) feels when she is using the digital photo diary?". The factor analysis revealed three factors; Sense of autonomy, Sense of self-esteem and Sense of trust. An interesting conclusion that can be drawn is that professional caregivers had an overall positive view of the usage of digital photo diary as supporting autonomy for persons with mild dementia. The meaningfulness of each situation when wearing the camera and viewing the uploaded pictures to be used in two different situations and a part of an integrated assistive device has to be considered separately. Individual needs and desires of the person who is living with dementia and the context of each individual has to be reflected on and taken into account before implementing assistive digital devices as a tool in care.
Effect of Nicotine on Audio and Visual Reaction Time in Dipping ...

African Journals Online (AJOL)

Nicotine through blood is harmful and as there are fewer studies in India with respect to nicotines influence on reaction time especially in the smokeless tobacco users we studied this. Reaction time is a measure of the sensorimotor integration in a person. We used a PC 1000 Hz reaction timer to record the audio and visual ...
Mining Contextual Information for Ephemeral Digital Video Preservation

OpenAIRE

Shah, Chirag

2009-01-01

For centuries the archival community has understood and practiced the art of adding contextual information while preserving an artifact. The question now is how these practices can be transferred to the digital domain. With the growing expansion of production and consumption of digital objects (documents, audio, video, etc.) it has become essential to identify and study issues related to their representation. A curator in the digital realm may be said to have the same responsibilities as on...
Computerized J-H loop tracer for soft magnetic thick films in the audio frequency range

Directory of Open Access Journals (Sweden)

Loizos G.

2014-07-01

Full Text Available A computerized J-H loop tracer for soft magnetic thick films in the audio frequency range is described. It is a system built on a PXI platform combining PXI modules for control signal generation and data acquisition. The physiscal signals are digitized and the respective data strems are processed, presented and recorded in LabVIEW 7.0.
Audio wiring guide how to wire the most popular audio and video connectors

CERN Document Server

Hechtman, John

2012-01-01

Whether you're a pro or an amateur, a musician or into multimedia, you can't afford to guess about audio wiring. The Audio Wiring Guide is a comprehensive, easy-to-use guide that explains exactly what you need to know. No matter the size of your wiring project or installation, this handy tool provides you with the essential information you need and the techniques to use it. Using The Audio Wiring Guide is like having an expert at your side. By following the clear, step-by-step directions, you can do professional-level work at a fraction of the cost.
Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content

DEFF Research Database (Denmark)

Shepstone, Sven Ewan; Tan, Zheng-Hua; Jensen, Søren Holdt

2013-01-01

Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age...... and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm...... profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies...

Comparative evaluation of audio and audio - tactile methods to improve oral hygiene status of visually impaired school children

OpenAIRE

R Krishnakumar; Swarna Swathi Silla; Sugumaran K Durai; Mohan Govindarajan; Syed Shaheed Ahamed; Logeshwari Mathivanan

2016-01-01

Background: Visually impaired children are unable to maintain good oral hygiene, as their tactile abilities are often underdeveloped owing to their visual disturbances. Conventional brushing techniques are often poorly comprehended by these children and hence, it was decided to evaluate the effectiveness of audio and audio-tactile methods in improving the oral hygiene of these children. Objective: To evaluate and compare the effectiveness of audio and audio-tactile methods in improving oral h...
Why Don’t Physicians Use Their Personal Digital Assistants?

OpenAIRE

Lu, Yen-Chiao; Lee, Jin (Janet) Kyung; Xiao, Yan; Sears, Andrew; Jacko, Julie A.; Charters, Kathleen

2003-01-01

As the Personal Digital Assistant (PDA) user population continues to expand, there is a need to design more useful devices and applications to facilitate the utilization of PDAs. We conducted a structured interview study to examine PDA usage and non-usage patterns among physicians. The purpose of this descriptive study was to identify the barriers that impede physicians in their PDA use. A data collection tool was developed to record: 1) how physicians use their PDAs, 2) functions and applica...
Audio Frequency Analysis in Mobile Phones

Science.gov (United States)

Aguilar, Horacio Munguía

2016-01-01

A new experiment using mobile phones is proposed in which its audio frequency response is analyzed using the audio port for inputting external signal and getting a measurable output. This experiment shows how the limited audio bandwidth used in mobile telephony is the main cause of the poor speech quality in this service. A brief discussion is…
[Intermodal timing cues for audio-visual speech recognition].

Science.gov (United States)

Hashimoto, Masahiro; Kumashiro, Masaharu

2004-06-01

The purpose of this study was to investigate the limitations of lip-reading advantages for Japanese young adults by desynchronizing visual and auditory information in speech. In the experiment, audio-visual speech stimuli were presented under the six test conditions: audio-alone, and audio-visually with either 0, 60, 120, 240 or 480 ms of audio delay. The stimuli were the video recordings of a face of a female Japanese speaking long and short Japanese sentences. The intelligibility of the audio-visual stimuli was measured as a function of audio delays in sixteen untrained young subjects. Speech intelligibility under the audio-delay condition of less than 120 ms was significantly better than that under the audio-alone condition. On the other hand, the delay of 120 ms corresponded to the mean mora duration measured for the audio stimuli. The results implied that audio delays of up to 120 ms would not disrupt lip-reading advantage, because visual and auditory information in speech seemed to be integrated on a syllabic time scale. Potential applications of this research include noisy workplace in which a worker must extract relevant speech from all the other competing noises.
Presence and the utility of audio spatialization

DEFF Research Database (Denmark)

Bormann, Karsten

2005-01-01

The primary concern of this paper is whether the utility of audio spatialization, as opposed to the fidelity of audio spatialization, impacts presence. An experiment is reported that investigates the presence-performance relationship by decoupling spatial audio fidelity (realism) from task...... performance by varying the spatial fidelity of the audio independently of its relevance to performance on the search task that subjects were to perform. This was achieved by having conditions in which subjects searched for a music-playing radio (an active sound source) and having conditions in which...... supplied only nonattenuated audio was detrimental to performance. Even so, this group of subjects consistently had the largest increase in presence scores over the baseline experiment. Further, the Witmer and Singer (1998) presence questionnaire was more sensitive to whether the audio source was active...
Modified BTC Algorithm for Audio Signal Coding

Directory of Open Access Journals (Sweden)

TOMIC, S.

2016-11-01

Full Text Available This paper describes modification of a well-known image coding algorithm, named Block Truncation Coding (BTC and its application in audio signal coding. BTC algorithm was originally designed for black and white image coding. Since black and white images and audio signals have different statistical characteristics, the application of this image coding algorithm to audio signal presents a novelty and a challenge. Several implementation modifications are described in this paper, while the original idea of the algorithm is preserved. The main modifications are performed in the area of signal quantization, by designing more adequate quantizers for audio signal processing. The result is a novel audio coding algorithm, whose performance is presented and analyzed in this research. The performance analysis indicates that this novel algorithm can be successfully applied in audio signal coding.
Knitting Relational Documentary Networks: The Database Meta-Documentary Filming Revolution as a paradigm of bringing interactive audio-visual archives alive

NARCIS (Netherlands)

Wiehl, Anna

2016-01-01

abstractOne phenomenon in the emerging field of digital documentary are experiments with rhizomatic interfaces and database-logics to bring audio-visual archives 'alive'. A paradigm hereof is Filming Revolution (2015), an interactive platform which gathers and interlinks films of the uprisings in
The end of ownership personal property in the digital economy

CERN Document Server

Perzanowski, Aaron

2016-01-01

If you buy a book at the bookstore, you own it. You can take it home, scribble in the margins, put in on the shelf, lend it to a friend, sell it at a garage sale. But is the same thing true for the ebooks or other digital goods you buy? Retailers and copyright holders argue that you don't own those purchases, you merely license them. That means your ebook vendor can delete the book from your device without warning or explanation -- as Amazon deleted Orwell's "1984" from the Kindles of surprised readers several years ago. These readers thought they owned their copies of "1984." Until, it turned out, they didn't. In "The End of Ownership," Aaron Perzanowski and Jason Schultz explore how notions of ownership have shifted in the digital marketplace, and make an argument for the benefits of personal property. Of course, ebooks, cloud storage, streaming, and other digital goods offer users convenience and flexibility. But, Perzanowski and Schultz warn, consumers should be aware of the tradeoffs involving user cons...
Digital Signal Processing for In-Vehicle Systems and Safety

CERN Document Server

Boyraz, Pinar; Takeda, Kazuya; Abut, Hüseyin

2012-01-01

Compiled from papers of the 4th Biennial Workshop on DSP (Digital Signal Processing) for In-Vehicle Systems and Safety this edited collection features world-class experts from diverse fields focusing on integrating smart in-vehicle systems with human factors to enhance safety in automobiles. Digital Signal Processing for In-Vehicle Systems and Safety presents new approaches on how to reduce driver inattention and prevent road accidents. The material addresses DSP technologies in adaptive automobiles, in-vehicle dialogue systems, human machine interfaces, video and audio processing, and in-vehicle speech systems. The volume also features: Recent advances in Smart-Car technology – vehicles that take into account and conform to the driver Driver-vehicle interfaces that take into account the driving task and cognitive load of the driver Best practices for In-Vehicle Corpus Development and distribution Information on multi-sensor analysis and fusion techniques for robust driver monitoring and driver recognition ...
Robust audio-visual speech recognition under noisy audio-video conditions.

Science.gov (United States)

Stewart, Darryl; Seymour, Rowan; Pass, Adrian; Ming, Ji

2014-02-01

This paper presents the maximum weighted stream posterior (MWSP) model as a robust and efficient stream integration method for audio-visual speech recognition in environments, where the audio or video streams may be subjected to unknown and time-varying corruption. A significant advantage of MWSP is that it does not require any specific measurements of the signal in either stream to calculate appropriate stream weights during recognition, and as such it is modality-independent. This also means that MWSP complements and can be used alongside many of the other approaches that have been proposed in the literature for this problem. For evaluation we used the large XM2VTS database for speaker-independent audio-visual speech recognition. The extensive tests include both clean and corrupted utterances with corruption added in either/both the video and audio streams using a variety of types (e.g., MPEG-4 video compression) and levels of noise. The experiments show that this approach gives excellent performance in comparison to another well-known dynamic stream weighting approach and also compared to any fixed-weighted integration approach in both clean conditions or when noise is added to either stream. Furthermore, our experiments show that the MWSP approach dynamically selects suitable integration weights on a frame-by-frame basis according to the level of noise in the streams and also according to the naturally fluctuating relative reliability of the modalities even in clean conditions. The MWSP approach is shown to maintain robust recognition performance in all tested conditions, while requiring no prior knowledge about the type or level of noise.
Audio scene segmentation for video with generic content

Science.gov (United States)

Niu, Feng; Goela, Naveen; Divakaran, Ajay; Abdel-Mottaleb, Mohamed

2008-01-01

In this paper, we present a content-adaptive audio texture based method to segment video into audio scenes. The audio scene is modeled as a semantically consistent chunk of audio data. Our algorithm is based on "semantic audio texture analysis." At first, we train GMM models for basic audio classes such as speech, music, etc. Then we define the semantic audio texture based on those classes. We study and present two types of scene changes, those corresponding to an overall audio texture change and those corresponding to a special "transition marker" used by the content creator, such as a short stretch of music in a sitcom or silence in dramatic content. Unlike prior work using genre specific heuristics, such as some methods presented for detecting commercials, we adaptively find out if such special transition markers are being used and if so, which of the base classes are being used as markers without any prior knowledge about the content. Our experimental results show that our proposed audio scene segmentation works well across a wide variety of broadcast content genres.
Measuring 3D Audio Localization Performance and Speech Quality of Conferencing Calls for a Multiparty Communication System

Directory of Open Access Journals (Sweden)

Mansoor Hyder

2013-07-01

Full Text Available Communication systems which support 3D (Three Dimensional audio offer a couple of advantages to the users/customers. Firstly, within the virtual acoustic environments all participants could easily be recognized through their placement/sitting positions. Secondly, all participants can turn their focus on any particular talker when multiple participants start talking at the same time by taking advantage of the natural listening tendency which is called the Cocktail Party Effect. On the other hand, 3D audio is known as a decreasing factor for overall speech quality because of the commencement of reverberations and echoes within the listening environment. In this article, we study the tradeoff between speech quality and human natural ability of localizing audio events/or talkers within our three dimensional audio supported telephony and teleconferencing solution. Further, we performed subjective user studies by incorporating two different HRTFs (Head Related Transfer Functions, different placements of the teleconferencing participants and different layouts of the virtual environments. Moreover, subjective user studies results for audio event localization and subjective speech quality are presented in this article. This subjective user study would help the research community to optimize the existing 3D audio systems and to design new 3D audio supported teleconferencing solutions based on the quality of experience requirements of the users/customers for agriculture personal in particular and for all potential users in general.
Measuring 3D Audio Localization Performance and Speech Quality of Conferencing Calls for a Multiparty Communication System

International Nuclear Information System (INIS)

Hyder, M.; Menghwar, G.D.; Qureshi, A.

2013-01-01

Communication systems which support 3D (Three Dimensional) audio offer a couple of advantages to the users/customers. Firstly, within the virtual acoustic environments all participants could easily be recognized through their placement/sitting positions. Secondly, all participants can turn their focus on any particular talker when multiple participants start talking at the same time by taking advantage of the natural listening tendency which is called the Cocktail Party Effect. On the other hand, 3D audio is known as a decreasing factor for overall speech quality because of the commencement of reverberations and echoes within the listening environment. In this article, we study the tradeoff between speech quality and human natural ability of localizing audio events/or talkers within our three dimensional audio supported telephony and teleconferencing solution. Further, we performed subjective user studies by incorporating two different HRTFs (Head Related Transfer Functions), different placements of the teleconferencing participants and different layouts of the virtual environments. Moreover, subjective user studies results for audio event localization and subjective speech quality are presented in this article. This subjective user study would help the research community to optimize the existing 3D audio systems and to design new 3D audio supported teleconferencing solutions based on the quality of experience requirements of the users/customers for agriculture personal in particular and for all potential users in general. (author)
Audience Response Made Easy: Using Personal Digital Assistants as a Classroom Polling Tool

Science.gov (United States)

Menon, Anil S.; Moffett, Shannon; Enriquez, Melissa; Martinez, Miriam M.; Dev, Parvati; Grappone, Todd

2004-01-01

Both teachers and students benefit from an interactive classroom. The teacher receives valuable input about effectiveness, student interest, and comprehension, whereas student participation, active learning, and enjoyment of the class are enhanced. Cost and deployment have limited the use of existing audience response systems, allowing anonymous linking of teachers and students in the classroom. These limitations can be circumvented, however, by use of personal digital assistants (PDAs), which are cheaper and widely used by students. In this study, the authors equipped a summer histology class of 12 students with PDAs and wireless Bluetooth cards to allow access to a central server. Teachers displayed questions in multiple-choice format as a Web page on the server and students responded with their PDAs, a process referred to as polling. Responses were immediately compiled, analyzed, and displayed. End-of-class survey results indicated that students were enthusiastic about the polling tool. The surveys also provided technical feedback that will be valuable in streamlining future trials. PMID:14764615
Audio computer-assisted self interview compared to traditional interview in an HIV-related behavioral survey in Vietnam.

Science.gov (United States)

Le, Linh Cu; Vu, Lan T H

2012-10-01

Globally, population surveys on HIV/AIDS and other sensitive topics have been using audio computer-assisted self interview for many years. This interview technique, however, is still new to Vietnam and little is known about its application and impact in general population surveys. One plausible hypothesis is that residents of Vietnam interviewed using this technique may provide a higher response rate and be more willing to reveal their true behaviors than if interviewed with traditional methods. This study aims to compare audio computer-assisted self interview with traditional face-to-face personal interview and self-administered interview with regard to rates of refusal and affirmative responses to questions on sensitive topics related to HIV/AIDS. In June 2010, a randomized study was conducted in three cities (Ha Noi, Da Nan and Can Tho), using a sample of 4049 residents aged 15 to 49 years. Respondents were randomly assigned to one of three interviewing methods: audio computer-assisted self interview, personal face-to-face interview, and self-administered paper interview. Instead of providing answers directly to interviewer questions as with traditional methods, audio computer-assisted self-interview respondents read the questions displayed on a laptop screen, while listening to the questions through audio headphones, then entered responses using a laptop keyboard. A MySQL database was used for data management and SPSS statistical package version 18 used for data analysis with bivariate and multivariate statistical techniques. Rates of high risk behaviors and mean values of continuous variables were compared for the three data collection methods. Audio computer-assisted self interview showed advantages over comparison techniques, achieving lower refusal rates and reporting higher prevalence of some sensitive and risk behaviors (perhaps indication of more truthful answers). Premarital sex was reported by 20.4% in the audio computer-assisted self-interview survey
[Digital learning and teaching in medical education : Already there or still at the beginning?

Science.gov (United States)

Kuhn, Sebastian; Frankenhauser, Susanne; Tolks, Daniel

2018-02-01

The current choice of digital teaching and learning formats in medicine is very heterogeneous. In addition to the widely used classical static formats, social communication tools, audio/video-based media, interactive formats, and electronic testing systems enrich the learning environment.For medical students, the private use of digital media is not necessarily linked to their meaningful use in the study. Many gain their experience of digital learning in the sense of "assessment drives learning", especially by taking online exams in a passive, consuming role. About half of all medical students can be referred to as "e-examinees" whose handling of digital learning is primarily focused on online exam preparation. Essentially, they do not actively influence their digital environment. Only a quarter can be identified as a "digital all-rounder", who compiles their individual learning portfolio from the broad range of digital media.At present, the use of digital media is not yet an integral and comprehensive component of the teaching framework of medical studies in Germany, but is rather used in the sense of a punctual teaching enrichment. Current trends in digital teaching and learning offerings are mobile, interactive, and personalized platforms as well as increasing the relevance of learning platforms. Furthermore, didactical concepts targeting the changed learning habits of the students are more successful regarding the acceptance and learning outcomes. In addition, digitalization is currently gaining importance as a component in the medical school curricula.
Web Audio/Video Streaming Tool

Science.gov (United States)

Guruvadoo, Eranna K.

2003-01-01

In order to promote NASA-wide educational outreach program to educate and inform the public of space exploration, NASA, at Kennedy Space Center, is seeking efficient ways to add more contents to the web by streaming audio/video files. This project proposes a high level overview of a framework for the creation, management, and scheduling of audio/video assets over the web. To support short-term goals, the prototype of a web-based tool is designed and demonstrated to automate the process of streaming audio/video files. The tool provides web-enabled users interfaces to manage video assets, create publishable schedules of video assets for streaming, and schedule the streaming events. These operations are performed on user-defined and system-derived metadata of audio/video assets stored in a relational database while the assets reside on separate repository. The prototype tool is designed using ColdFusion 5.0.
Semantic Context Detection Using Audio Event Fusion

Directory of Open Access Journals (Sweden)

Cheng Wen-Huang

2006-01-01

Full Text Available Semantic-level content analysis is a crucial issue in achieving efficient content retrieval and management. We propose a hierarchical approach that models audio events over a time series in order to accomplish semantic context detection. Two levels of modeling, audio event and semantic context modeling, are devised to bridge the gap between physical audio features and semantic concepts. In this work, hidden Markov models (HMMs are used to model four representative audio events, that is, gunshot, explosion, engine, and car braking, in action movies. At the semantic context level, generative (ergodic hidden Markov model and discriminative (support vector machine (SVM approaches are investigated to fuse the characteristics and correlations among audio events, which provide cues for detecting gunplay and car-chasing scenes. The experimental results demonstrate the effectiveness of the proposed approaches and provide a preliminary framework for information mining by using audio characteristics.
Digital Twins in Health Care: Ethical Implications of an Emerging Engineering Paradigm

Directory of Open Access Journals (Sweden)

Koen Bruynseels

2018-02-01

Full Text Available Personalized medicine uses fine grained information on individual persons, to pinpoint deviations from the normal. ‘Digital Twins’ in engineering provide a conceptual framework to analyze these emerging data-driven health care practices, as well as their conceptual and ethical implications for therapy, preventative care and human enhancement. Digital Twins stand for a specific engineering paradigm, where individual physical artifacts are paired with digital models that dynamically reflects the status of those artifacts. When applied to persons, Digital Twins are an emerging technology that builds on in silico representations of an individual that dynamically reflect molecular status, physiological status and life style over time. We use Digital Twins as the hypothesis that one would be in the possession of very detailed bio-physical and lifestyle information of a person over time. This perspective redefines the concept of ‘normality’ or ‘health,’ as a set of patterns that are regular for a particular individual, against the backdrop of patterns observed in the population. This perspective also will impact what is considered therapy and what is enhancement, as can be illustrated with the cases of the ‘asymptomatic ill’ and life extension via anti-aging medicine. These changes are the consequence of how meaning is derived, in case measurement data is available. Moral distinctions namely may be based on patterns found in these data and the meanings that are grafted on these patterns. Ethical and societal implications of Digital Twins are explored. Digital Twins imply a data-driven approach to health care. This approach has the potential to deliver significant societal benefits, and can function as a social equalizer, by allowing for effective equalizing enhancement interventions. It can as well though be a driver for inequality, given the fact that a Digital Twin might not be an accessible technology for everyone, and given the fact
Digital Twins in Health Care: Ethical Implications of an Emerging Engineering Paradigm

Science.gov (United States)

Bruynseels, Koen; Santoni de Sio, Filippo; van den Hoven, Jeroen

2018-01-01

Personalized medicine uses fine grained information on individual persons, to pinpoint deviations from the normal. ‘Digital Twins’ in engineering provide a conceptual framework to analyze these emerging data-driven health care practices, as well as their conceptual and ethical implications for therapy, preventative care and human enhancement. Digital Twins stand for a specific engineering paradigm, where individual physical artifacts are paired with digital models that dynamically reflects the status of those artifacts. When applied to persons, Digital Twins are an emerging technology that builds on in silico representations of an individual that dynamically reflect molecular status, physiological status and life style over time. We use Digital Twins as the hypothesis that one would be in the possession of very detailed bio-physical and lifestyle information of a person over time. This perspective redefines the concept of ‘normality’ or ‘health,’ as a set of patterns that are regular for a particular individual, against the backdrop of patterns observed in the population. This perspective also will impact what is considered therapy and what is enhancement, as can be illustrated with the cases of the ‘asymptomatic ill’ and life extension via anti-aging medicine. These changes are the consequence of how meaning is derived, in case measurement data is available. Moral distinctions namely may be based on patterns found in these data and the meanings that are grafted on these patterns. Ethical and societal implications of Digital Twins are explored. Digital Twins imply a data-driven approach to health care. This approach has the potential to deliver significant societal benefits, and can function as a social equalizer, by allowing for effective equalizing enhancement interventions. It can as well though be a driver for inequality, given the fact that a Digital Twin might not be an accessible technology for everyone, and given the fact that patterns

Distortion Estimation in Compressed Music Using Only Audio Fingerprints

NARCIS (Netherlands)

Doets, P.J.O.; Lagendijk, R.L.

2008-01-01

An audio fingerprint is a compact yet very robust representation of the perceptually relevant parts of an audio signal. It can be used for content-based audio identification, even when the audio is severely distorted. Audio compression changes the fingerprint slightly. We show that these small
Image-based surveillance and security systems using personal computers for device aiming and digital image comparison

International Nuclear Information System (INIS)

Quiett, S.; Axtell, L.H.

1987-01-01

A detection-type security system using enhanced capability cameras or other imaging devices can aid in maintaining security from long distance and/or for large areas. To do so requires that the imaging device(s) be repeatedly and accurately positioned so that no areas are overlooked. Digital control using personal computers is the simplest method of achieving positional accuracy. The monitoring of large areas and/or a large number of areas also requires that a substantial quantity of visual information be catalogued and evaluated for potential security problems. While security personnel alone are typically used for such monitoring, as the quantity of visual information increases, the likelihood that potential security threats will be missed also increases. The ability of an image-based security system to detect potential security problems can be further increased with the use of selected image processing techniques. Utilizing personal computers for both imaging device position control as well as image processing, surveillance of large areas can be performed by a limited number of individuals with a high level of system confidence
Illustration of decimation in digital signal processing (DSP) systems ...

African Journals Online (AJOL)

... and engineering, especially in the areas of communication and medicine. ... This multirate DSP had been found useful in application like digital audio, video and even GSM technology. The work is implemented using MATLABTM software.
Factors enhancing learning possibilities in digital workshops

Directory of Open Access Journals (Sweden)

Christian Kobbernagel

2014-05-01

Full Text Available This article presents a study of processes supporting student learning possibilities in digital workshops planned and held at art museums in Denmark. The investigation aims to provide insights into factors enhancing learning possibilities, including the educator’s dialogic performance, experiences of art, and perceived qualities of digital content creation processes in art museum education workshops. To address the research question of what conditional and processual factors can be said to support learning possibilities, a model was developed on the basis of fieldwork and theories of media education, art pedagogy and motivation. The model was then analyzed using structural equation modelling (SEM on data collected (N= 502 after workshops in two museums. The results suggest that the dialogic performance of museum educators, a positive art experience and positive perceptions of working with digital media are factors that strongly support student participation and reflection – although to various degrees. The findings also show that, in cases in which students are disinterested and see little value in participating during the workshop, this amotivation is likely to be lower when their art experiences and their perceptions of the media production process are positive.
Factors enhancing learning possibilities in digital workshops

Directory of Open Access Journals (Sweden)

Christian Kobbernagel

2014-06-01

Full Text Available This article presents a study of processes supporting student learning possibilities in digital workshops planned and held at art museums in Denmark. The investigation aims to provide insights into factors enhancing learning possibilities, including the educator’s dialogic performance, experiences of art, and perceived qualities of digital content creation processes in art museum education workshops. To address the research question of what conditional and processual factors can be said to support learning possibilities, a model was developed on the basis of fieldwork and theories of media education, art pedagogy and motivation. The model was then analyzed using structural equation modelling (SEM on data collected (N= 502 after workshops in two museums. The results suggest that the dialogic performance of museum educators, a positive art experience and positive perceptions of working with digital media are factors that strongly support student participation and reflection – although to various degrees. The findings also show that, in cases in which students are disinterested and see little value in participating during the workshop, this amotivation is likely to be lower when their art experiences and their perceptions of the media production process are positive.
“Wrapping” X3DOM around Web Audio API

Directory of Open Access Journals (Sweden)

Andreas Stamoulias

2015-12-01

Full Text Available Spatial sound has a conceptual role in the Web3D environments, due to highly realism scenes that can provide. Lately the efforts are concentrated on the extension of the X3D/ X3DOM through spatial sound attributes. This paper presents a novel method for the introduction of spatial sound components in the X3DOM framework, based on X3D specification and Web Audio API. The proposed method incorporates the introduction of enhanced sound nodes for X3DOM which are derived by the implementation of the X3D standard components, enriched with accessional features of Web Audio API. Moreover, several examples-scenarios developed for the evaluation of our approach. The implemented examples established the achievability of new registered nodes in X3DOM, for spatial sound characteristics in Web3D virtual worlds.
Doing What We Teach: Promoting Digital Literacies for Professional Development through Personal Learning Environments and Participation

Science.gov (United States)

Laakkonen, Ilona

2015-01-01

Despite the proliferation of social media, few learners make effective use of digital technology to support their learning or graduate with the skills necessary for developing and communicating their expertise in the knowledge-driven networked society of the digital age. This article makes use of the concept of Personal Learning Environments (PLE)…
Elicitation of attributes for the evaluation of audio-on audio-interference

DEFF Research Database (Denmark)

Francombe, Jon; Mason, R.; Dewhirst, M.

2014-01-01

procedure was used to reduce these phrases into a comprehensive set of attributes. Groups of experienced and inexperienced listeners determined nine and eight attributes, respectively. These attribute sets were combined by the listeners to produce a final set of 12 attributes: masking, calming, distraction......An experiment to determine the perceptual attributes of the experience of listening to a target audio program in the presence of an audio interferer was performed. The first stage was a free elicitation task in which a total of 572 phrases were produced. In the second stage, a consensus vocabulary...
Acceptance of online audio-visual cultural heritage archive services: a study of the general public

NARCIS (Netherlands)

Ongena, G.; van de Wijngaert, Lidwien; Huizer, E.

2013-01-01

Introduction. This study examines the antecedents of user acceptance of an audio-visual heritage archive for a wider audience (i.e., the general public) by extending the technology acceptance model with the concepts of perceived enjoyment, nostalgia proneness and personal innovativeness. Method. A
Speech enhancement on smartphone voice recording

International Nuclear Information System (INIS)

Atmaja, Bagus Tris; Farid, Mifta Nur; Arifianto, Dhany

2016-01-01

Speech enhancement is challenging task in audio signal processing to enhance the quality of targeted speech signal while suppress other noises. In the beginning, the speech enhancement algorithm growth rapidly from spectral subtraction, Wiener filtering, spectral amplitude MMSE estimator to Non-negative Matrix Factorization (NMF). Smartphone as revolutionary device now is being used in all aspect of life including journalism; personally and professionally. Although many smartphones have two microphones (main and rear) the only main microphone is widely used for voice recording. This is why the NMF algorithm widely used for this purpose of speech enhancement. This paper evaluate speech enhancement on smartphone voice recording by using some algorithms mentioned previously. We also extend the NMF algorithm to Kulback-Leibler NMF with supervised separation. The last algorithm shows improved result compared to others by spectrogram and PESQ score evaluation. (paper)
CERN automatic audio-conference service

CERN Multimedia

Sierra Moral, R

2009-01-01

Scientists from all over the world need to collaborate with CERN on a daily basis. They must be able to communicate effectively on their joint projects at any time; as a result telephone conferences have become indispensable and widely used. Managed by 6 operators, CERN already has more than 20000 hours and 5700 audio-conferences per year. However, the traditional telephone based audio-conference system needed to be modernized in three ways. Firstly, to provide the participants with more autonomy in the organization of their conferences; secondly, to eliminate the constraints of manual intervention by operators; and thirdly, to integrate the audio-conferences into a collaborative working framework. The large number, and hence cost, of the conferences prohibited externalization and so the CERN telecommunications team drew up a specification to implement a new system. It was decided to use a new commercial collaborative audio-conference solution based on the SIP protocol. The system was tested as the first Euro...
CERN automatic audio-conference service

CERN Document Server

Sierra Moral, R

2010-01-01

Scientists from all over the world need to collaborate with CERN on a daily basis. They must be able to communicate effectively on their joint projects at any time; as a result telephone conferences have become indispensable and widely used. Managed by 6 operators, CERN already has more than 20000 hours and 5700 audio-conferences per year. However, the traditional telephone based audio-conference system needed to be modernized in three ways. Firstly, to provide the participants with more autonomy in the organization of their conferences; secondly, to eliminate the constraints of manual intervention by operators; and thirdly, to integrate the audio-conferences into a collaborative working framework. The large number, and hence cost, of the conferences prohibited externalization and so the CERN telecommunications team drew up a specification to implement a new system. It was decided to use a new commercial collaborative audio-conference solution based on the SIP protocol. The system was tested as the first Euro...
Debugging of Class-D Audio Power Amplifiers

DEFF Research Database (Denmark)

Crone, Lasse; Pedersen, Jeppe Arnsdorf; Mønster, Jakob Døllner

2012-01-01

Determining and optimizing the performance of a Class-D audio power amplier can be very dicult without knowledge of the use of audio performance measuring equipment and of how the various noise and distortion sources in uence the audio performance. This paper gives an introduction on how to measure...
Design of an audio advertisement dataset

Science.gov (United States)

Fu, Yutao; Liu, Jihong; Zhang, Qi; Geng, Yuting

2015-12-01

Since more and more advertisements swarm into radios, it is necessary to establish an audio advertising dataset which could be used to analyze and classify the advertisement. A method of how to establish a complete audio advertising dataset is presented in this paper. The dataset is divided into four different kinds of advertisements. Each advertisement's sample is given in *.wav file format, and annotated with a txt file which contains its file name, sampling frequency, channel number, broadcasting time and its class. The classifying rationality of the advertisements in this dataset is proved by clustering the different advertisements based on Principal Component Analysis (PCA). The experimental results show that this audio advertisement dataset offers a reliable set of samples for correlative audio advertisement experimental studies.
Efficient Audio Power Amplification - Challenges

DEFF Research Database (Denmark)

Andersen, Michael Andreas E.

2005-01-01

For more than a decade efficient audio power amplification has evolved and today switch-mode audio power amplification in various forms are the state-of-the-art. The technical steps that lead to this evolution are described and in addition many of the challenges still to be faced and where...
Processing Digital Imagery to Enhance Perceptions of Realism

Science.gov (United States)

Woodell, Glenn A.; Jobson, Daniel J.; Rahman, Zia-ur

2003-01-01

Multi-scale retinex with color restoration (MSRCR) is a method of processing digital image data based on Edwin Land s retinex (retina + cortex) theory of human color vision. An outgrowth of basic scientific research and its application to NASA s remote-sensing mission, MSRCR is embodied in a general-purpose algorithm that greatly improves the perception of visual realism and the quantity and quality of perceived information in a digitized image. In addition, the MSRCR algorithm includes provisions for automatic corrections to accelerate and facilitate what could otherwise be a tedious image-editing process. The MSRCR algorithm has been, and is expected to continue to be, the basis for development of commercial image-enhancement software designed to extend and refine its capabilities for diverse applications.
An Introduction to Digital Rights Management Systems

NARCIS (Netherlands)

Jonker, Willem

2007-01-01

This chapter gives a concise introduction to digital rights management (DRM) systems by first presenting the basic ingredients of the architecture of DRM systems for (audio and/or video) content delivery, followed by an introduction to two open-standard DRM systems, one developed in the mobile world
The role of automated speech and audio analysis in semantic multimedia annotation

NARCIS (Netherlands)

de Jong, Franciska M.G.; Ordelman, Roeland J.F.; van Hessen, Adrianus J.

This paper overviews the various ways in which automatic speech and audio analysis can be deployed to enhance the semantic annotation of multimedia content, and as a consequence to improve the effectiveness of conceptual access tools. A number of techniques will be presented, including the alignment
Consequence of audio visual collection in school libraries

OpenAIRE

Kuri, Ramesh

2016-01-01

The collection of Audio-Visual in library plays important role in teaching and learning. The importance of audio visual (AV) technology in education should not be underestimated. If audio-visual collection in library is carefully planned and designed, it can provide a rich learning environment. In this article, an author discussed the consequences of Audio-Visual collection in libraries especially for students of school library
Digitally enhanced thin layer chromatography: further development and some applications in isotopic chemistry.

Science.gov (United States)

Manthorpe, Daniel P; Lockley, William J S

2013-09-01

Improvements to thin layer chromatography (TLC) analysis can be made easily and cheaply by the application of digital colour photography and image analysis. The combined technique, digitally enhanced TLC (DE-TLC), is applicable to the accurate quantification of analytes in mixtures, to reaction monitoring and to other typical uses of TLC. Examples are given of the application of digitally enhanced TLC to: the deuteromethylations of theophylline to [methyl-(2)H3]caffeine and of umbelliferone to [(2)H3]7-methoxycoumarin; the selection of tertiary amine bases in deuterodechlorination reactions; stoichiometry optimisation in the borodeuteride reduction of quinizarin (1,4-dihydroxyanthraquinone) and to the assessment of xanthophyll yields in Lepidium sativum seedlings grown in deuterated media. Copyright © 2013 John Wiley & Sons, Ltd.

Implementing digital technology to enhance student learning of pathology.

Science.gov (United States)

Farah, C S; Maybury, T

2009-08-01

The introduction of digital technologies into the dental curriculum is an ongoing feature of broader changes going on in tertiary education. This report examines the introduction of digital virtual microscopy technology into the curriculum of the School of Dentistry at the University of Queensland (UQ) in Brisbane, Australia. Sixty students studying a course in pathology in 2005 were introduced to virtual microscopy technology alongside the more traditional light microscope and then asked to evaluate their own learning outcomes from this technology via a structured 5-point LIKART survey. A wide variety of questions dealing the pedagogic implications of the introduction of virtual microscopy into pathology were asked of students with the overall result being that it positively enhanced their learning of pathology via digital microscopic means. The success of virtual microscopy in dentistry at UQ is then discussed in the larger context of changes going on in tertiary education. In particular, the change from the print-literate tradition to the electronic one, that is from 'literacy to electracy'. Virtual microscopy is designated as a component of this transformation to electracy. Whilst traditional microscopic skills may still be valued in dental curricula, the move to virtual microscopy and computer-assisted, student-centred learning of pathology appears to enhance the learning experience in relation to its effectiveness in helping students engage and interact with the course material.
New audio applications of beryllium metal

International Nuclear Information System (INIS)

Sato, M.

1977-01-01

The major applications of beryllium metal in the field of audio appliances are for the vibrating cones for the two types of speakers 'TWITTER' for high range sound and 'SQUAWKER' for mid range sound, and also for beryllium cantilever tube assembled in stereo cartridge. These new applications are based on the characteristic property of beryllium having high ratio of modulus of elasticity to specific gravity. The production of these audio parts is described, and the audio response is shown. (author)
Digital citizens Digital nations: the next agenda

NARCIS (Netherlands)

A.W. (Bert) Mulder; M.W. (Martijn) Hartog

2015-01-01

DIGITAL CITIZENS CREATE A DIGITAL NATION Citizens will play the lead role as they – in the next phase of the information society – collectively create a digital nation. Personal adoption of information and communication technology will create a digital infrastructure that supports individual and
Efficient audio power amplification - challenges

Energy Technology Data Exchange (ETDEWEB)

Andersen, Michael A.E.

2005-07-01

For more than a decade efficient audio power amplification has evolved and today switch-mode audio power amplification in various forms are the state-of-the-art. The technical steps that lead to this evolution are described and in addition many of the challenges still to be faced and where extensive research and development are needed is covered. (au)
Classical Music, liveness and digital technologies

DEFF Research Database (Denmark)

Steijn, Arthur

2014-01-01

. This article uses the suggestion of Philip Auslander to rethink the relationship between the mediatized and live format in order to use digital technologies to enrich and develop the live performance as a starting position. On the background of an ongoing EU funded interregional project, a series...... of interrelated design experiments are presented which all share the ambition of integration digital technologies in life performances of classical music. A particular focus is put on the ongoing development of a design concept where interactive audio and visual experiences in an underground metro station shall...
Performance Analysis of Digital loudspeaker Arrays

DEFF Research Database (Denmark)

Pedersen, Bo Rohde; Kontomichos, Fotios; Mourjopoulos, John

2008-01-01

An analysis of digital loudspeaker arrays shows that the ways in which bits are mapped to the drivers influence the quality of the audio result. Specifically, a "bit-summed" rather than the traditional "bit-mapped" strategy greatly reduces the number of times drivers make binary transitions per...... period of the input frequency. Detailed simulations compare the results for a 32-loudspeaker array with a similar configuration with analog excitation of the drivers. Ideally, drivers in digital arrays should be very small and span a small area, but that sets limits on the low-frequency response...
Automated Categorization Scheme for Digital Libraries in Distance Learning: A Pattern Recognition Approach

Science.gov (United States)

Gunal, Serkan

2008-01-01

Digital libraries play a crucial role in distance learning. Nowadays, they are one of the fundamental information sources for the students enrolled in this learning system. These libraries contain huge amount of instructional data (text, audio and video) offered by the distance learning program. Organization of the digital libraries is…
Primary School Pupils' Response to Audio-Visual Learning Process in Port-Harcourt

Science.gov (United States)

Olube, Friday K.

2015-01-01

The purpose of this study is to examine primary school children's response on the use of audio-visual learning processes--a case study of Chokhmah International Academy, Port-Harcourt (owned by Salvation Ministries). It looked at the elements that enhance pupils' response to educational television programmes and their hindrances to these…
AudioMUD: a multiuser virtual environment for blind people.

Science.gov (United States)

Sánchez, Jaime; Hassler, Tiago

2007-03-01

A number of virtual environments have been developed during the last years. Among them there are some applications for blind people based on different type of audio, from simple sounds to 3-D audio. In this study, we pursued a different approach. We designed AudioMUD by using spoken text to describe the environment, navigation, and interaction. We have also introduced some collaborative features into the interaction between blind users. The core of a multiuser MUD game is a networked textual virtual environment. We developed AudioMUD by adding some collaborative features to the basic idea of a MUD and placed a simulated virtual environment inside the human body. This paper presents the design and usability evaluation of AudioMUD. Blind learners were motivated when interacted with AudioMUD and helped to improve the interaction through audio and interface design elements.
Digital Tools to Enhance Clinical Reasoning.

Science.gov (United States)

Manesh, Reza; Dhaliwal, Gurpreet

2018-05-01

Physicians can improve their diagnostic acumen by adopting a simulation-based approach to analyzing published cases. The tight coupling of clinical problems and their solutions affords physicians the opportunity to efficiently upgrade their illness scripts (structured knowledge of a specific disease) and schemas (structured frameworks for common problems). The more times clinicians practice accessing and applying those knowledge structures through published cases, the greater the odds that they will have an enhanced approach to similar patient-cases in the future. This article highlights digital resources that increase the number of cases a clinician experiences and learns from. Copyright © 2017 Elsevier Inc. All rights reserved.
The role of contrast-enhanced digital subtraction MRI in the diagnosis of vertebral metastasic tumors

International Nuclear Information System (INIS)

Xiao Yeyu; Yang Jun; Qi Weili; Liu Qize; Hong Bikai; Wu Renhua

2008-01-01

Objective: To evaluate the contrast-enhanced digital subtraction MRI in the diagnosis of vertebral metastasic tumors. Methods 66 vertebral metastasic tumors in 43 patients were examined with conventional MRI (T 1 WI, STIR and Contrast-enhanced T 1 WI) and contrast-enhanced digital subtraction MR imaging. All lesions were histologically proved. The quantity and characteristic imaging signs (including spiculation, bull eye sign and irregular edge) of lesions were detected separately by different sequences. K independent samples test was used. Results: The detection rates of 35 vertebral metastasic tumors with vertebral morphological changes were same in all MR sequences. But in the other 31 lesions without vertebral morphological changes, the detection rates were different and STIR was the highest in all sequences. Contrast-enhanced digital subtraction MRI was more sensitive than all the conventional MR sequences in finding characteristic imaging signs with statistically significant differences. Conclusion: Contrast enhanced subtraction MRI is an useful and convenient technique which has great value in finding vertebral metastasic tumors and depicting the characteristic imaging signs. (authors)
Audio Recording of Children with Dyslalia

OpenAIRE

Stefan Gheorghe Pentiuc; Maria D. Schipor; Ovidiu A. Schipor

2008-01-01

In this paper we present our researches regarding automat parsing of audio recordings. These recordings are obtained from children with dyslalia and are necessary for an accurate identification of speech problems. We develop a software application that helps parsing audio, real time, recordings.
Parametric time-frequency domain spatial audio

CERN Document Server

Delikaris-Manias, Symeon; Politis, Archontis

2018-01-01

This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming--covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed...
Predicting the Overall Spatial Quality of Automotive Audio Systems

Science.gov (United States)

Koya, Daisuke

The spatial quality of automotive audio systems is often compromised due to their unideal listening environments. Automotive audio systems need to be developed quickly due to industry demands. A suitable perceptual model could evaluate the spatial quality of automotive audio systems with similar reliability to formal listening tests but take less time. Such a model is developed in this research project by adapting an existing model of spatial quality for automotive audio use. The requirements for the adaptation were investigated in a literature review. A perceptual model called QESTRAL was reviewed, which predicts the overall spatial quality of domestic multichannel audio systems. It was determined that automotive audio systems are likely to be impaired in terms of the spatial attributes that were not considered in developing the QESTRAL model, but metrics are available that might predict these attributes. To establish whether the QESTRAL model in its current form can accurately predict the overall spatial quality of automotive audio systems, MUSHRA listening tests using headphone auralisation with head tracking were conducted to collect results to be compared against predictions by the model. Based on guideline criteria, the model in its current form could not accurately predict the overall spatial quality of automotive audio systems. To improve prediction performance, the QESTRAL model was recalibrated and modified using existing metrics of the model, those that were proposed from the literature review, and newly developed metrics. The most important metrics for predicting the overall spatial quality of automotive audio systems included those that were interaural cross-correlation (IACC) based, relate to localisation of the frontal audio scene, and account for the perceived scene width in front of the listener. Modifying the model for automotive audio systems did not invalidate its use for domestic audio systems. The resulting model predicts the overall spatial
Editing Audio with Audacity

Directory of Open Access Journals (Sweden)

Brandon Walsh

2016-08-01

Full Text Available For those interested in audio, basic sound editing skills go a long way. Being able to handle and manipulate the materials can help you take control of your object of study: you can zoom in and extract particular moments to analyze, process the audio, and upload the materials to a server to compliment a blog post on the topic. On a more practical level, these skills could also allow you to record and package recordings of yourself or others for distribution. That guest lecture taking place in your department? Record it and edit it yourself! Doing so is a lightweight way to distribute resources among various institutions, and it also helps make the materials more accessible for readers and listeners with a wide variety of learning needs. In this lesson you will learn how to use Audacity to load, record, edit, mix, and export audio files. Sound editing platforms are often expensive and offer extensive capabilities that can be overwhelming to the first-time user, but Audacity is a free and open source alternative that offers powerful capabilities for sound editing with a low barrier for entry. For this lesson we will work with two audio files: a recording of Bach’s Goldberg Variations available from MusOpen and another recording of your own voice that will be made in the course of the lesson. This tutorial uses Audacity 2.1.2, released January 2016.
Fusion for Audio-Visual Laughter Detection

NARCIS (Netherlands)

Reuderink, B.

2007-01-01

Laughter is a highly variable signal, and can express a spectrum of emotions. This makes the automatic detection of laughter a challenging but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio-visual laughter detection is performed
AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis

OpenAIRE

Sager, Sebastian; Elizalde, Benjamin; Borth, Damian; Schulze, Christian; Raj, Bhiksha; Lane, Ian

2016-01-01

Recently, sound recognition has been used to identify sounds, such as car and river. However, sounds have nuances that may be better described by adjective-noun pairs such as slow car, and verb-noun pairs such as flying insects, which are under explored. Therefore, in this work we investigate the relation between audio content and both adjective-noun pairs and verb-noun pairs. Due to the lack of datasets with these kinds of annotations, we collected and processed the AudioPairBank corpus cons...
Tourism research and audio methods

DEFF Research Database (Denmark)

Jensen, Martin Trandberg

2016-01-01

• Audio methods enriches sensuous tourism ethnographies. • The note suggests five research avenues for future auditory scholarship. • Sensuous tourism research has neglected the role of sounds in embodied tourism experiences.......• Audio methods enriches sensuous tourism ethnographies. • The note suggests five research avenues for future auditory scholarship. • Sensuous tourism research has neglected the role of sounds in embodied tourism experiences....
Audio-visual synchrony and feature-selective attention co-amplify early visual processing.

Science.gov (United States)

Keitel, Christian; Müller, Matthias M

2016-05-01

Our brain relies on neural mechanisms of selective attention and converging sensory processing to efficiently cope with rich and unceasing multisensory inputs. One prominent assumption holds that audio-visual synchrony can act as a strong attractor for spatial attention. Here, we tested for a similar effect of audio-visual synchrony on feature-selective attention. We presented two superimposed Gabor patches that differed in colour and orientation. On each trial, participants were cued to selectively attend to one of the two patches. Over time, spatial frequencies of both patches varied sinusoidally at distinct rates (3.14 and 3.63 Hz), giving rise to pulse-like percepts. A simultaneously presented pure tone carried a frequency modulation at the pulse rate of one of the two visual stimuli to introduce audio-visual synchrony. Pulsed stimulation elicited distinct time-locked oscillatory electrophysiological brain responses. These steady-state responses were quantified in the spectral domain to examine individual stimulus processing under conditions of synchronous versus asynchronous tone presentation and when respective stimuli were attended versus unattended. We found that both, attending to the colour of a stimulus and its synchrony with the tone, enhanced its processing. Moreover, both gain effects combined linearly for attended in-sync stimuli. Our results suggest that audio-visual synchrony can attract attention to specific stimulus features when stimuli overlap in space.
Newnes audio and Hi-Fi engineer's pocket book

CERN Document Server

Capel, Vivian

2013-01-01

Newnes Audio and Hi-Fi Engineer's Pocket Book, Second Edition provides concise discussion of several audio topics. The book is comprised of 10 chapters that cover different audio equipment. The coverage of the text includes microphones, gramophones, compact discs, and tape recorders. The book also covers high-quality radio, amplifiers, and loudspeakers. The book then reviews the concepts of sound and acoustics, and presents some facts and formulas relevant to audio. The text will be useful to sound engineers and other professionals whose work involves sound systems.

Speech watermarking: an approach for the forensic analysis of digital telephonic recordings.

Science.gov (United States)

Faundez-Zanuy, Marcos; Lucena-Molina, Jose J; Hagmüller, Martin

2010-07-01

In this article, the authors discuss the problem of forensic authentication of digital audio recordings. Although forensic audio has been addressed in several articles, the existing approaches are focused on analog magnetic recordings, which are less prevalent because of the large amount of digital recorders available on the market (optical, solid state, hard disks, etc.). An approach based on digital signal processing that consists of spread spectrum techniques for speech watermarking is presented. This approach presents the advantage that the authentication is based on the signal itself rather than the recording format. Thus, it is valid for usual recording devices in police-controlled telephone intercepts. In addition, our proposal allows for the introduction of relevant information such as the recording date and time and all the relevant data (this is not always possible with classical systems). Our experimental results reveal that the speech watermarking procedure does not interfere in a significant way with the posterior forensic speaker identification.
47 CFR 10.520 - Common audio attention signal.

Science.gov (United States)

2010-10-01

... 47 Telecommunication 1 2010-10-01 2010-10-01 false Common audio attention signal. 10.520 Section... Equipment Requirements § 10.520 Common audio attention signal. A Participating CMS Provider and equipment manufacturers may only market devices for public use under part 10 that include an audio attention signal that...
Audio Recording of Children with Dyslalia

Directory of Open Access Journals (Sweden)

Stefan Gheorghe Pentiuc

2008-01-01

Full Text Available In this paper we present our researches regarding automat parsing of audio recordings. These recordings are obtained from children with dyslalia and are necessary for an accurate identification of speech problems. We develop a software application that helps parsing audio, real time, recordings.
Audio Journal in an ELT Context

Directory of Open Access Journals (Sweden)

Neşe Aysin Siyli

2012-09-01

Full Text Available It is widely acknowledged that one of the most serious problems students of English as a foreign language face is their deprivation of practicing the language outside the classroom. Generally, the classroom is the sole environment where they can practice English, which by its nature does not provide rich setting to help students develop their competence by putting the language into practice. Motivated by this need, this descriptive study investigated the impact of audio dialog journals on students’ speaking skills. It also aimed to gain insights into students’ and teacher’s opinions on keeping audio dialog journals outside the class. The data of the study developed from student and teacher audio dialog journals, student written feedbacks, interviews held with the students, and teacher observations. The descriptive analysis of the data revealed that audio dialog journals served a number of functions ranging from cognitive to linguistic, from pedagogical to psychological, and social. The findings and pedagogical implications of the study are discussed in detail.
Virtual Microphones for Multichannel Audio Resynthesis

Directory of Open Access Journals (Sweden)

Athanasios Mouchtaris

2003-09-01

Full Text Available Multichannel audio offers significant advantages for music reproduction, including the ability to provide better localization and envelopment, as well as reduced imaging distortion. On the other hand, multichannel audio is a demanding media type in terms of transmission requirements. Often, bandwidth limitations prohibit transmission of multiple audio channels. In such cases, an alternative is to transmit only one or two reference channels and recreate the rest of the channels at the receiving end. Here, we propose a system capable of synthesizing the required signals from a smaller set of signals recorded in a particular venue. These synthesized Ã‚Â“virtualÃ‚Â” microphone signals can be used to produce multichannel recordings that accurately capture the acoustics of that venue. Applications of the proposed system include transmission of multichannel audio over the current Internet infrastructure and, as an extension of the methods proposed here, remastering existing monophonic and stereophonic recordings for multichannel rendering.
Unrealistic Optimism: Self-Enhancement or Person Positivity?

Science.gov (United States)

Regan, Pamela C.; And Others

1995-01-01

Two studies examined whether people are unrealistically optimistic only for their own futures or for the future of any individual. Results suggested that unrealistic optimism is a form of self-enhancement rather than person positivity bias. (JBJ)
Music Genre Classification Using MIDI and Audio Features

Science.gov (United States)

Cataltepe, Zehra; Yaslan, Yusuf; Sonmez, Abdullah

2007-12-01

We report our findings on using MIDI files and audio features from MIDI, separately and combined together, for MIDI music genre classification. We use McKay and Fujinaga's 3-root and 9-leaf genre data set. In order to compute distances between MIDI pieces, we use normalized compression distance (NCD). NCD uses the compressed length of a string as an approximation to its Kolmogorov complexity and has previously been used for music genre and composer clustering. We convert the MIDI pieces to audio and then use the audio features to train different classifiers. MIDI and audio from MIDI classifiers alone achieve much smaller accuracies than those reported by McKay and Fujinaga who used not NCD but a number of domain-based MIDI features for their classification. Combining MIDI and audio from MIDI classifiers improves accuracy and gets closer to, but still worse, accuracies than McKay and Fujinaga's. The best root genre accuracies achieved using MIDI, audio, and combination of them are 0.75, 0.86, and 0.93, respectively, compared to 0.98 of McKay and Fujinaga. Successful classifier combination requires diversity of the base classifiers. We achieve diversity through using certain number of seconds of the MIDI file, different sample rates and sizes for the audio file, and different classification algorithms.
Pitch Fork: A Novel tactile Digital Musical Instrument

OpenAIRE

Williams, Peter; Overholt, Daniel

2017-01-01

Pitch Fork is a prototype of an alternate, actuated digital musical instrument (DMI). It uses 5 infra-red and 4 piezoelectric sensors to control an additive synthesis engine. Iron bars are used as the physical point of contact in interaction with the aim of using material computation to control aspects of the digitally produced sound. This choice of material was also chosen to affect player experience. Sensor readings are relayed to a Macbook via an Arduino Mega. Mappings and audio output sig...
Clinical evaluation of contrast-enhanced digital mammography and contrast enhanced tomosynthesis--Comparison to contrast-enhanced breast MRI.

Science.gov (United States)

Chou, Chen-Pin; Lewin, John M; Chiang, Chia-Ling; Hung, Bao-Hui; Yang, Tsung-Lung; Huang, Jer-Shyung; Liao, Jia-Bin; Pan, Huay-Ben

2015-12-01

To compare the diagnostic accuracy of contrast-enhanced digital mammography (CEDM) and contrast-enhanced tomosynthesis (CET) to dynamic contrast enhanced breast MRI (DCE-MRI) using a multireader-multicase study. Institutional review board approval and informed consents were obtained. Total 185 patients (mean age 51.3) with BI-RADS 4 or 5 lesions were evaluated before biopsy with mammography, tomosynthesis, CEDM, CET and DCE-MRI. Mediolateral-oblique and cranio-caudal views of the target breast CEDM and CET were acquired at 2 and 4 min after contrast agent injection. A mediolateral-oblique view of the non-target breast was taken at 6 min. Each lesion was scored with forced BI-RADS categories by three readers. Each reader interpreted lesions in the following order: mammography, tomosynthesis, CEDM, CET, and DCE-MRI during a single reading session. Histology showed 81 cancers and 144 benign lesions in the study. Of the 81 malignant lesions, 44% (36/81) were invasive and 56% (45/81) were non-invasive. Areas under the ROC curve, averaged for the 3 readers, were as follows: 0.897 for DCE-MRI, 0.892 for CET, 0.878 for CEDM, 0.784 for tomosynthesis and 0.740 for mammography. Significant differences in AUC were found between the group of contrast enhanced modalities (CEDM, CET, DCE-MRI) and the unenhanced modalities (all p0.05). CET and CEDM may be considered as an alternative modality to MRI for following up women with abnormal mammography. All three contrast modalities were superior in accuracy to conventional digital mammography with or without tomosynthesis. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
PERSONALITY DOES NOT INFLUENCE EXERCISE-INDUCED MOOD ENHANCEMENT AMONG FEMALE EXERCISERS

Directory of Open Access Journals (Sweden)

Andrew M. Lane

2005-09-01

Full Text Available The present study investigated the influence of personality on exercise-induced mood changes. It was hypothesised that (a exercise would be associated with significant mood enhancement across all personality types, (b extroversion would be associated with positive mood and neuroticism with negative mood both pre- and post-exercise, and (c personality measures would interact with exercise-induced mood changes. Participants were 90 female exercisers (M = 25.8 yr, SD = 9.0 yr who completed the Eysenck Personality Inventory (EPI once and the Brunel Mood Scale (BRUMS before and after a 60-minute exercise session. Median splits were used to group participants into four personality types: stable introverts (n = 25, stable extroverts (n = 20, neurotic introverts (n = 26, and neurotic extroverts (n = 19. Repeated measures MANOVA showed significant mood enhancement following exercise across all personality types. Neuroticism was associated with negative mood scores pre- and post-exercise but the effect of extroversion on reported mood was relatively weak. There was no significant interaction effect between exercise-induced mood enhancement and personality. In conclusion, findings lend support to the notion that exercise is associated with improved mood. However, findings show that personality did not influence this effect, although neuroticism was associated with negative mood
Audio-Visual Classification of Sports Types

DEFF Research Database (Denmark)

Gade, Rikke; Abou-Zleikha, Mohamed; Christensen, Mads Græsbøll

2015-01-01

In this work we propose a method for classification of sports types from combined audio and visual features ex- tracted from thermal video. From audio Mel Frequency Cepstral Coefficients (MFCC) are extracted, and PCA are applied to reduce the feature space to 10 dimensions. From the visual modali...
Detection and Correction of Under-/Overexposed Optical Soundtracks by Coupling Image and Audio Signal Processing

Directory of Open Access Journals (Sweden)

Etienne Decenciere

2008-10-01

Full Text Available Film restoration using image processing, has been an active research field during the last years. However, the restoration of the soundtrack has been mainly performed in the sound domain, using signal processing methods, despite the fact that it is recorded as a continuous image between the images of the film and the perforations. While the very few published approaches focus on removing dust particles or concealing larger corrupted areas, no published works are devoted to the restoration of soundtracks degraded by substantial underexposure or overexposure. Digital restoration of optical soundtracks is an unexploited application field and, besides, scientifically rich, because it allows mixing both image and signal processing approaches. After introducing the principles of optical soundtrack recording and playback, this contribution focuses on our first approaches to detect and cancel the effects of under and overexposure. We intentionally choose to get a quantification of the effect of bad exposure in the 1D audio signal domain instead of 2D image domain. Our measurement is sent as feedback value to an image processing stage where the correction takes place, building up a Ã¢Â€Âœdigital image and audio signalÃ¢Â€Â closed loop processing. The approach is validated on both simulated alterations and real data.
Relative Effectiveness of Audio Tools for Fighter Pilots in Simulated Operational Flights: A Human Factors Approach

National Research Council Canada - National Science Library

Hourlier, Sylvain; Meehan, James; Leger, Alain; Roumes, Corinne

2005-01-01

.... Increasing use of audio has been suggested as a means to reduce visual workload, to enhance situation awareness, and mitigate the manual and cognitive demands of HOTAS and existing command-and-display concepts...
Digital Signal Processors

Indian Academy of Sciences (India)

modems, audio systems and video game terminals, to cite a few. Their use is growing ... For example, the systems used to reserve railway tickets is on-line as the ... Many scientific instruments today use DSPs to enhance their performance and.
Virtual environment display for a 3D audio room simulation

Science.gov (United States)

Chapin, William L.; Foster, Scott

1992-06-01

Recent developments in virtual 3D audio and synthetic aural environments have produced a complex acoustical room simulation. The acoustical simulation models a room with walls, ceiling, and floor of selected sound reflecting/absorbing characteristics and unlimited independent localizable sound sources. This non-visual acoustic simulation, implemented with 4 audio ConvolvotronsTM by Crystal River Engineering and coupled to the listener with a Poihemus IsotrakTM, tracking the listener's head position and orientation, and stereo headphones returning binaural sound, is quite compelling to most listeners with eyes closed. This immersive effect should be reinforced when properly integrated into a full, multi-sensory virtual environment presentation. This paper discusses the design of an interactive, visual virtual environment, complementing the acoustic model and specified to: 1) allow the listener to freely move about the space, a room of manipulable size, shape, and audio character, while interactively relocating the sound sources; 2) reinforce the listener's feeling of telepresence into the acoustical environment with visual and proprioceptive sensations; 3) enhance the audio with the graphic and interactive components, rather than overwhelm or reduce it; and 4) serve as a research testbed and technology transfer demonstration. The hardware/software design of two demonstration systems, one installed and one portable, are discussed through the development of four iterative configurations. The installed system implements a head-coupled, wide-angle, stereo-optic tracker/viewer and multi-computer simulation control. The portable demonstration system implements a head-mounted wide-angle, stereo-optic display, separate head and pointer electro-magnetic position trackers, a heterogeneous parallel graphics processing system, and object oriented C++ program code.
CERN automatic audio-conference service

International Nuclear Information System (INIS)

Sierra Moral, Rodrigo

2010-01-01

Scientists from all over the world need to collaborate with CERN on a daily basis. They must be able to communicate effectively on their joint projects at any time; as a result telephone conferences have become indispensable and widely used. Managed by 6 operators, CERN already has more than 20000 hours and 5700 audio-conferences per year. However, the traditional telephone based audio-conference system needed to be modernized in three ways. Firstly, to provide the participants with more autonomy in the organization of their conferences; secondly, to eliminate the constraints of manual intervention by operators; and thirdly, to integrate the audio-conferences into a collaborative working framework. The large number, and hence cost, of the conferences prohibited externalization and so the CERN telecommunications team drew up a specification to implement a new system. It was decided to use a new commercial collaborative audio-conference solution based on the SIP protocol. The system was tested as the first European pilot and several improvements (such as billing, security, redundancy...) were implemented based on CERN's recommendations. The new automatic conference system has been operational since the second half of 2006. It is very popular for the users and has doubled the number of conferences in the past two years.
CERN automatic audio-conference service

Energy Technology Data Exchange (ETDEWEB)

Sierra Moral, Rodrigo, E-mail: Rodrigo.Sierra@cern.c [CERN, IT Department 1211 Geneva-23 (Switzerland)

2010-04-01

Scientists from all over the world need to collaborate with CERN on a daily basis. They must be able to communicate effectively on their joint projects at any time; as a result telephone conferences have become indispensable and widely used. Managed by 6 operators, CERN already has more than 20000 hours and 5700 audio-conferences per year. However, the traditional telephone based audio-conference system needed to be modernized in three ways. Firstly, to provide the participants with more autonomy in the organization of their conferences; secondly, to eliminate the constraints of manual intervention by operators; and thirdly, to integrate the audio-conferences into a collaborative working framework. The large number, and hence cost, of the conferences prohibited externalization and so the CERN telecommunications team drew up a specification to implement a new system. It was decided to use a new commercial collaborative audio-conference solution based on the SIP protocol. The system was tested as the first European pilot and several improvements (such as billing, security, redundancy...) were implemented based on CERN's recommendations. The new automatic conference system has been operational since the second half of 2006. It is very popular for the users and has doubled the number of conferences in the past two years.
CERN automatic audio-conference service

Science.gov (United States)

Sierra Moral, Rodrigo

2010-04-01

Scientists from all over the world need to collaborate with CERN on a daily basis. They must be able to communicate effectively on their joint projects at any time; as a result telephone conferences have become indispensable and widely used. Managed by 6 operators, CERN already has more than 20000 hours and 5700 audio-conferences per year. However, the traditional telephone based audio-conference system needed to be modernized in three ways. Firstly, to provide the participants with more autonomy in the organization of their conferences; secondly, to eliminate the constraints of manual intervention by operators; and thirdly, to integrate the audio-conferences into a collaborative working framework. The large number, and hence cost, of the conferences prohibited externalization and so the CERN telecommunications team drew up a specification to implement a new system. It was decided to use a new commercial collaborative audio-conference solution based on the SIP protocol. The system was tested as the first European pilot and several improvements (such as billing, security, redundancy...) were implemented based on CERN's recommendations. The new automatic conference system has been operational since the second half of 2006. It is very popular for the users and has doubled the number of conferences in the past two years.
Near-field Localization of Audio

DEFF Research Database (Denmark)

Jensen, Jesper Rindom; Christensen, Mads Græsbøll

2014-01-01

Localization of audio sources using microphone arrays has been an important research problem for more than two decades. Many traditional methods for solving the problem are based on a two-stage procedure: first, information about the audio source, such as time differences-of-arrival (TDOAs......) and gain ratios-of-arrival (GROAs) between microphones is estimated, and, second, this knowledge is used to localize the audio source. These methods often have a low computational complexity, but this comes at the cost of a limited estimation accuracy. Therefore, we propose a new localization approach......, where the desired signal is modeled using TDOAs and GROAs, which are determined by the source location. This facilitates the derivation of one-stage, maximum likelihood methods under a white Gaussian noise assumption that is applicable in both near- and far-field scenarios. Simulations show...
Musical Audio Synthesis Using Autoencoding Neural Nets

OpenAIRE

Sarroff, Andy; Casey, Michael A.

2014-01-01

With an optimal network topology and tuning of hyperpa-\\ud rameters, artificial neural networks (ANNs) may be trained\\ud to learn a mapping from low level audio features to one\\ud or more higher-level representations. Such artificial neu-\\ud ral networks are commonly used in classification and re-\\ud gression settings to perform arbitrary tasks. In this work\\ud we suggest repurposing autoencoding neural networks as\\ud musical audio synthesizers. We offer an interactive musi-\\ud cal audio synt...

Music Genre Classification Using MIDI and Audio Features

Directory of Open Access Journals (Sweden)

Abdullah Sonmez

2007-01-01

Full Text Available We report our findings on using MIDI files and audio features from MIDI, separately and combined together, for MIDI music genre classification. We use McKay and Fujinaga's 3-root and 9-leaf genre data set. In order to compute distances between MIDI pieces, we use normalized compression distance (NCD. NCD uses the compressed length of a string as an approximation to its Kolmogorov complexity and has previously been used for music genre and composer clustering. We convert the MIDI pieces to audio and then use the audio features to train different classifiers. MIDI and audio from MIDI classifiers alone achieve much smaller accuracies than those reported by McKay and Fujinaga who used not NCD but a number of domain-based MIDI features for their classification. Combining MIDI and audio from MIDI classifiers improves accuracy and gets closer to, but still worse, accuracies than McKay and Fujinaga's. The best root genre accuracies achieved using MIDI, audio, and combination of them are 0.75, 0.86, and 0.93, respectively, compared to 0.98 of McKay and Fujinaga. Successful classifier combination requires diversity of the base classifiers. We achieve diversity through using certain number of seconds of the MIDI file, different sample rates and sizes for the audio file, and different classification algorithms.
Mood expression by seniors in digital communication : Evaluative comparison of four mood-reporting instruments with elderly users

NARCIS (Netherlands)

Alberts, J.W.; Vastenburg, M.H.; Desmet, P.M.A.

2013-01-01

Elderly users have widely adopted digital communication. Digital communication is often text-only, e.g. instant messaging (IM) and e-mail. Text-only communication has been found less effective than communication that uses richer channels such as audio and video. Mood expression instruments, such as
An Economistâ€™s Guide to Digital Music

OpenAIRE

Martin Peitz; Patrick Waelbroeck

2004-01-01

In this guide, we discuss the impact of digitalization on the music industry. We rely on market and survey data at the international level as well as expert statements from the industry. The guide investigates recent developments in legal and technological protection of digital music and describes new business models as well as consumers' attitude towards music downloads and audio-streaming. We conclude the guide by a discussion of the evolution of the music industry.
Current-Driven Switch-Mode Audio Power Amplifiers

DEFF Research Database (Denmark)

Knott, Arnold; Buhl, Niels Christian; Andersen, Michael A. E.

2012-01-01

The conversion of electrical energy into sound waves by electromechanical transducers is proportional to the current through the coil of the transducer. However virtually all audio power amplifiers provide a controlled voltage through the interface to the transducer. This paper is presenting...... a switch-mode audio power amplifier not only providing controlled current but also being supplied by current. This results in an output filter size reduction by a factor of 6. The implemented prototype shows decent audio performance with THD + N below 0.1 %....
Automatic anatomically selective image enhancement in digital chest radiography

International Nuclear Information System (INIS)

Sezan, M.I.; Minerbo, G.N.; Schaetzing, R.

1989-01-01

The authors develop a technique for automatic anatomically selective enhancement of digital chest radiographs. Anatomically selective enhancement is motivated by the desire to simultaneously meet the different enhancement requirements of the lung field and the mediastinum. A recent peak detection algorithm and a set of rules are applied to the image histogram to determine automatically a gray-level threshold between the lung field and mediastinum. The gray-level threshold facilitates anatomically selective gray-scale modification and/or unsharp masking. Further, in an attempt to suppress possible white-band or black-band artifacts due to unsharp masking at sharp edges, local-contrast adaptivity is incorporated into anatomically selective unsharp masking by designing an anatomy-sensitive emphasis parameter which varies asymmetrically with positive and negative values of the local image contrast
Digital simulation of staining in histopathology multispectral images: enhancement and linear transformation of spectral transmittance.

Science.gov (United States)

Bautista, Pinky A; Yagi, Yukako

2012-05-01

Hematoxylin and eosin (H&E) stain is currently the most popular for routine histopathology staining. Special and/or immuno-histochemical (IHC) staining is often requested to further corroborate the initial diagnosis on H&E stained tissue sections. Digital simulation of staining (or digital staining) can be a very valuable tool to produce the desired stained images from the H&E stained tissue sections instantaneously. We present an approach to digital staining of histopathology multispectral images by combining the effects of spectral enhancement and spectral transformation. Spectral enhancement is accomplished by shifting the N-band original spectrum of the multispectral pixel with the weighted difference between the pixel's original and estimated spectrum; the spectrum is estimated using M transformed to the spectral configuration associated to its reaction to a specific stain by utilizing an N × N transformation matrix, which is derived through application of least mean squares method to the enhanced and target spectral transmittance samples of the different tissue components found in the image. Results of our experiments on the digital conversion of an H&E stained multispectral image to its Masson's trichrome stained equivalent show the viability of the method.
High-Order Sparse Linear Predictors for Audio Processing

DEFF Research Database (Denmark)

Giacobello, Daniele; van Waterschoot, Toon; Christensen, Mads Græsbøll

2010-01-01

Linear prediction has generally failed to make a breakthrough in audio processing, as it has done in speech processing. This is mostly due to its poor modeling performance, since an audio signal is usually an ensemble of different sources. Nevertheless, linear prediction comes with a whole set...... of interesting features that make the idea of using it in audio processing not far fetched, e.g., the strong ability of modeling the spectral peaks that play a dominant role in perception. In this paper, we provide some preliminary conjectures and experiments on the use of high-order sparse linear predictors...... in audio processing. These predictors, successfully implemented in modeling the short-term and long-term redundancies present in speech signals, will be used to model tonal audio signals, both monophonic and polyphonic. We will show how the sparse predictors are able to model efﬁciently the different...
Audio Mining with emphasis on Music Genre Classification

DEFF Research Database (Denmark)

Meng, Anders

2004-01-01

Audio is an important part of our daily life, basically it increases our impression of the world around us whether this is communication, music, danger detection etc. Currently the field of Audio Mining, which here includes areas of music genre, music recognition / retrieval, playlist generation...... the world the problem of detecting environments from the input audio is researched as to increase the life quality of hearing-impaired. Basically there is a lot of work within the field of audio mining. The presentation will mainly focus on music genre classification where we have a fixed amount of genres...... to choose from. Basically every audio mining system is more or less consisting of the same stages as for the music genre setting. My research so far has mainly focussed on finding relevant features for music genre classification living at different timescales using early and late information fusion. It has...
Augmenting Environmental Interaction in Audio Feedback Systems

Directory of Open Access Journals (Sweden)

Seunghun Kim

2016-04-01

Full Text Available Audio feedback is defined as a positive feedback of acoustic signals where an audio input and output form a loop, and may be utilized artistically. This article presents new context-based controls over audio feedback, leading to the generation of desired sonic behaviors by enriching the influence of existing acoustic information such as room response and ambient noise. This ecological approach to audio feedback emphasizes mutual sonic interaction between signal processing and the acoustic environment. Mappings from analyses of the received signal to signal-processing parameters are designed to emphasize this specificity as an aesthetic goal. Our feedback system presents four types of mappings: approximate analyses of room reverberation to tempo-scale characteristics, ambient noise to amplitude and two different approximations of resonances to timbre. These mappings are validated computationally and evaluated experimentally in different acoustic conditions.
Investigating the impact of audio instruction and audio-visual biofeedback for lung cancer radiation therapy

Science.gov (United States)

George, Rohini

Lung cancer accounts for 13% of all cancers in the Unites States and is the leading cause of deaths among both men and women. The five-year survival for lung cancer patients is approximately 15%.(ACS facts & figures) Respiratory motion decreases accuracy of thoracic radiotherapy during imaging and delivery. To account for respiration, generally margins are added during radiation treatment planning, which may cause a substantial dose delivery to normal tissues and increase the normal tissue toxicity. To alleviate the above-mentioned effects of respiratory motion, several motion management techniques are available which can reduce the doses to normal tissues, thereby reducing treatment toxicity and allowing dose escalation to the tumor. This may increase the survival probability of patients who have lung cancer and are receiving radiation therapy. However the accuracy of these motion management techniques are inhibited by respiration irregularity. The rationale of this thesis was to study the improvement in regularity of respiratory motion by breathing coaching for lung cancer patients using audio instructions and audio-visual biofeedback. A total of 331 patient respiratory motion traces, each four minutes in length, were collected from 24 lung cancer patients enrolled in an IRB-approved breathing-training protocol. It was determined that audio-visual biofeedback significantly improved the regularity of respiratory motion compared to free breathing and audio instruction, thus improving the accuracy of respiratory gated radiotherapy. It was also observed that duty cycles below 30% showed insignificant reduction in residual motion while above 50% there was a sharp increase in residual motion. The reproducibility of exhale based gating was higher than that of inhale base gating. Modeling the respiratory cycles it was found that cosine and cosine 4 models had the best correlation with individual respiratory cycles. The overall respiratory motion probability distribution
Adding Value to the University of Oklahoma Libraries History of Science Collections through Digital Enhancement

Directory of Open Access Journals (Sweden)

Maura Valentino

2014-03-01

Full Text Available Much of the focus of digital collections has been and continues to be on rare and unique materials, including monographs. A monograph may be made even rarer and more valuable by virtue of hand written marginalia. Using technology to enhance scans of unique books and make previously unreadable marginalia readable increases the value of a digital object to researchers. This article describes a case study of enhancing the marginalia in a rare book by Copernicus.
Fusion of audio and visual cues for laughter detection

NARCIS (Netherlands)

Petridis, Stavros; Pantic, Maja

Past research on automatic laughter detection has focused mainly on audio-based detection. Here we present an audio- visual approach to distinguishing laughter from speech and we show that integrating the information from audio and video channels leads to improved performance over single-modal
Perceptual Audio Hashing Functions

Directory of Open Access Journals (Sweden)

Emin Anarım

2005-07-01

Full Text Available Perceptual hash functions provide a tool for fast and reliable identification of content. We present new audio hash functions based on summarization of the time-frequency spectral characteristics of an audio document. The proposed hash functions are based on the periodicity series of the fundamental frequency and on singular-value description of the cepstral frequencies. They are found, on one hand, to perform very satisfactorily in identification and verification tests, and on the other hand, to be very resilient to a large variety of attacks. Moreover, we address the issue of security of hashes and propose a keying technique, and thereby a key-dependent hash function.
Horatio Audio-Describes Shakespeare's "Hamlet": Blind and Low-Vision Theatre-Goers Evaluate an Unconventional Audio Description Strategy

Science.gov (United States)

Udo, J. P.; Acevedo, B.; Fels, D. I.

2010-01-01

Audio description (AD) has been introduced as one solution for providing people who are blind or have low vision with access to live theatre, film and television content. However, there is little research to inform the process, user preferences and presentation style. We present a study of a single live audio-described performance of Hart House…
Automatic, anatomically selective, artifact-free enhancement of digital chest radiographs

International Nuclear Information System (INIS)

Sezan, M.I.; Tekalp, A.M.; Schaetzing, R.

1988-01-01

The authors propose a technique for automatic, anatomically selective, artifact-free enhancement of digital chest radiographs. Anatomically selective enhancement is motivated by the different enhancement requirements of the lung field and the mediastinum. A recent peak detection algorithm is applied to the image histogram to automatically determine a gray-level threshold between the lung and mediastinum fields. The gray-level threshold facilitates anatomically selective gray-scale modification and unsharp masking. Further, in an attempt to suppress possible white-band artifacts due to unsharp masking at sharp edges, local-contrast adaptivity is incorporated into anatomically selective unsharp masking by designing an anatomy-sensitive emphasis parameter that varied asymmetrically with positive and negative values of the local image contrast
Audio localization for mobile robots

OpenAIRE

de Guillebon, Thibaut; Grau Saldes, Antoni; Bolea Monte, Yolanda

2009-01-01

The department of the University for which I worked is developing a project based on the interaction with robots in the environment. My work was to define an audio system for the robot. This audio system that I have to realize consists on a mobile head which is able to follow the sound in its environment. This subject was treated as a research problem, with the liberty to find and develop different solutions and make them evolve in the chosen way.
A review on brightness preserving contrast enhancement methods for digital image

Science.gov (United States)

Rahman, Md Arifur; Liu, Shilong; Li, Ruowei; Wu, Hongkun; Liu, San Chi; Jahan, Mahmuda Rawnak; Kwok, Ngaiming

2018-04-01

Image enhancement is an imperative step for many vision based applications. For image contrast enhancement, popular methods adopt the principle of spreading the captured intensities throughout the allowed dynamic range according to predefined distributions. However, these algorithms take little or no consideration into account of maintaining the mean brightness of the original scene, which is of paramount importance to carry the true scene illumination characteristics to the viewer. Though there have been significant amount of reviews on contrast enhancement methods published, updated review on overall brightness preserving image enhancement methods is still scarce. In this paper, a detailed survey is performed on those particular methods that specifically aims to maintain the overall scene illumination characteristics while enhancing the digital image.
Personal Profiles: Enhancing Social Interaction in Learning Networks

NARCIS (Netherlands)

Berlanga, Adriana; Bitter-Rijpkema, Marlies; Brouns, Francis; Sloep, Peter; Fetter, Sibren

2009-01-01

Berlanga, A. J., Bitter-Rijpkema, M., Brouns, F., Sloep, P. B., & Fetter, S. (2011). Personal Profiles: Enhancing Social Interaction in Learning Networks. International Journal of Web Based Communities, 7(1), 66-82.
Focus on Hinduism: Audio-Visual Resources for Teaching Religion. Occasional Publication No. 23.

Science.gov (United States)

Dell, David; And Others

The guide presents annotated lists of audio and visual materials about the Hindu religion. The authors point out that Hinduism cannot be comprehended totally by reading books; thus the resources identified in this guide will enhance understanding based on reading. The guide is intended for use by high school and college students, teachers,…
Chain of evidence generation for contrast enhancement in digital image forensics

Science.gov (United States)

Battiato, Sebastiano; Messina, Giuseppe; Strano, Daniela

2010-01-01

The quality of the images obtained by digital cameras has improved a lot since digital cameras early days. Unfortunately, it is not unusual in image forensics to find wrongly exposed pictures. This is mainly due to obsolete techniques or old technologies, but also due to backlight conditions. To extrapolate some invisible details a stretching of the image contrast is obviously required. The forensics rules to produce evidences require a complete documentation of the processing steps, enabling the replication of the entire process. The automation of enhancement techniques is thus quite difficult and needs to be carefully documented. This work presents an automatic procedure to find contrast enhancement settings, allowing both image correction and automatic scripting generation. The technique is based on a preprocessing step which extracts the features of the image and selects correction parameters. The parameters are thus saved through a JavaScript code that is used in the second step of the approach to correct the image. The generated script is Adobe Photoshop compliant (which is largely used in image forensics analysis) thus permitting the replication of the enhancement steps. Experiments on a dataset of images are also reported showing the effectiveness of the proposed methodology.

EVALUASI KEPUASAN PENGGUNA TERHADAP APLIKASI AUDIO BOOKS

Directory of Open Access Journals (Sweden)

Raditya Maulana Anuraga

2017-02-01

Full Text Available Listeno is the first application audio books in Indonesia so that the users can get the book in audio form like listen to music, Listeno have problems in a feature request Listeno offline mode that have not been released, a security problem mp3 files that must be considered, and the target Listeno not yet reached 100,000 active users. This research has the objective to evaluate user satisfaction to Audio Books with research method approach, Nielsen. The analysis in this study using Importance Performance Analysis (IPA is combined with the index of User Satisfaction (IKP based on the indicators used are: Benefit (Usefulness, Utility (Utility, Usability (Usability, easy to understand (Learnability, Efficient (efficiency , Easy to remember (Memorability, Error (Error, and satisfaction (satisfaction. The results showed Applications User Satisfaction Audio books are quite satisfied with the results of the calculation IKP 69.58%..
Spatial-Numerical Associations Enhance the Short-Term Memorization of Digit Locations

Directory of Open Access Journals (Sweden)

Catherine Thevenot

2018-05-01

Full Text Available Little is known about how spatial-numerical associations (SNAs affect the way individuals process their environment, especially in terms of learning and memory. In this study, we investigated the potential effects of SNAs in a digit memory task in order to determine whether spatially organized mental representations of numbers can influence the short-term encoding of digits positioned on an external display. To this aim, we designed a memory game in which participants had to match pairs of identical digits in a 9 × 2 matrix of cards. The nine cards of the first row had to be turned face up and then face down, one by one, to reveal a digit from 1 to 9. When a card was turned face up in the second row, the position of the matching digit in the first row had to be recalled. Our results showed that performance was better when small numbers were placed on the left side of the row and large numbers on the right side (i.e., congruent as compared to the inverse (i.e., incongruent or a random configuration. Our findings suggests that SNAs can enhance the memorization of digit positions and therefore that spatial mental representations of numbers can play an important role on the way humans process and encode the information around them. To our knowledge, this study is the first that reaches this conclusion in a context where digits did not have to be processed as numerical values.
Crafting digital media

CERN Document Server

James, Daniel

2010-01-01

Open source software, also known as free software, now offers a creative platform with world-class programs. Just ask the people who have completed high-quality projects or developed popular web 2.0 sites using open source desktop applications. This phenomenon is no longer underground or restricted to techies - there have been more than 61 million downloads of the Audacity audio editor and more than 60 million downloads of the GIMP for Windows photographic tool from SourceForge.net alone. Crafting Digital Media is your foundation course in photographic manipulation, illustration, animation, 3D
Visual impact in the digital press: a Spanish empirical research

Directory of Open Access Journals (Sweden)

Joan Francesc Fondevila Gascón

2010-12-01

Full Text Available Visual resource (photography and video inclusion in digital journalism is obtaining importance in the multimedia area. The principal resources of digital press are multimedia, hypertext and interactivity. Multimedia is in an initial process of evolution. The objective of this research is to observe empirically the use of visual resources by the digital pure player press. These media try to take advantage of the new multimedia possibilities in the development and presentation of the contents. We have analyzed empirically video and photography inclusion in the multimedia framework (text, photography, video, audio, infograph and animation programs in four digital newspapers (Libertad Digital and El Plural, in Spanish, and Vilaweb.cat and e-Noticies, in Catalan analyzed according to journalistic genres.
A listening test system for automotive audio

DEFF Research Database (Denmark)

Christensen, Flemming; Geoff, Martin; Minnaar, Pauli

2005-01-01

This paper describes a system for simulating automotive audio through headphones for the purposes of conducting listening experiments in the laboratory. The system is based on binaural technology and consists of a component for reproducing the sound of the audio system itself and a component...
Voice activity detection using audio-visual information

DEFF Research Database (Denmark)

Petsatodis, Theodore; Pnevmatikakis, Aristodemos; Boukis, Christos

2009-01-01

An audio-visual voice activity detector that uses sensors positioned distantly from the speaker is presented. Its constituting unimodal detectors are based on the modeling of the temporal variation of audio and visual features using Hidden Markov Models; their outcomes are fused using a post...
Learning Tools for Knowledge Nomads: Using Personal Digital Assistants (PDAs) in Web-based Learning Environments.

Science.gov (United States)

Loh, Christian Sebastian

2001-01-01

Examines how mobile computers, or personal digital assistants (PDAs), can be used in a Web-based learning environment. Topics include wireless networks on college campuses; online learning; Web-based learning technologies; synchronous and asynchronous communication via the Web; content resources; Web connections; and collaborative learning. (LRW)
Audio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features

Directory of Open Access Journals (Sweden)

Petar S. Aleksic

2002-11-01

Full Text Available We describe an audio-visual automatic continuous speech recognition system, which significantly improves speech recognition performance over a wide range of acoustic noise levels, as well as under clean audio conditions. The system utilizes facial animation parameters (FAPs supported by the MPEG-4 standard for the visual representation of speech. We also describe a robust and automatic algorithm we have developed to extract FAPs from visual data, which does not require hand labeling or extensive training procedures. The principal component analysis (PCA was performed on the FAPs in order to decrease the dimensionality of the visual feature vectors, and the derived projection weights were used as visual features in the audio-visual automatic speech recognition (ASR experiments. Both single-stream and multistream hidden Markov models (HMMs were used to model the ASR system, integrate audio and visual information, and perform a relatively large vocabulary (approximately 1000 words speech recognition experiments. The experiments performed use clean audio data and audio data corrupted by stationary white Gaussian noise at various SNRs. The proposed system reduces the word error rate (WER by 20% to 23% relatively to audio-only speech recognition WERs, at various SNRs (0Ã¢Â€Â“30 dB with additive white Gaussian noise, and by 19% relatively to audio-only speech recognition WER under clean audio conditions.
Parametric Packet-Layer Model for Evaluation Audio Quality in Multimedia Streaming Services

Science.gov (United States)

Egi, Noritsugu; Hayashi, Takanori; Takahashi, Akira

We propose a parametric packet-layer model for monitoring audio quality in multimedia streaming services such as Internet protocol television (IPTV). This model estimates audio quality of experience (QoE) on the basis of quality degradation due to coding and packet loss of an audio sequence. The input parameters of this model are audio bit rate, sampling rate, frame length, packet-loss frequency, and average burst length. Audio bit rate, packet-loss frequency, and average burst length are calculated from header information in received IP packets. For sampling rate, frame length, and audio codec type, the values or the names used in monitored services are input into this model directly. We performed a subjective listening test to examine the relationships between these input parameters and perceived audio quality. The codec used in this test was the Advanced Audio Codec-Low Complexity (AAC-LC), which is one of the international standards for audio coding. On the basis of the test results, we developed an audio quality evaluation model. The verification results indicate that audio quality estimated by the proposed model has a high correlation with perceived audio quality.
Audio power amplifier design handbook

CERN Document Server

Self, Douglas

2013-01-01

This book is essential for audio power amplifier designers and engineers for one simple reason...it enables you as a professional to develop reliable, high-performance circuits. The Author Douglas Self covers the major issues of distortion and linearity, power supplies, overload, DC-protection and reactive loading. He also tackles unusual forms of compensation and distortion produced by capacitors and fuses. This completely updated fifth edition includes four NEW chapters including one on The XD Principle, invented by the author, and used by Cambridge Audio. Cro
Digital dosimetry and personal and environmental monitoring assembly

International Nuclear Information System (INIS)

Cerovac, Z.; Radalj, Z.; Prlic, I.; Cerovac, H.

1996-01-01

Film+TLD and film or TLD Dosimetry have a certain delay in dose reporting, since the reports on occupational doses are usually available to the users within 40 days after the actual exposure. This is particularly important when the dose is received within the short-time interval or when the radiation source has some technical failures. For this reason, the additional monitoring is recommendable. The common Dosimetry service in Croatia is well established and the data available shows that over 80% of occupationally exposed persons are working in medical facilities, mainly with x-ray sources. Dosimetry services in the country are providing three types of dosemeters, film dosemeter badge, film+TLD dosemeter badge or plane TLD badge. We have decided to introduce the palette of digital pocket dosemeters to be used at different workplaces occupationally exposed to ionizing radiation. After the first experience with the ALARA 1G digital dosemeter it came out that this type of ionizing radiation measuring device is suitable for the various non-occupational purposes. After some technical improvement and with some telecommunication electronics this device is usable as a point environmental measuring station. This means that the probe of the record any change in normal environmental radiation field, send the data to the central station and to raise alarm if necessary. That is why we have made a prototype for environmental monitoring able to be connected to any kind of telecommunication net. (author)
Audio stream classification for multimedia database search

Science.gov (United States)

Artese, M.; Bianco, S.; Gagliardi, I.; Gasparini, F.

2013-03-01

Search and retrieval of huge archives of Multimedia data is a challenging task. A classification step is often used to reduce the number of entries on which to perform the subsequent search. In particular, when new entries of the database are continuously added, a fast classification based on simple threshold evaluation is desirable. In this work we present a CART-based (Classification And Regression Tree [1]) classification framework for audio streams belonging to multimedia databases. The database considered is the Archive of Ethnography and Social History (AESS) [2], which is mainly composed of popular songs and other audio records describing the popular traditions handed down generation by generation, such as traditional fairs, and customs. The peculiarities of this database are that it is continuously updated; the audio recordings are acquired in unconstrained environment; and for the non-expert human user is difficult to create the ground truth labels. In our experiments, half of all the available audio files have been randomly extracted and used as training set. The remaining ones have been used as test set. The classifier has been trained to distinguish among three different classes: speech, music, and song. All the audio files in the dataset have been previously manually labeled into the three classes above defined by domain experts.
Audio-tactile integration and the influence of musical training.

Science.gov (United States)

Kuchenbuch, Anja; Paraskevopoulos, Evangelos; Herholz, Sibylle C; Pantev, Christo

2014-01-01

Perception of our environment is a multisensory experience; information from different sensory systems like the auditory, visual and tactile is constantly integrated. Complex tasks that require high temporal and spatial precision of multisensory integration put strong demands on the underlying networks but it is largely unknown how task experience shapes multisensory processing. Long-term musical training is an excellent model for brain plasticity because it shapes the human brain at functional and structural levels, affecting a network of brain areas. In the present study we used magnetoencephalography (MEG) to investigate how audio-tactile perception is integrated in the human brain and if musicians show enhancement of the corresponding activation compared to non-musicians. Using a paradigm that allowed the investigation of combined and separate auditory and tactile processing, we found a multisensory incongruency response, generated in frontal, cingulate and cerebellar regions, an auditory mismatch response generated mainly in the auditory cortex and a tactile mismatch response generated in frontal and cerebellar regions. The influence of musical training was seen in the audio-tactile as well as in the auditory condition, indicating enhanced higher-order processing in musicians, while the sources of the tactile MMN were not influenced by long-term musical training. Consistent with the predictive coding model, more basic, bottom-up sensory processing was relatively stable and less affected by expertise, whereas areas for top-down models of multisensory expectancies were modulated by training.
Audio-tactile integration and the influence of musical training.

Directory of Open Access Journals (Sweden)

Anja Kuchenbuch

Full Text Available Perception of our environment is a multisensory experience; information from different sensory systems like the auditory, visual and tactile is constantly integrated. Complex tasks that require high temporal and spatial precision of multisensory integration put strong demands on the underlying networks but it is largely unknown how task experience shapes multisensory processing. Long-term musical training is an excellent model for brain plasticity because it shapes the human brain at functional and structural levels, affecting a network of brain areas. In the present study we used magnetoencephalography (MEG to investigate how audio-tactile perception is integrated in the human brain and if musicians show enhancement of the corresponding activation compared to non-musicians. Using a paradigm that allowed the investigation of combined and separate auditory and tactile processing, we found a multisensory incongruency response, generated in frontal, cingulate and cerebellar regions, an auditory mismatch response generated mainly in the auditory cortex and a tactile mismatch response generated in frontal and cerebellar regions. The influence of musical training was seen in the audio-tactile as well as in the auditory condition, indicating enhanced higher-order processing in musicians, while the sources of the tactile MMN were not influenced by long-term musical training. Consistent with the predictive coding model, more basic, bottom-up sensory processing was relatively stable and less affected by expertise, whereas areas for top-down models of multisensory expectancies were modulated by training.
Classroom Audio Distribution in the Postsecondary Setting: A Story of Universal Design for Learning

Science.gov (United States)

Flagg-Williams, Joan B.; Bokhorst-Heng, Wendy D.

2016-01-01

Classroom Audio Distribution Systems (CADS) consist of amplification technology that enhances the teacher's, or sometimes the student's, vocal signal above the background noise in a classroom. Much research has supported the benefits of CADS for student learning, but most of it has focused on elementary school classrooms. This study investigated…
Automatic method for selective enhancement of different tissue densities at digital chest radiography

International Nuclear Information System (INIS)

McNitt-Gray, M.F.; Taira, R.K.; Eldredge, S.L.; Razavi, M.

1991-01-01

This paper reports that digital chest radiographs often are too bright and/or lack contrast when viewed on a video display. The authors have developed a method that can automatically provide a series of look-up tables that selectively enhance the radiographically soft or dense tissues on a digital chest radiograph. This reduces viewer interaction and improves displayed image quality. On the basis of a histogram analysis, gray-level ranges are approximated for the patient background, radiographically soft tissues, and radiographically dense tissues. A series of look-up tables is automatically created by varying the contrast in each range to achieve a level of enhancement for a selected tissue range. This is repeated for differing amounts of enhancement and for each tissue range. This allows the viewer to interactively select a tissue density range and degree of enhancement at the time of display via precalculated look-up tables. Preclinical trials in pediatric radiology using computed radiography images show that this method reduces viewer interaction and improves or maintains the displayed image quality
MP3 audio-editing software for the department of radiology

International Nuclear Information System (INIS)

Hong Qingfen; Sun Canhui; Li Ziping; Meng Quanfei; Jiang Li

2006-01-01

Objective: To evaluate the MP3 audio-editing software in the daily work in the department of radiology. Methods: The audio content of daily consultation seminar, held in the department of radiology every morning, was recorded and converted into MP3 audio format by a computer integrated recording device. The audio data were edited, archived, and eventually saved in the computer memory storage media, which was experimentally replayed and applied in the research or teaching. Results: MP3 audio-editing was a simple process and convenient for saving and searching the data. The record could be easily replayed. Conclusion: MP3 audio-editing perfectly records and saves the contents of consultation seminar, and has replaced the conventional hand writing notes. It is a valuable tool in both research and teaching in the department. (authors)
Using digital watermarking to enhance security in wireless medical image transmission.

Science.gov (United States)

Giakoumaki, Aggeliki; Perakis, Konstantinos; Banitsas, Konstantinos; Giokas, Konstantinos; Tachakra, Sapal; Koutsouris, Dimitris

2010-04-01

During the last few years, wireless networks have been increasingly used both inside hospitals and in patients' homes to transmit medical information. In general, wireless networks suffer from decreased security. However, digital watermarking can be used to secure medical information. In this study, we focused on combining wireless transmission and digital watermarking technologies to better secure the transmission of medical images within and outside the hospital. We utilized an integrated system comprising the wireless network and the digital watermarking module to conduct a series of tests. The test results were evaluated by medical consultants. They concluded that the images suffered no visible quality degradation and maintained their diagnostic integrity. The proposed integrated system presented reasonable stability, and its performance was comparable to that of a fixed network. This system can enhance security during the transmission of medical images through a wireless channel.
Tolerance of image enhancement brightness and contrast in lateral cephalometric digital radiography for Steiner analysis

Science.gov (United States)

Rianti, R. A.; Priaminiarti, M.; Syahraini, S. I.

2017-08-01

Image enhancement brightness and contrast can be adjusted on lateral cephalometric digital radiographs to improve image quality and anatomic landmarks for measurement by Steiner analysis. To determine the limit value for adjustments of image enhancement brightness and contrast in lateral cephalometric digital radiography for Steiner analysis. Image enhancement brightness and contrast were adjusted on 100 lateral cephalometric radiography in 10-point increments (-30, -20, -10, 0, +10, +20, +30). Steiner analysis measurements were then performed by two observers. Reliabilities were tested by the Interclass Correlation Coefficient (ICC) and significance tested by ANOVA or the Kruskal Wallis test. No significant differences were detected in lateral cephalometric analysis measurements following adjustment of the image enhancement brightness and contrast. The limit value of adjustments of the image enhancement brightness and contrast associated with incremental 10-point changes (-30, -20, -10, 0, +10, +20, +30) does not affect the results of Steiner analysis.
Distribuce digitálního obsahu

OpenAIRE

Voborník, Vojtěch

2016-01-01

Bakalářská práce reaguje na současné způsoby distribuce digitálního obsahu na internetu (například audia, videa, softwaru, videoher, e-knih, fotografií apod.) mezi autorem a spotřebitelem, jejich výhody a nevýhody a nabízí alternativní způsob distribuce dosud nezveřejněných digitálních děl, prostřednictvím vytvořené webové stránky, která je výstupem této práce. Bachelor thesis responds to the current distribution methods of digital content on the internet (for example, audio, video, softwa...

Use of Effective Audio in E-learning Courseware

OpenAIRE

Ray, Kisor

2015-01-01

E-Learning uses electronic media, information & communication technologies to provide education to the masses. E-learning deliver hypertext, text, audio, images, animation and videos using desktop standalone computer, local area network based intranet and internet based contents. While producing an e-learning content or course-ware, a major decision making factor is whether to use audio for the benefit of the end users. Generally, three types of audio can be used in e-learning: narration, mus...
Tune in the Net with RealAudio.

Science.gov (United States)

Buchanan, Larry

1997-01-01

Describes how to connect to the RealAudio Web site to download a player that provides sound from Web pages to the computer through streaming technology. Explains hardware and software requirements and provides addresses for other RealAudio Web sites are provided, including weather information and current news. (LRW)
Digital watermarking opportunities enabled by mobile media proliferation

Science.gov (United States)

Modro, Sierra; Sharma, Ravi K.

2009-02-01

Consumer usages of mobile devices and electronic media are changing. Mobile devices now include increased computational capabilities, mobile broadband access, better integrated sensors, and higher resolution screens. These enhanced features are driving increased consumption of media such as images, maps, e-books, audio, video, and games. As users become more accustomed to using mobile devices for media, opportunities arise for new digital watermarking usage models. For example, transient media, like images being displayed on screens, could be watermarked to provide a link between mobile devices. Applications based on these emerging usage models utilizing watermarking can provide richer user experiences and drive increased media consumption. We describe the enabling factors and highlight a few of the usage models and new opportunities. We also outline how the new opportunities are driving further innovation in watermarking technologies. We discuss challenges in market adoption of applications based on these usage models.
Effects of audio-visual aids on foreign language test anxiety, reading and listening comprehension, and retention in EFL learners.

Science.gov (United States)

Lee, Shu-Ping; Lee, Shin-Da; Liao, Yuan-Lin; Wang, An-Chi

2015-04-01

This study examined the effects of audio-visual aids on anxiety, comprehension test scores, and retention in reading and listening to short stories in English as a Foreign Language (EFL) classrooms. Reading and listening tests, general and test anxiety, and retention were measured in English-major college students in an experimental group with audio-visual aids (n=83) and a control group without audio-visual aids (n=94) with similar general English proficiency. Lower reading test anxiety, unchanged reading comprehension scores, and better reading short-term and long-term retention after four weeks were evident in the audiovisual group relative to the control group. In addition, lower listening test anxiety, higher listening comprehension scores, and unchanged short-term and long-term retention were found in the audiovisual group relative to the control group after the intervention. Audio-visual aids may help to reduce EFL learners' listening test anxiety and enhance their listening comprehension scores without facilitating retention of such materials. Although audio-visual aids did not increase reading comprehension scores, they helped reduce EFL learners' reading test anxiety and facilitated retention of reading materials.
Portable audio electronics for impedance-based measurements in microfluidics

International Nuclear Information System (INIS)

Wood, Paul; Sinton, David

2010-01-01

We demonstrate the use of audio electronics-based signals to perform on-chip electrochemical measurements. Cell phones and portable music players are examples of consumer electronics that are easily operated and are ubiquitous worldwide. Audio output (play) and input (record) signals are voltage based and contain frequency and amplitude information. A cell phone, laptop soundcard and two compact audio players are compared with respect to frequency response; the laptop soundcard provides the most uniform frequency response, while the cell phone performance is found to be insufficient. The audio signals in the common portable music players and laptop soundcard operate in the range of 20 Hz to 20 kHz and are found to be applicable, as voltage input and output signals, to impedance-based electrochemical measurements in microfluidic systems. Validated impedance-based measurements of concentration (0.1–50 mM), flow rate (2–120 µL min −1 ) and particle detection (32 µm diameter) are demonstrated. The prevailing, lossless, wave audio file format is found to be suitable for data transmission to and from external sources, such as a centralized lab, and the cost of all hardware (in addition to audio devices) is ∼10 USD. The utility demonstrated here, in combination with the ubiquitous nature of portable audio electronics, presents new opportunities for impedance-based measurements in portable microfluidic systems. (technical note)
Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations

Directory of Open Access Journals (Sweden)

Müller Meinard

2007-01-01

Full Text Available One major goal of structural analysis of an audio recording is to automatically extract the repetitive structure or, more generally, the musical form of the underlying piece of music. Recent approaches to this problem work well for music, where the repetitions largely agree with respect to instrumentation and tempo, as is typically the case for popular music. For other classes of music such as Western classical music, however, musically similar audio segments may exhibit significant variations in parameters such as dynamics, timbre, execution of note groups, modulation, articulation, and tempo progression. In this paper, we propose a robust and efficient algorithm for audio structure analysis, which allows to identify musically similar segments even in the presence of large variations in these parameters. To account for such variations, our main idea is to incorporate invariance at various levels simultaneously: we design a new type of statistical features to absorb microvariations, introduce an enhanced local distance measure to account for local variations, and describe a new strategy for structure extraction that can cope with the global variations. Our experimental results with classical and popular music show that our algorithm performs successfully even in the presence of significant musical variations.
Modified DCTNet for audio signals classification

Science.gov (United States)

Xian, Yin; Pu, Yunchen; Gan, Zhe; Lu, Liang; Thompson, Andrew

2016-10-01

In this paper, we investigate DCTNet for audio signal classification. Its output feature is related to Cohen's class of time-frequency distributions. We introduce the use of adaptive DCTNet (A-DCTNet) for audio signals feature extraction. The A-DCTNet applies the idea of constant-Q transform, with its center frequencies of filterbanks geometrically spaced. The A-DCTNet is adaptive to different acoustic scales, and it can better capture low frequency acoustic information that is sensitive to human audio perception than features such as Mel-frequency spectral coefficients (MFSC). We use features extracted by the A-DCTNet as input for classifiers. Experimental results show that the A-DCTNet and Recurrent Neural Networks (RNN) achieve state-of-the-art performance in bird song classification rate, and improve artist identification accuracy in music data. They demonstrate A-DCTNet's applicability to signal processing problems.
Concurrent Unimodal Learning Enhances Multisensory Responses of Bi-Directional Crossmodal Learning in Robotic Audio-Visual Tracking

DEFF Research Database (Denmark)

Shaikh, Danish; Bodenhagen, Leon; Manoonpong, Poramate

2018-01-01

modalities to independently update modality-specific neural weights on a moment-by-moment basis, in response to dynamic changes in noisy sensory stimuli. The circuit is embodied as a non-holonomic robotic agent that must orient a towards a moving audio-visual target. The circuit continuously learns the best...
Audio Feedback to Physiotherapy Students for Viva Voce: How Effective Is "The Living Voice"?

Science.gov (United States)

Munro, Wendy; Hollingworth, Linda

2014-01-01

Assessment and feedback remains one of the categories that students are least satisfied with within the United Kingdom National Student Survey. The Student Charter promotes the use of various formats of feedback to enhance student learning. This study evaluates the use of audio MP3 as an alternative feedback mechanism to written feedback for…
Defining What Matters When Preserving Web-Based Personal Digital Collections: Listening to Bloggers

Directory of Open Access Journals (Sweden)

Ayoung Yoon

2013-06-01

Full Text Available User-generated content (UGC has become a part of personal digital collections on the Web, as such collections often contain personal memories, activities, thoughts and even profiles. With the increase in the creation of personal materials on the Web, the needs for archiving and preserving these materials are increasing, not only for the purpose of developing personal archives but also for the purpose of capturing social memory and tracking human traces in this era. Using both survey and interview methods, this study investigated blogs, one popular type of UGC, and analyzed travel bloggers’ perceptions of the value of blogs and the elements of blogs that are important for preservation. The study respondents found personal and sentimental value (e.g., a way to express themselves, a way to keep personal memories and thoughts, and a way to maintain a record for their family to be the most important reason for preserving blogs, followed by informational value and cultural/historical value. Sharing also appeared as one of the values that respondents found in their blogs. The respondents reported that self-created blog posts (content and information related to the blog posts (context are more important to preserve than some other elements (behavior and appearance. Integrating what bloggers consider as most valuable and what archivists think are worth preserving may be an important step when collecting personal blogs.
The dBoard: a Digital Scrum Board for Distributed Software Development

DEFF Research Database (Denmark)

Esbensen, Morten; Tell, Paolo; Cholewa, Jacob Benjamin

2015-01-01

In this paper we present the dBoard - a digital Scrum Board for distributed Agile software development teams. The dBoard is designed as a 'virtual window' between two Scrum team spaces. It connects two locations with live video and audio, which is overlaid with a synchronized and interactive...... digital Scrum board, and it adapts the fidelity of the video/audio to the presence of people in front of it. The dBoard is designed to work (i) as a passive information radiator from which it is easy to get an overview of the status of work, (ii) as a media space providing awareness about the presence...... of remote co-workers, and (iii) as an active meeting support tool. The paper presents a case study of distributed Scrum in a large software company that motivates the design of the dBoard, and details the design and technical implementation of the dBoard. The paper also reports on an initial user study...
The Fungible Audio-Visual Mapping and its Experience

Directory of Open Access Journals (Sweden)

Adriana Sa

2014-12-01

Full Text Available This article draws a perceptual approach to audio-visual mapping. Clearly perceivable cause and effect relationships can be problematic if one desires the audience to experience the music. Indeed perception would bias those sonic qualities that fit previous concepts of causation, subordinating other sonic qualities, which may form the relations between the sounds themselves. The question is, how can an audio-visual mapping produce a sense of causation, and simultaneously confound the actual cause-effect relationships. We call this a fungible audio-visual mapping. Our aim here is to glean its constitution and aspect. We will report a study, which draws upon methods from experimental psychology to inform audio-visual instrument design and composition. The participants are shown several audio-visual mapping prototypes, after which we pose quantitative and qualitative questions regarding their sense of causation, and their sense of understanding the cause-effect relationships. The study shows that a fungible mapping requires both synchronized and seemingly non-related components – sufficient complexity to be confusing. As the specific cause-effect concepts remain inconclusive, the sense of causation embraces the whole.
A Study of Applying Digital Mobile Museum Guide

Directory of Open Access Journals (Sweden)

Chao-yun Chaucer Liang

2003-09-01

Full Text Available With the prosperous development of information technology, museums begin to apply new technology to enhance operation and communication efficiency. One of the information technology. Personal Digital Mobile, featuring light weight and mobility, can help museum to set up an interactive navigation system, which offering capability of user-controlled guidance and both broad and depth information. In this study, literature related to museum tour guide, digital mobile navigation, and multimedia interaction design were reviewed, and two examples were offered for reference. The first one example is Exploratorium in American, which is cooperated with HP labs to integrate wireless networking and PDA devices. The domestic example is the design project of the Personal Digital Mobile Guide for the Emperor Ch’ien-lung’s Grand Cultural Enterprise Exhibition in National Palace Museum, 2002. This paper introduces the techniques involved, interactive storyboard, interface design, color planning, electronic element planning, etc. The process of applying theory into creative project may help future researches in the related areas.[Article content in Chinese
Concurrent audio-visual feedback for supporting drivers at intersections: A study using two linked driving simulators.

Science.gov (United States)

Houtenbos, M; de Winter, J C F; Hale, A R; Wieringa, P A; Hagenzieker, M P

2017-04-01

A large portion of road traffic crashes occur at intersections for the reason that drivers lack necessary visual information. This research examined the effects of an audio-visual display that provides real-time sonification and visualization of the speed and direction of another car approaching the crossroads on an intersecting road. The location of red blinking lights (left vs. right on the speedometer) and the lateral input direction of beeps (left vs. right ear in headphones) corresponded to the direction from where the other car approached, and the blink and beep rates were a function of the approaching car's speed. Two driving simulators were linked so that the participant and the experimenter drove in the same virtual world. Participants (N = 25) completed four sessions (two with the audio-visual display on, two with the audio-visual display off), each session consisting of 22 intersections at which the experimenter approached from the left or right and either maintained speed or slowed down. Compared to driving with the display off, the audio-visual display resulted in enhanced traffic efficiency (i.e., greater mean speed, less coasting) while not compromising safety (i.e., the time gap between the two vehicles was equivalent). A post-experiment questionnaire showed that the beeps were regarded as more useful than the lights. It is argued that the audio-visual display is a promising means of supporting drivers until fully automated driving is technically feasible. Copyright © 2016. Published by Elsevier Ltd.
Efficiency Optimization in Class-D Audio Amplifiers

DEFF Research Database (Denmark)

Yamauchi, Akira; Knott, Arnold; Jørgensen, Ivan Harald Holger

2015-01-01

This paper presents a new power efficiency optimization routine for designing Class-D audio amplifiers. The proposed optimization procedure finds design parameters for the power stage and the output filter, and the optimum switching frequency such that the weighted power losses are minimized under...... the given constraints. The optimization routine is applied to minimize the power losses in a 130 W class-D audio amplifier based on consumer behavior investigations, where the amplifier operates at idle and low power levels most of the time. Experimental results demonstrate that the optimization method can...... lead to around 30 % of efficiency improvement at 1.3 W output power without significant effects on both audio performance and the efficiency at high power levels....
DIGITAL BROADCASTING and INTERACTIVE TELEVISION in DISTANCE EDUCATION: Digital And Interactive Television Infrastructure Proposol for Anadolu University Open Education Faculty

Directory of Open Access Journals (Sweden)

Reha Recep ERGUL

2007-01-01

Full Text Available Rapid changes and improvements in the communication and information technologies beginning from the midst of the 20th Century and continuing today require new methods, constructions, and arrangements in the production and distribution of information. While television having the ability of presenting complex or difficult to comprehend concepts, subjects, and experimental studies to learners from different points of view, supported by 2D or 3D graphics and animations with audio visual stimulators replaces its technology from analog to digital and towards digital-interactive, it has also begun to convert the broadcasting technology in Turkey in this direction. Therefore, television broadcast infrastructure of Anadolu University Open Education Faculty needs to be replaced with a digital and interactive one. This study contains basic concepts of digital and interactive broadcasting and the new improvements. Furthermore, it includes the approaches in the basis of why and how a digital television broadcasting infrastructure should be stablished.
Robustness evaluation of transactional audio watermarking systems

Science.gov (United States)

Neubauer, Christian; Steinebach, Martin; Siebenhaar, Frank; Pickel, Joerg

2003-06-01

Distribution via Internet is of increasing importance. Easy access, transmission and consumption of digitally represented music is very attractive to the consumer but led also directly to an increasing problem of illegal copying. To cope with this problem watermarking is a promising concept since it provides a useful mechanism to track illicit copies by persistently attaching property rights information to the material. Especially for online music distribution the use of so-called transaction watermarking, also denoted with the term bitstream watermarking, is beneficial since it offers the opportunity to embed watermarks directly into perceptually encoded material without the need of full decompression/compression. Besides the concept of bitstream watermarking, former publications presented the complexity, the audio quality and the detection performance. These results are now extended by an assessment of the robustness of such schemes. The detection performance before and after applying selected attacks is presented for MPEG-1/2 Layer 3 (MP3) and MPEG-2/4 AAC bitstream watermarking, contrasted to the performance of PCM spread spectrum watermarking.
DOA Estimation of Audio Sources in Reverberant Environments

DEFF Research Database (Denmark)

Jensen, Jesper Rindom; Nielsen, Jesper Kjær; Heusdens, Richard

2016-01-01

Reverberation is well-known to have a detrimental impact on many localization methods for audio sources. We address this problem by imposing a model for the early reflections as well as a model for the audio source itself. Using these models, we propose two iterative localization methods...... that estimate the direction-of-arrival (DOA) of both the direct path of the audio source and the early reflections. In these methods, the contribution of the early reflections is essentially subtracted from the signal observations before localization of the direct path component, which may reduce the estimation...
Portable Audio Design

DEFF Research Database (Denmark)

Groth, Sanne Krogh

2014-01-01

attention to the specific genre; a grasping of the complex relationship between site and time, the actual and the virtual; and getting aquatint with the specific site’s soundscape by approaching it both intuitively and systematically. These steps will finally lead to an audio production that not only...
AUDIO CRYPTANALYSIS- AN APPLICATION OF SYMMETRIC KEY CRYPTOGRAPHY AND AUDIO STEGANOGRAPHY

Directory of Open Access Journals (Sweden)

Smita Paira

2016-09-01

Full Text Available In the recent trend of network and technology, “Cryptography” and “Steganography” have emerged out as the essential elements of providing network security. Although Cryptography plays a major role in the fabrication and modification of the secret message into an encrypted version yet it has certain drawbacks. Steganography is the art that meets one of the basic limitations of Cryptography. In this paper, a new algorithm has been proposed based on both Symmetric Key Cryptography and Audio Steganography. The combination of a randomly generated Symmetric Key along with LSB technique of Audio Steganography sends a secret message unrecognizable through an insecure medium. The Stego File generated is almost lossless giving a 100 percent recovery of the original message. This paper also presents a detailed experimental analysis of the algorithm with a brief comparison with other existing algorithms and a future scope. The experimental verification and security issues are promising.

DIGITAL NARRATIVES IN FUTURE UKRAINIAN LANGUAGE AND LITERATURE TEACHERS TRAINING

Directory of Open Access Journals (Sweden)

Olena Semenoh

2017-04-01

Full Text Available In the article on the basis of analyzing theoretical sources and practical experience some scientists’ works are disclosed, which deal with using and designing digital narratives in future Ukrainian language and literature teachers’ training, to develop a personality’s information and digital competence. It is reported that the themes, which are focused on postgraduate students’ acquainting with digital technologies of studying linguistic subjects at university, in specialized classes in secondary school, and a new type of educational institutions, should be introduced into language and methodological training. The author emphasizes on the relevance and importance of using digital narratives for democratization and humanization, the inspiration of the educational process. Narratives (stories in literary works, letters, confessions, biographies, diaries, comments, portrait sketches, pedagogical aphorisms, scripts, summaries of lessons with notes in the margins and others, biographical and pedagogical narratives provide information about the events, situations, taking into account individual reflexed experience of outstanding teachers. If students have an opportunity to develop skills of making narratives, they will gradually get communicative competences and feeling of confidence in their own ability that are necessary in the life. The works by M. Leshchenko and L. Tymchuk that are devoted to studying biography narratives are overviewed. The author suggests her own works of studying biography narratives of outstanding personalities (O. Zakharenko, I. Ziaziun, N. Voloshyna, L. Matsko and others. Digital narrative is characterized as a dynamic means of sending information messages in which a word, an image and sound are expressed in a joint digital code; as multimedia project that combines text, a picture, audio and video files in a short video clip. It is spoken in detail that digital narratives that are used or made together with students
Adaptive DCTNet for Audio Signal Classification

OpenAIRE

Xian, Yin; Pu, Yunchen; Gan, Zhe; Lu, Liang; Thompson, Andrew

2016-01-01

In this paper, we investigate DCTNet for audio signal classification. Its output feature is related to Cohen's class of time-frequency distributions. We introduce the use of adaptive DCTNet (A-DCTNet) for audio signals feature extraction. The A-DCTNet applies the idea of constant-Q transform, with its center frequencies of filterbanks geometrically spaced. The A-DCTNet is adaptive to different acoustic scales, and it can better capture low frequency acoustic information that is sensitive to h...
Media Sosial Instagram sebagai Sarana Sosialisasi Kebijakan Penyiaran Digital

Directory of Open Access Journals (Sweden)

Agung Prabowo

2017-05-01

Full Text Available This research tests the hypothesis that social media (Instagram is used as an effective medium to disseminate and educate people on issue of migration and digital TV. It is a three-week experimental research to 79 students as respondents based on video animation and text related to digital broadcasting. Instagram is chosen in term of interactive and audio-visual characteristics. The result shows that there is non-significant difference on students’ knowledge after treatment. The Chi Square test shows that Asymptotic significance is 0.646 (greater than 0,05. It indicates that there is no significant difference of knowledge before and after receiving a message about digital TV via Instagram.
Fall Detection Using Smartphone Audio Features.

Science.gov (United States)

Cheffena, Michael

2016-07-01

An automated fall detection system based on smartphone audio features is developed. The spectrogram, mel frequency cepstral coefficents (MFCCs), linear predictive coding (LPC), and matching pursuit (MP) features of different fall and no-fall sound events are extracted from experimental data. Based on the extracted audio features, four different machine learning classifiers: k-nearest neighbor classifier (k-NN), support vector machine (SVM), least squares method (LSM), and artificial neural network (ANN) are investigated for distinguishing between fall and no-fall events. For each audio feature, the performance of each classifier in terms of sensitivity, specificity, accuracy, and computational complexity is evaluated. The best performance is achieved using spectrogram features with ANN classifier with sensitivity, specificity, and accuracy all above 98%. The classifier also has acceptable computational requirement for training and testing. The system is applicable in home environments where the phone is placed in the vicinity of the user.
E-Books and Audiobooks: Extending the Digital Reading Experience

Science.gov (United States)

Larson, Lotta C.

2015-01-01

This article examines how sixth-grade students navigated and perceived a combined e-book and audiobook reading experience using Kindle Fires. While audiobooks and e-books are not new, little is known about students' use and perceptions of the combination of these two media, as the ability to synchronize audio contents with digital texts is rather…
Open-Loop Audio-Visual Stimulation (AVS): A Useful Tool for Management of Insomnia?

Science.gov (United States)

Tang, Hsin-Yi Jean; Riegel, Barbara; McCurry, Susan M; Vitiello, Michael V

2016-03-01

Audio Visual Stimulation (AVS), a form of neurofeedback, is a non-pharmacological intervention that has been used for both performance enhancement and symptom management. We review the history of AVS, its two sub-types (close- and open-loop), and discuss its clinical implications. We also describe a promising new application of AVS to improve sleep, and potentially decrease pain. AVS research can be traced back to the late 1800s. AVS's efficacy has been demonstrated for both performance enhancement and symptom management. Although AVS is commonly used in clinical settings, there is limited literature evaluating clinical outcomes and mechanisms of action. One of the challenges to AVS research is the lack of standardized terms, which makes systematic review and literature consolidation difficult. Future studies using AVS as an intervention should; (1) use operational definitions that are consistent with the existing literature, such as AVS, Audio-visual Entrainment, or Light and Sound Stimulation, (2) provide a clear rationale for the chosen training frequency modality, (3) use a randomized controlled design, and (4) follow the Consolidated Standards of Reporting Trials and/or related guidelines when disseminating results.
Algorithmic Skin: Health-Tracking Technologies, Personal Analytics and the Biopedagogies of Digitized Health and Physical Education

Science.gov (United States)

Williamson, Ben

2015-01-01

The emergence of digitized health and physical education, or "eHPE", embeds software algorithms in the organization of health and physical education pedagogies. Particularly with the emergence of wearable and mobile activity trackers, biosensors and personal analytics apps, algorithmic processes have an increasingly powerful part to play…
Digital communication device

DEFF Research Database (Denmark)

2005-01-01

The invention concerns a digital communication device like a hearing aid or a headset. The hearing aid or headset has a power supply, a signal processing device, means for receiving a wireless signal and a receiver or loudspeaker, which produces an audio signal based on a modulated pulsed signal...... point is provided which is in electrical contact with the metal of the metal box and whereby this third connection point is connected to the electric circuitry of the communication device at a point having a stable and well defined electrical potential. In this way the electro-and magnetic radiation...
Audio-Tutorial Instruction: A Strategy For Teaching Introductory College Geology.

Science.gov (United States)

Fenner, Peter; Andrews, Ted F.

The rationale of audio-tutorial instruction is discussed, and the history and development of the audio-tutorial botany program at Purdue University is described. Audio-tutorial programs in geology at eleven colleges and one school are described, illustrating several ways in which programs have been developed and integrated into courses. Programs…
X-ray film digitization using a personal computer and hand-held scanner: a simple technique for storing images

International Nuclear Information System (INIS)

Munoz-Nunez, C. F.; Lloret-Alcaniz, A.

1998-01-01

To develop a simple, low-cost technique for the digitization of X-ray films for personal use. A 66-MHz 486 PC with 8 MB of RAM, a Logitech ScanMan 256 hand-held scanner and a standard negatoscope with the power source converted to direct current. Although the system was originally designed for the digitization of mammographies, it has also been used with computed tomography, magnetic resonance, digital angiography and ultrasonographic images, as well as plain X-rays. After a minimal training period, the system digitized X-ray films easily and rapidly. Although the scanning values vary depending on the type of image to be digitized, an input spatial resolution of 200 dpi and a contrast resolution of 256 levels of gray are generally adequate. Of the storage formats tested, JPEG presented the best quality/image size ratio. A simple, low-cost technique has been developed for the digitization of X-ray films. This technique enables the storage of images in a digital format, thus facilitating their presentation and transmission. (Author) 9 refs
The Power of Digital Storytelling to Support Teaching and Learning

Science.gov (United States)

Robin, Bernard R.

2016-01-01

Although the term "digital storytelling" may not be familiar to all readers, over the last twenty years, an increasing number of educators, students and others around the world have created short movies by combining computer-based images, text, recorded audio narration, video clips and music in order to present information on various…
A digital strategy for manometer dynamic enhancement. [for wind tunnel monitoring

Science.gov (United States)

Stoughton, J. W.

1978-01-01

Application of digital signal processing techniques to improve the non-linear dynamic characteristics of a sonar-type mercury manometer is described. The dynamic enhancement strategy quasi-linearizes the manometer characteristics and improves the effective bandwidth in the context of a wind-tunnel pressure regulation system. Model identification data and real-time hybrid simulation data demonstrate feasibility of approach.
Optimized Audio Classification and Segmentation Algorithm by Using Ensemble Methods

Directory of Open Access Journals (Sweden)

Saadia Zahid

2015-01-01

Full Text Available Audio segmentation is a basis for multimedia content analysis which is the most important and widely used application nowadays. An optimized audio classification and segmentation algorithm is presented in this paper that segments a superimposed audio stream on the basis of its content into four main audio types: pure-speech, music, environment sound, and silence. An algorithm is proposed that preserves important audio content and reduces the misclassification rate without using large amount of training data, which handles noise and is suitable for use for real-time applications. Noise in an audio stream is segmented out as environment sound. A hybrid classification approach is used, bagged support vector machines (SVMs with artificial neural networks (ANNs. Audio stream is classified, firstly, into speech and nonspeech segment by using bagged support vector machines; nonspeech segment is further classified into music and environment sound by using artificial neural networks and lastly, speech segment is classified into silence and pure-speech segments on the basis of rule-based classifier. Minimum data is used for training classifier; ensemble methods are used for minimizing misclassification rate and approximately 98% accurate segments are obtained. A fast and efficient algorithm is designed that can be used with real-time multimedia applications.
Image enhancement of digital periapical radiographs according to diagnostic tasks

International Nuclear Information System (INIS)

Choi, Jin Woo; Han, Won Jeong; Kim, Eun Kyung

2014-01-01

his study was performed to investigate the effect of image enhancement of periapical radiographs according to the diagnostic task. Eighty digital intraoral radiographs were obtained from patients and classified into four groups according to the diagnostic tasks of dental caries, periodontal diseases, periapical lesions, and endodontic files. All images were enhanced differently by using five processing techniques. Three radiologists blindly compared the subjective image quality of the original images and the processed images using a 5-point scale. There were significant differences between the image quality of the processed images and that of the original images (P<0.01) in all the diagnostic task groups. Processing techniques showed significantly different efficacy according to the diagnostic task (P<0.01). Image enhancement affects the image quality differently depending on the diagnostic task. And the use of optimal parameters is important for each diagnostic task.
Image enhancement of digital periapical radiographs according to diagnostic tasks

Energy Technology Data Exchange (ETDEWEB)

Choi, Jin Woo; Han, Won Jeong; Kim, Eun Kyung [Dept. of Oral and Maxillofacial Radiology, Dankook University College of Dentistry, Cheonan (Korea, Republic of)

2014-03-15

his study was performed to investigate the effect of image enhancement of periapical radiographs according to the diagnostic task. Eighty digital intraoral radiographs were obtained from patients and classified into four groups according to the diagnostic tasks of dental caries, periodontal diseases, periapical lesions, and endodontic files. All images were enhanced differently by using five processing techniques. Three radiologists blindly compared the subjective image quality of the original images and the processed images using a 5-point scale. There were significant differences between the image quality of the processed images and that of the original images (P<0.01) in all the diagnostic task groups. Processing techniques showed significantly different efficacy according to the diagnostic task (P<0.01). Image enhancement affects the image quality differently depending on the diagnostic task. And the use of optimal parameters is important for each diagnostic task.
Audio Description as a Pedagogical Tool

Directory of Open Access Journals (Sweden)

Georgina Kleege

2015-05-01

Full Text Available Audio description is the process of translating visual information into words for people who are blind or have low vision. Typically such description has focused on films, museum exhibitions, images and video on the internet, and live theater. Because it allows people with visual impairments to experience a variety of cultural and educational texts that would otherwise be inaccessible, audio description is a mandated aspect of disability inclusion, although it remains markedly underdeveloped and underutilized in our classrooms and in society in general. Along with increasing awareness of disability, audio description pushes students to practice close reading of visual material, deepen their analysis, and engage in critical discussions around the methodology, standards and values, language, and role of interpretation in a variety of academic disciplines. We outline a few pedagogical interventions that can be customized to different contexts to develop students' writing and critical thinking skills through guided description of visual material.
Extracting meaning from audio signals - a machine learning approach

DEFF Research Database (Denmark)

Larsen, Jan

2007-01-01

* Machine learning framework for sound search * Genre classification * Music and audio separation * Wind noise suppression......* Machine learning framework for sound search * Genre classification * Music and audio separation * Wind noise suppression...
37 CFR 382.2 - Royalty fees for the digital performance of sound recordings and the making of ephemeral...

Science.gov (United States)

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Royalty fees for the digital... SATELLITE DIGITAL AUDIO RADIO SERVICES Preexisting Subscription Services § 382.2 Royalty fees for the... monthly royalty fee for the public performance of sound recordings pursuant to 17 U.S.C. 114(d)(2) and the...
Independent Interactive Inquiry-Based Learning Modules Using Audio-Visual Instruction In Statistics

OpenAIRE

McDaniel, Scott N.; Green, Lisa

2012-01-01

Simulations can make complex ideas easier for students to visualize and understand. It has been shown that guidance in the use of these simulations enhances students’ learning. This paper describes the implementation and evaluation of the Independent Interactive Inquiry-based (I3) Learning Modules, which use existing open-source Java applets, combined with audio-visual instruction. Students are guided to discover and visualize important concepts in post-calculus and algebra-based courses in p...
Decision-level fusion for audio-visual laughter detection

NARCIS (Netherlands)

Reuderink, B.; Poel, M.; Truong, K.; Poppe, R.; Pantic, M.

2008-01-01

Laughter is a highly variable signal, which can be caused by a spectrum of emotions. This makes the automatic detection of laughter a challenging, but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio-visual laughter detection is

Decision-Level Fusion for Audio-Visual Laughter Detection

NARCIS (Netherlands)

Reuderink, B.; Poel, Mannes; Truong, Khiet Phuong; Poppe, Ronald Walter; Pantic, Maja; Popescu-Belis, Andrei; Stiefelhagen, Rainer

Laughter is a highly variable signal, which can be caused by a spectrum of emotions. This makes the automatic detection of laugh- ter a challenging, but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio- visual laughter detection is
Content Discovery from Composite Audio : An unsupervised approach

NARCIS (Netherlands)

Lu, L.

2009-01-01

In this thesis, we developed and assessed a novel robust and unsupervised framework for semantic inference from composite audio signals. We focused on the problem of detecting audio scenes and grouping them into meaningful clusters. Our approach addressed all major steps in a general process of
A compact electroencephalogram recording device with integrated audio stimulation system

Science.gov (United States)

Paukkunen, Antti K. O.; Kurttio, Anttu A.; Leminen, Miika M.; Sepponen, Raimo E.

2010-06-01

A compact (96×128×32 mm3, 374 g), battery-powered, eight-channel electroencephalogram recording device with an integrated audio stimulation system and a wireless interface is presented. The recording device is capable of producing high-quality data, while the operating time is also reasonable for evoked potential studies. The effective measurement resolution is about 4 nV at 200 Hz sample rate, typical noise level is below 0.7 μVrms at 0.16-70 Hz, and the estimated operating time is 1.5 h. An embedded audio decoder circuit reads and plays wave sound files stored on a memory card. The activities are controlled by an 8 bit main control unit which allows accurate timing of the stimuli. The interstimulus interval jitter measured is less than 1 ms. Wireless communication is made through bluetooth and the data recorded are transmitted to an external personal computer (PC) interface in real time. The PC interface is implemented with LABVIEW® and in addition to data acquisition it also allows online signal processing, data storage, and control of measurement activities such as contact impedance measurement, for example. The practical application of the device is demonstrated in mismatch negativity experiment with three test subjects.
Susceptibility to a multisensory speech illusion in older persons is driven by perceptual processes

Directory of Open Access Journals (Sweden)

Annalisa eSetti

2013-09-01

Full Text Available Recent studies suggest that multisensory integration is enhanced in older adults but it is not known whether this enhancement is solely driven by perceptual processes or affected by cognitive processes. Using the ‘McGurk illusion’, in Experiment 1 we found that audio-visual integration of incongruent audio-visual words was higher in older adults than in younger adults, although the recognition of either audio- or visual-only presented words was the same across groups. In Experiment 2 we tested recall of sentences within which an incongruent audio-visual speech word was embedded. The overall semantic meaning of the sentence was compatible with either one of the unisensory components of the target word and/or with the illusory percept. Older participants recalled more illusory audio-visual words in sentences than younger adults, however, there was no differential effect of word compatibility on recall for the two groups. Our findings suggest that the relatively high susceptibility to the audio-visual speech illusion in older participants is due more to perceptual than cognitive processing.
Economic and legal aspects of introducing novel ICT instruments: integrating sound into social media marketing - from audio branding to soundscaping

OpenAIRE

Daj, A.

2013-01-01

The pervasive expansion and implementation of ICT based marketing instruments imposes a new economic investigation of business models and regulatory solutions. Moreover, the current status of Social Media research indicates that the use of social networking and collaboration technologies is deeply changing the way people communicate, consume and cooperate with each other. Against the backdrop of widespread availability of digital audio-video content and the growing number of “smart” mobile de...
Digital staining for histopathology multispectral images by the combined application of spectral enhancement and spectral transformation.

Science.gov (United States)

Bautista, Pinky A; Yagi, Yukako

2011-01-01

In this paper we introduced a digital staining method for histopathology images captured with an n-band multispectral camera. The method consisted of two major processes: enhancement of the original spectral transmittance and the transformation of the enhanced transmittance to its target spectral configuration. Enhancement is accomplished by shifting the original transmittance with the scaled difference between the original transmittance and the transmittance estimated with m dominant principal component (PC) vectors;the m-PC vectors were determined from the transmittance samples of the background image. Transformation of the enhanced transmittance to the target spectral configuration was done using an nxn transformation matrix, which was derived by applying a least square method to the enhanced and target spectral training data samples of the different tissue components. Experimental results on the digital conversion of a hematoxylin and eosin (H&E) stained multispectral image to its Masson's trichrome stained (MT) equivalent shows the viability of the method.
Huffman coding in advanced audio coding standard

Science.gov (United States)

Brzuchalski, Grzegorz

2012-05-01

This article presents several hardware architectures of Advanced Audio Coding (AAC) Huffman noiseless encoder, its optimisations and working implementation. Much attention has been paid to optimise the demand of hardware resources especially memory size. The aim of design was to get as short binary stream as possible in this standard. The Huffman encoder with whole audio-video system has been implemented in FPGA devices.
Does Digital Video Enhance Student Learning in Field-Based Experiments and Develop Graduate Attributes beyond the Classroom?

Science.gov (United States)

Fuller, Ian C.; France, Derek

2016-01-01

The connection between fieldwork and development of graduate attributes is explored in this paper. Digital technologies present opportunities to potentially enhance the learning experience of students undertaking fieldwork, and develop core digital attributes and competencies required by Higher Education Institutions (HEIs) and employers. This…
Digital health increasing the impact with personalized design

OpenAIRE

Empelen, P. van; Otten, W.; Molema, H.; Keijsers, J.; Mooij, R.

2016-01-01

Digital health is considered the ‘holy grail’ of effective and sustainable health(care). It uses the latest technology, apps and data to support and improve health. Digital health tools can benefit both patients and healthy individuals, with support and advice. But healthcare professionals, policymakers and scientist can also benefit from the (big) data and insights collected by digital health applications. A well-known example of digital health is eHealth, which provides information- and com...
Mobile Learning Devices. Essentials for Principals

Science.gov (United States)

Rogers, Kipp D.

2011-01-01

In "Mobile Learning Devices," the author helps educators confront and overcome their fears and doubts about using mobile learning devices (MLDs) such as cell phones, personal digital assistants, MP3 players, handheld games, digital audio players, and laptops in classrooms. School policies that ban such tools are outdated, the author suggests;…
Imagination and Modern Audio Visual Form

Directory of Open Access Journals (Sweden)

Ana Đurković

2017-09-01

Full Text Available Through three episodes Archetype of modern fairy tales, the mysterious world of fantasy and reality,tell as a serious story about archetypes, symbols, knowledge of good and evil. Rts editor: Natasa Neskovic Written and directed by: Suncica Jergovic Editing: Ana Djurkovic How to illuminate concept of phantasy and affective factors in our imagination a priori something so imaginary, by their genetic provenance, such as a movie scene, or digital picture and sound. You can not always avoid the association to a valid phrase of arnhajm’s truth: mass age -massage: the medium is the message. In elementary and tersely definition of „the shot“ from Plaževsky film language there is term for „le cadre“, however these are selected bits of reality, immanent frame that contains the individual act of images divided of the continent’s view of reality, handling the specific code of semantic value, when its’s imaginative, of course, by aesthetic categories and evaluations. In this type of positive simulacrum, it can not be better segment for the current thinking about the limits of imagination and truth in contemporary media, and contemporary global environment, than the original audio-visual forms through whose prism we search throught a fairy tale in a same time myth and imagination as well as exploring its overall impact on the personality. Everything can be a fairy tale, even false, amoral platitudes politicized by political lobbies in a contemporary existing power sistems, but this is no fairy tale authenticity in it, or creative act, nor humanity and artificial and historical entity of a man that is always present in the ethical effort of a true artist. So, we are investigating the conditions of creative images, modalities of audiovisual media in film language,and it is the archetype of the fairy tale, which, with its psychodynamics still exists and which is removed when the modern man is tired of lies and simulations during his global
Mining Contextual Information for Ephemeral Digital Video Preservation

Directory of Open Access Journals (Sweden)

Chirag Shah

2009-06-01

Full Text Available Normal 0 For centuries the archival community has understood and practiced the art of adding contextual information while preserving an artifact. The question now is how these practices can be transferred to the digital domain. With the growing expansion of production and consumption of digital objects (documents, audio, video, etc. it has become essential to identify and study issues related to their representation. A curator in the digital realm may be said to have the same responsibilities as one in a traditional archival domain. However, with the mass production and spread of digital objects, it may be difficult to do all the work manually. In the present article this problem is considered in the area of digital video preservation. We show how this problem can be formulated and propose a framework for capturing contextual information for ephemeral digital video preservation. This proposal is realized in a system called ContextMiner, which allows us to cater to a digital curator's needs with its four components: digital video curation, collection visualization, browsing interfaces, and video harvesting and monitoring. While the issues and systems described here are geared toward digital videos, they can easily be applied to other kinds of digital objects.
Semantically transparent fingerprinting for right protection of digital cinema

Science.gov (United States)

Wu, Xiaolin

2003-06-01

Digital cinema, a new frontier and crown jewel of digital multimedia, has the potential of revolutionizing the science, engineering and business of movie production and distribution. The advantages of digital cinema technology over traditional analog technology are numerous and profound. But without effective and enforceable copyright protection measures, digital cinema can be more susceptible to widespread piracy, which can dampen or even prevent the commercial deployment of digital cinema. In this paper we propose a novel approach of fingerprinting each individual distribution copy of a digital movie for the purpose of tracing pirated copies back to their source. The proposed fingerprinting technique presents a fundamental departure from the traditional digital watermarking/fingerprinting techniques. Its novelty and uniqueness lie in a so-called semantic or subjective transparency property. The fingerprints are created by editing those visual and audio attributes that can be modified with semantic and subjective transparency to the audience. Semantically-transparent fingerprinting or watermarking is the most robust kind among all existing watermarking techniques, because it is content-based not sample-based, and semantically-recoverable not statistically-recoverable.
Sounding ruins: reflections on the production of an ‘audio drift’

Science.gov (United States)

Gallagher, Michael

2014-01-01

This article is about the use of audio media in researching places, which I term ‘audio geography’. The article narrates some episodes from the production of an ‘audio drift’, an experimental environmental sound work designed to be listened to on a portable MP3 player whilst walking in a ruinous landscape. Reflecting on how this work functions, I argue that, as well as representing places, audio geography can shape listeners’ attention and bodily movements, thereby reworking places, albeit temporarily. I suggest that audio geography is particularly apt for amplifying the haunted and uncanny qualities of places. I discuss some of the issues raised for research ethics, epistemology and spectral geographies. PMID:29708107
Evidence that personal genome testing enhances student learning in a course on genomics and personalized medicine.

Directory of Open Access Journals (Sweden)

Keyan Salari

Full Text Available An emerging debate in academic medical centers is not about the need for providing trainees with fundamental education on genomics, but rather the most effective educational models that should be deployed. At Stanford School of Medicine, a novel hands-on genomics course was developed in 2010 that provided students the option to undergo personal genome testing as part of the course curriculum. We hypothesized that use of personal genome testing in the classroom would enhance the learning experience of students. No data currently exist on how such methods impact student learning; thus, we surveyed students before and after the course to determine its impact. We analyzed responses using paired statistics from the 31 medical and graduate students who completed both pre-course and post-course surveys. Participants were stratified by those who did (N = 23 or did not (N = 8 undergo personal genome testing. In reflecting on the experience, 83% of students who underwent testing stated that they were pleased with their decision compared to 12.5% of students who decided against testing (P = 0.00058. Seventy percent of those who underwent personal genome testing self-reported a better understanding of human genetics on the basis of having undergone testing. Further, students who underwent personal genome testing demonstrated an average 31% increase in pre- to post-course scores on knowledge questions (P = 3.5×10(-6; this was significantly higher (P = 0.003 than students who did not undergo testing, who showed a non-significant improvement. Undergoing personal genome testing and using personal genotype data in the classroom enhanced students' self-reported and assessed knowledge of genomics, and did not appear to cause significant anxiety. At least for self-selected students, the incorporation of personal genome testing can be an effective educational tool to teach important concepts of clinical genomic testing.
Generation of Binary Off-axis Digital Fresnel Hologram with Enhanced Quality

Directory of Open Access Journals (Sweden)

Peter Wai Ming Tsang

2015-06-01

Full Text Available The emergence of high resolution printer and digital micromirror device (DMD has enabled real, off-axis holograms to be printed, or projected onto a screen. As most printers and DMD can only reproduce binary dots, the pixels in a hologram have to be truncated to 2 levels. However, direct binarizing a hologram will lead to severe degradation on its reconstructed image. In this paper, a method for generating binary off-axis digital Fresnel hologram is reported. A hologram generated with the proposed method is referred to as the "Enhanced Sampled Binary Hologram" (ESBH. The reconstructed image of the ESBH is superior in visual quality as compare with the one obtained with existing technique, and also resistant to noise contamination.
Proposal for internet-based Digital Dental Chart for personal dental identification in forensics.

Science.gov (United States)

Hanaoka, Yoichi; Ueno, Asao; Tsuzuki, Tamiyuki; Kajiwara, Masahiro; Minaguchi, Kiyoshi; Sato, Yoshinobu

2007-05-03

A dental chart is very useful as a standard source of evidence in the personal identification of bodies. However, the kind of dental chart available will often vary as a number of types of odontogram have been developed where the visual representation of dental conditions has relied on hand-drawn representation. We propose the Digital Dental Chart (DDC) as a new style of dental chart, especially for open investigations aimed at establishing the identity of unknown bodies. Each DDC is constructed using actual oral digital images and dental data, and is easy to upload onto an Internet website. The DDC is a more useful forensic resource than the standard types of dental chart in current use as it has several advantages, among which are its ability to carry a large volume of information and reproduce dental conditions clearly and in detail on a cost-effective basis.
Localization of Digital Content for Use in Secondary Schools of Bangladesh

Science.gov (United States)

Chowdhury, Md. Didar; Al-Mahmood, Abdullah; Bashar, Md. Abul; Ahmed, Jamal Uddin

2011-01-01

Localization of digital content (LDC) in Bangladesh is aimed at the production of graphical and video content adapted for the local curriculum and suitable for use in secondary schools. Teachers learnt how to make PowerPoint presentations into which they can incorporate video, audio and graphical content downloaded from the Internet. Empowering…
IELTS speaking instruction through audio/voice conferencing

Directory of Open Access Journals (Sweden)

Hamed Ghaemi

2012-02-01

Full Text Available The currentstudyaimsatinvestigatingtheimpactofAudio/Voiceconferencing,asanewapproachtoteaching speaking, on the speakingperformanceand/orspeakingband score ofIELTScandidates.Experimentalgroupsubjectsparticipated in an audio conferencing classwhile those of the control group enjoyed attending in a traditional IELTS Speakingclass. At the endofthestudy,allsubjectsparticipatedinanIELTSExaminationheldonNovemberfourthin Tehran,Iran.To compare thegroupmeansforthestudy,anindependentt-testanalysiswasemployed.Thedifferencebetween experimental and control groupwasconsideredtobestatisticallysignificant(P<0.01.Thatisthecandidates in experimental group have outperformed the ones in control group in IELTS Speaking test scores.
A high efficiency PWM CMOS class-D audio power amplifier

Energy Technology Data Exchange (ETDEWEB)

Zhu Zhangming; Liu Lianxi; Yang Yintang [Institute of Microelectronics, Xidian University, Xi' an 710071 (China); Lei Han, E-mail: zmyh@263.ne [Xi' an Power-Rail Micro Co., Ltd, Xi' an 710075 (China)

2009-02-15

Based on the difference close-loop feedback technique and the difference pre-amp, a high efficiency PWM CMOS class-D audio power amplifier is proposed. A rail-to-rail PWM comparator with window function has been embedded in the class-D audio power amplifier. Design results based on the CSMC 0.5 mum CMOS process show that the max efficiency is 90%, the PSRR is -75 dB, the power supply voltage range is 2.5-5.5 V, the THD+N in 1 kHz input frequency is less than 0.20%, the quiescent current in no load is 2.8 mA, and the shutdown current is 0.5 muA. The active area of the class-D audio power amplifier is about 1.47 x 1.52 mm{sup 2}. With the good performance, the class-D audio power amplifier can be applied to several audio power systems.

A high efficiency PWM CMOS class-D audio power amplifier

International Nuclear Information System (INIS)

Zhu Zhangming; Liu Lianxi; Yang Yintang; Lei Han

2009-01-01

Based on the difference close-loop feedback technique and the difference pre-amp, a high efficiency PWM CMOS class-D audio power amplifier is proposed. A rail-to-rail PWM comparator with window function has been embedded in the class-D audio power amplifier. Design results based on the CSMC 0.5 μm CMOS process show that the max efficiency is 90%, the PSRR is -75 dB, the power supply voltage range is 2.5-5.5 V, the THD+N in 1 kHz input frequency is less than 0.20%, the quiescent current in no load is 2.8 mA, and the shutdown current is 0.5 μA. The active area of the class-D audio power amplifier is about 1.47 x 1.52 mm 2 . With the good performance, the class-D audio power amplifier can be applied to several audio power systems.
A high efficiency PWM CMOS class-D audio power amplifier

Science.gov (United States)

Zhangming, Zhu; Lianxi, Liu; Yintang, Yang; Han, Lei

2009-02-01

Based on the difference close-loop feedback technique and the difference pre-amp, a high efficiency PWM CMOS class-D audio power amplifier is proposed. A rail-to-rail PWM comparator with window function has been embedded in the class-D audio power amplifier. Design results based on the CSMC 0.5 μm CMOS process show that the max efficiency is 90%, the PSRR is -75 dB, the power supply voltage range is 2.5-5.5 V, the THD+N in 1 kHz input frequency is less than 0.20%, the quiescent current in no load is 2.8 mA, and the shutdown current is 0.5 μA. The active area of the class-D audio power amplifier is about 1.47 × 1.52 mm2. With the good performance, the class-D audio power amplifier can be applied to several audio power systems.
Contrast enhanced digital mammography: Is it useful in detecting lesions in edematous breast?

Directory of Open Access Journals (Sweden)

Noha Abd ElShafy ElSaid

2015-09-01

Conclusion: Dual-energy contrast-enhanced digital mammography is a useful technique in identification of lesions in mammographically dense edematous breasts and proved to be a useful tool in the follow-up of cases presenting by edema after conservative breast surgery and chemotherapy.
Dynamically-Loaded Hardware Libraries (HLL) Technology for Audio Applications

DEFF Research Database (Denmark)

Esposito, A.; Lomuscio, A.; Nunzio, L. Di

2016-01-01

In this work, we apply hardware acceleration to embedded systems running audio applications. We present a new framework, Dynamically-Loaded Hardware Libraries or HLL, to dynamically load hardware libraries on reconfigurable platforms (FPGAs). Provided a library of application-specific processors......, we load on-the-fly the specific processor in the FPGA, and we transfer the execution from the CPU to the FPGA-based accelerator. The proposed architecture provides excellent flexibility with respect to the different audio applications implemented, high quality audio, and an energy efficient solution....
A review of lossless audio compression standards and algorithms

Science.gov (United States)

Muin, Fathiah Abdul; Gunawan, Teddy Surya; Kartiwi, Mira; Elsheikh, Elsheikh M. A.

2017-09-01

Over the years, lossless audio compression has gained popularity as researchers and businesses has become more aware of the need for better quality and higher storage demand. This paper will analyse various lossless audio coding algorithm and standards that are used and available in the market focusing on Linear Predictive Coding (LPC) specifically due to its popularity and robustness in audio compression, nevertheless other prediction methods are compared to verify this. Advanced representation of LPC such as LSP decomposition techniques are also discussed within this paper.
WebGL and web audio software lightweight components for multimedia education

Science.gov (United States)

Chang, Xin; Yuksel, Kivanc; Skarbek, Władysław

2017-08-01

The paper presents the results of our recent work on development of contemporary computing platform DC2 for multimedia education usingWebGL andWeb Audio { the W3C standards. Using literate programming paradigm the WEBSA educational tools were developed. It offers for a user (student), the access to expandable collection of WEBGL Shaders and web Audio scripts. The unique feature of DC2 is the option of literate programming, offered for both, the author and the reader in order to improve interactivity to lightweightWebGL andWeb Audio components. For instance users can define: source audio nodes including synthetic sources, destination audio nodes, and nodes for audio processing such as: sound wave shaping, spectral band filtering, convolution based modification, etc. In case of WebGL beside of classic graphics effects based on mesh and fractal definitions, the novel image processing analysis by shaders is offered like nonlinear filtering, histogram of gradients, and Bayesian classifiers.
An Analogue Interface for Musical Robots

OpenAIRE

Long, Jason; Kapur, Ajay; Carnegie, Dale

2016-01-01

The majority of musical robotics performances, projects and installations utilise microcontroller hardware to digitally interface the robotic instruments with sequencer software and other musical controllers, often via a personal computer. While in many ways digital interfacing offers considerable power and flexibility, digital protocols, equipment and audio workstations often tend to suggest particular music-making work-flows and have resolution and timing limitations. This paper describes t...
A Cross-Disciplinary Successful Aging Intervention and Evaluation: Comparison of Person-to-Person and Digital-Assisted Approaches

Directory of Open Access Journals (Sweden)

Hui-Chuan Hsu

2018-05-01

Full Text Available Background: Successful aging has been the paradigm of old-age life. The purpose of this study was to implement and evaluate a cross-disciplinary intervention program using two approaches for community-based older adults in Taichung, Taiwan. Methods: The content of the intervention included successful aging concepts and preparation, physical activity, chronic disease and health management, dietary and nutrition information, cognitive training, emotional awareness and coping skills, family relationship and resilience, legal concepts regarding financial protection, and Internet use. The traditional person-to-person (P2P intervention approach was implemented among participants at urban centers, and the personal-and-digital (P&D intervention approach was implemented among participants at rural centers; before the P&D group received the intervention, participants were assessed as the control group for comparison. Results: Healthy behavior and nutrition improved for the P2P group, although not significantly. Strategies for adapting to old age and reducing ineffective coping were significantly improved in the P2P group. The ability to search for health information improved in the P&D group, and knowledge of finance-related law increased in the P2P group. Conclusion: A continuous, well-designed and evidence-based intervention program is beneficial for improving the health of older adults, or at least delaying its decline.
The Effects of Visual Illustrations on Learners' Achievement and Interest in PDA- (Personal Digital Assistant) Based Learning

Science.gov (United States)

Park, Sanghoon; Kim, Minjeong; Lee, Youngmin; Son, Chanhee; Lee, Miyoung

2005-01-01

PDAs (Personal Digital Assistants) have been used widely in educational settings. In this study, the visual illustration of a scientific text (cognitive-interest illustration, emotional-interest illustration, or no illustration) was manipulated to investigate its impact on student interest in instructional materials, achievement, and time spent on…
Using a Personal Digital Assistant to Increase Independent Task Completion by Students with Autism Spectrum Disorder

Science.gov (United States)

Mechling, Linda C.; Gast, David L.; Seid, Nicole H.

2009-01-01

In this study, a personal digital assistant (PDA) with picture, auditory, and video prompts with voice over, was evaluated as a portable self-prompting device for students with autism spectrum disorder (ASD). Using a multiple probe design across three cooking recipes and replicated with three students with ASD, the system was tested for its…
Mathematics Education ITE Students Examining the Value of Digital Learning Objects

Science.gov (United States)

Hawera Ngarewa; Wright, Noeline; Sharma, Sashi

2017-01-01

One issue in mathematics initial teacher education (ITE) is how to best support students to use digital technologies (DTs) to enhance their teaching of mathematics. While most ITE students are probably using DTs on a daily basis for personal use, they are often unfamiliar with using them for educative purposes in New Zealand primary school…
Class D audio amplifiers for high voltage capacitive transducers

DEFF Research Database (Denmark)

Nielsen, Dennis

of high volume, weight, and cost. High efficient class D amplifiers are now widely available offering power densities, that their linear counterparts can not match. Unlike the technology of audio amplifiers, the loudspeaker is still based on the traditional electrodynamic transducer invented by C.W. Rice......Audio reproduction systems contains two key components, the amplifier and the loudspeaker. In the last 20 – 30 years the technology of audio amplifiers have performed a fundamental shift of paradigm. Class D audio amplifiers have replaced the linear amplifiers, suffering from the well-known issues...... with the low level of acoustical output power and complex amplifier requirements, have limited the commercial success of the technology. Horn or compression drivers are typically favoured, when high acoustic output power is required, this is however at the expense of significant distortion combined...
Personality in speech assessment and automatic classification

CERN Document Server

Polzehl, Tim

2015-01-01

This work combines interdisciplinary knowledge and experience from research fields of psychology, linguistics, audio-processing, machine learning, and computer science. The work systematically explores a novel research topic devoted to automated modeling of personality expression from speech. For this aim, it introduces a novel personality assessment questionnaire and presents the results of extensive labeling sessions to annotate the speech data with personality assessments. It provides estimates of the Big 5 personality traits, i.e. openness, conscientiousness, extroversion, agreeableness, and neuroticism. Based on a database built on the questionnaire, the book presents models to tell apart different personality types or classes from speech automatically.
Current Issues and Trends in Multidimensional Sensing Technologies for Digital Media

Science.gov (United States)

Nagata, Noriko; Ohki, Hidehiro; Kato, Kunihito; Koshimizu, Hiroyasu; Sagawa, Ryusuke; Fujiwara, Takayuki; Yamashita, Atsushi; Hashimoto, Manabu

Multidimensional sensing (MDS) technologies have numerous applications in the field of digital media, including the development of audio and visual equipment for human-computer interaction (HCI) and manufacture of data storage devices; furthermore, MDS finds applications in the fields of medicine and marketing, i.e., in e-marketing and the development of diagnosis equipment.
StirMark Benchmark: audio watermarking attacks based on lossy compression

Science.gov (United States)

Steinebach, Martin; Lang, Andreas; Dittmann, Jana

2002-04-01

StirMark Benchmark is a well-known evaluation tool for watermarking robustness. Additional attacks are added to it continuously. To enable application based evaluation, in our paper we address attacks against audio watermarks based on lossy audio compression algorithms to be included in the test environment. We discuss the effect of different lossy compression algorithms like MPEG-2 audio Layer 3, Ogg or VQF on a selection of audio test data. Our focus is on changes regarding the basic characteristics of the audio data like spectrum or average power and on removal of embedded watermarks. Furthermore we compare results of different watermarking algorithms and show that lossy compression is still a challenge for most of them. There are two strategies for adding evaluation of robustness against lossy compression to StirMark Benchmark: (a) use of existing free compression algorithms (b) implementation of a generic lossy compression simulation. We discuss how such a model can be implemented based on the results of our tests. This method is less complex, as no real psycho acoustic model has to be applied. Our model can be used for audio watermarking evaluation of numerous application fields. As an example, we describe its importance for e-commerce applications with watermarking security.
Digital contrast enhancement of 18Fluorine-fluorodeoxyglucose positron emission tomography images in hepatocellular carcinoma

International Nuclear Information System (INIS)

Pandey, Anil Kumar; Sharma, Sanjay Kumar; Agarwal, Krishan Kant; Sharma, Punit; Bal, Chandrasekhar; Kumar, Rakesh

2016-01-01

The role of 18 fluorodeoxyglucose positron emission tomography (PET) is limited for detection of primary hepatocellular carcinoma (HCC) due to low contrast to the tumor, and normal hepatocytes (background). The aim of the present study was to improve the contrast between the tumor and background by standardizing the input parameters of a digital contrast enhancement technique. A transverse slice of PET image was adjusted for the best possible contrast, and saved in JPEG 2000 format. We processed this image with a contrast enhancement technique using 847 possible combinations of input parameters (threshold “m” and slope “e”). The input parameters which resulted in an image having a high value of 2 nd order entropy, and edge content, and low value of absolute mean brightness error, and saturation evaluation metrics, were considered as standardized input parameters. The same process was repeated for total nine PET-computed tomography studies, thus analyzing 7623 images. The selected digital contrast enhancement technique increased the contrast between the HCC tumor and background. In seven out of nine images, the standardized input parameters “m” had values between 150 and 160, and for other two images values were 138 and 175, respectively. The value of slope “e” was 4 in 4 images, 3 in 3 images and 1 in 2 images. It was found that it is important to optimize the input parameters for the best possible contrast for each image; a particular value was not sufficient for all the HCC images. The use of above digital contrast enhancement technique improves the tumor to background ratio in PET images of HCC and appears to be useful. Further clinical validation of this finding is warranted
AKTIVITAS SEKUNDER AUDIO UNTUK MENJAGA KEWASPADAAN PENGEMUDI MOBIL INDONESIA

Directory of Open Access Journals (Sweden)

Iftikar Zahedi Sutalaksana

2013-03-01

Full Text Available Tingkat kecelakaan lalu lintas yang melibatkan mobil di Indonesia semakin mengkhawatirkan. Tingginya peran faktor manusia sebagai penyebab utama kejadian kecelakaan patut diperhatikan. Penurunan kewaspadaan saat mengemudi akibat kantuk atau kelelahan merupakan salah satu kondisi yang mendorong terjadinya kecelakaan. Tulisan ini memaparkan aplikasi audio response test sebagai aktivitas sekunder dalam mengemudikan mobil. Response test yang dimaksud merupakan seperangkat aplikasi pada dashboard mobil yang menuntut respon pengemudi setiap stimulus suara bekerja. Audio response test ini diusulkan sebagai pemantau tingkat kewaspadaan pengemudi selama berkendara. Kewaspadaan pengemudi merupakan kondisi selama berkendara yang terjaga, awas, dan mampu memproses semua stimulus dengan baik. Hasil studi ini menghasilkan suatu bentuk audio response test yang terintegrasi dengan sistem berkendara di dalam mobil. Sumber bunyi diperdengarkan dengan intensitas konstan antara 80-85 dB. Bunyi akan berhenti jika pengemudi memberikan respon atas stimulus suara tersebut. Response test ini dirancang untuk mampu memantau tingkat kewaspadaan pengemudi selama berkendara. Penerapannya diharapkan mampu membantu menekan tingkat kecelakaan lalu lintas di Indonesia. Kata kunci: mengemudi, aktivitas sekunder, audio, kewaspadaan, response test Abstract The level of traffic accidents involving cars in Indonesia increasingly alarming. The high role of the human factor as the main cause of accident noteworthy. Decreased alertness while driving due to sleepiness or fatigue is one of the conditions that led to the accident. This paper describes an audio application response test as a secondary activity of driving a car. Response test is a set of applications on the dashboard of a car that demands a response driver each stimulus voice work. Audio response was proposed as test monitors the driver's level of alertness while driving. Vigilance driver was driving conditions during
Audio Networking in the Music Industry

Directory of Open Access Journals (Sweden)

Glebs Kuzmics

2018-01-01

Full Text Available This paper surveys the rôle of computer networking technologies in the music industry. A comparison of their relevant technologies, their defining advantages and disadvantages; analyses and discussion of the situation in the market of network enabled audio products followed by a discussion of different devices are presented. The idea of replacing a proprietary solution with open-source and freeware software programs has been chosen as the fundamental concept of this research. The technologies covered include: native IEEE AVnu Alliance Audio Video Bridging (AVB, CobraNet®, Audinate Dante™ and Harman BLU Link.
Frequency Hopping Method for Audio Watermarking

Directory of Open Access Journals (Sweden)

A. Anastasijević

2012-11-01

Full Text Available This paper evaluates the degradation of audio content for a perceptible removable watermark. Two different approaches to embedding the watermark in the spectral domain were investigated. The frequencies for watermark embedding are chosen according to a pseudorandom sequence making the methods robust. Consequentially, the lower quality audio can be used for promotional purposes. For a fee, the watermark can be removed with a secret watermarking key. Objective and subjective testing was conducted in order to measure degradation level for the watermarked music samples and to examine residual distortion for different parameters of the watermarking algorithm and different music genres.
Four-quadrant flyback converter for direct audio power amplification

DEFF Research Database (Denmark)

Ljusev, Petar; Andersen, Michael Andreas E.

2005-01-01

This paper presents a bidirectional, four-quadrant flyback converter for use in direct audio power amplification. When compared to the standard Class-D switching audio power amplifier with a separate power supply, the proposed four-quadrant flyback converter provides simple solution with better...

Audio Technology and Mobile Human Computer Interaction

DEFF Research Database (Denmark)

Chamberlain, Alan; Bødker, Mads; Hazzard, Adrian

2017-01-01

Audio-based mobile technology is opening up a range of new interactive possibilities. This paper brings some of those possibilities to light by offering a range of perspectives based in this area. It is not only the technical systems that are developing, but novel approaches to the design...... and understanding of audio-based mobile systems are evolving to offer new perspectives on interaction and design and support such systems to be applied in areas, such as the humanities....
Mobile video-to-audio transducer and motion detection for sensory substitution

Directory of Open Access Journals (Sweden)

Maxime eAmbard

2015-10-01

Full Text Available Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that despite a contrasted visual background and a highly lossy encoding method, the information in the audio signal is sufficient to allow object localization, object trajectory evaluation, object approach detection, and spatial separation of multiple objects. We also show that this type of audio signal can be interpreted by human users by asking ten subjects to discriminate trajectories based on generated audio signals.
Vroom: designing an augmented environment for remote collaboration in digital cinema production

Science.gov (United States)

Margolis, Todd; Cornish, Tracy

2013-03-01

As media technologies become increasingly affordable, compact and inherently networked, new generations of telecollaborative platforms continue to arise which integrate these new affordances. Virtual reality has been primarily concerned with creating simulations of environments that can transport participants to real or imagined spaces that replace the "real world". Meanwhile Augmented Reality systems have evolved to interleave objects from Virtual Reality environments into the physical landscape. Perhaps now there is a new class of systems that reverse this precept to enhance dynamic media landscapes and immersive physical display environments to enable intuitive data exploration through collaboration. Vroom (Virtual Room) is a next-generation reconfigurable tiled display environment in development at the California Institute for Telecommunications and Information Technology (Calit2) at the University of California, San Diego. Vroom enables freely scalable digital collaboratories, connecting distributed, high-resolution visualization resources for collaborative work in the sciences, engineering and the arts. Vroom transforms a physical space into an immersive media environment with large format interactive display surfaces, video teleconferencing and spatialized audio built on a highspeed optical network backbone. Vroom enables group collaboration for local and remote participants to share knowledge and experiences. Possible applications include: remote learning, command and control, storyboarding, post-production editorial review, high resolution video playback, 3D visualization, screencasting and image, video and multimedia file sharing. To support these various scenarios, Vroom features support for multiple user interfaces (optical tracking, touch UI, gesture interface, etc.), support for directional and spatialized audio, giga-pixel image interactivity, 4K video streaming, 3D visualization and telematic production. This paper explains the design process that
Audio power amplifier techniques with energy efficient power conversion. Vol. 1

Energy Technology Data Exchange (ETDEWEB)

Nielsen, Karsten

1998-04-01

A fundamental study of both analog and digital pulse modulation methods is carried out. A novel class of multi-level pulse modulation methods - Phase Shifted Carrier Pulse Width Modulation (PSCPWM) - is introduced and show to have several advantageous features, primarily caused by the much improved synthesis of the modulating signal. Enhanced digital pulse modulation methods for digital Pulse Modulation Amplifier (PMA) systems are investigated, and a simple methodology for digital PWM modulator synthesis is devised. It is concluded, that the modulator performance is not a limitation in the system, regardless of the domain of modulator implementation. Power conversion in PMA systems is adressed from the perspective of both linearity and efficienty optimization. Based on detailed studies of the distortion mechanisms in the power conversion stage it is concluded, that this is the fundamental limitation on system performance due to several physical limitations. The analysis of general power stage efficiency concludes that dramatic improvements in energy efficiency are possible with PMA systems that are optimized for efficiency. A control system design methodology is devised as a platform for synthesis of robust control systems. Investigations of three fundamental control structures show that even simple control systems offer a remarkable value, although the considered topologies also have their limitations which is verified by practical evaluation in hardware. A novel control method is introduced - Multivariable Enhanced Cascade Control (MECC). MECC provides flexible control over all essential system parameters and is furthermore simple in realization. Practical evaluation of a MECC based PMA shows state-of-the-art performance. The application of non-linear control methods is investigated with the introduction of an enhanced non-linear control/modulator topology. Although the non-linear controller is theoretically interesting, the method proves to suffer from various
Unsupervised topic modelling on South African parliament audio data

CSIR Research Space (South Africa)

Kleynhans, N

2014-11-01

Full Text Available Using a speech recognition system to convert spoken audio to text can enable the structuring of large collections of spoken audio data. A convenient means to summarise or cluster spoken data is to identify the topic under discussion. There are many...
Classifying laughter and speech using audio-visual feature prediction

NARCIS (Netherlands)

Petridis, Stavros; Asghar, Ali; Pantic, Maja

2010-01-01

In this study, a system that discriminates laughter from speech by modelling the relationship between audio and visual features is presented. The underlying assumption is that this relationship is different between speech and laughter. Neural networks are trained which learn the audio-to-visual and
Analytical Features: A Knowledge-Based Approach to Audio Feature Generation

Directory of Open Access Journals (Sweden)

Pachet François

2009-01-01

Full Text Available We present a feature generation system designed to create audio features for supervised classification tasks. The main contribution to feature generation studies is the notion of analytical features (AFs, a construct designed to support the representation of knowledge about audio signal processing. We describe the most important aspects of AFs, in particular their dimensional type system, on which are based pattern-based random generators, heuristics, and rewriting rules. We show how AFs generalize or improve previous approaches used in feature generation. We report on several projects using AFs for difficult audio classification tasks, demonstrating their advantage over standard audio features. More generally, we propose analytical features as a paradigm to bring raw signals into the world of symbolic computation.
Digitized molecular diagnostics: reading disk-based bioassays with standard computer drives.

Science.gov (United States)

Li, Yunchao; Ou, Lily M L; Yu, Hua-Zhong

2008-11-01

We report herein a digital signal readout protocol for screening disk-based bioassays with standard optical drives of ordinary desktop/notebook computers. Three different types of biochemical recognition reactions (biotin-streptavidin binding, DNA hybridization, and protein-protein interaction) were performed directly on a compact disk in a line array format with the help of microfluidic channel plates. Being well-correlated with the optical darkness of the binding sites (after signal enhancement by gold nanoparticle-promoted autometallography), the reading error levels of prerecorded audio files can serve as a quantitative measure of biochemical interaction. This novel readout protocol is about 1 order of magnitude more sensitive than fluorescence labeling/scanning and has the capability of examining multiplex microassays on the same disk. Because no modification to either hardware or software is needed, it promises a platform technology for rapid, low-cost, and high-throughput point-of-care biomedical diagnostics.
ENERGY STAR Certified Audio Video

Data.gov (United States)

U.S. Environmental Protection Agency — Certified models meet all ENERGY STAR requirements as listed in the Version 3.0 ENERGY STAR Program Requirements for Audio Video Equipment that are effective as of...
Television and the Internet: The Role Digital Technologies Play in Adolescents’ Audio-Visual Media Consumption. Young Television Audiences in Catalonia (Spain

Directory of Open Access Journals (Sweden)

Meritxell Roca

2014-03-01

Full Text Available The aim of this reported study was to investigate adolescents TV consumption habits and perceptions. Although there appears to be no general consensus on how the Internet affects TV consumption by teenagers, and data vary depending on the country, according to our study, Spanish adolescents perceive television as a habit “of the past” and find the computer a device more suited to their recreational and audio-visual consumption needs. The data obtained from eight focus groups of teenagers aged between 12 and 18 and an online survey sent to their parents show that watching TV is an activity usually linked to the home’s communal spaces. On the contrary, online audio-visual consumption (understood as a wider term not limited to just TV shows is perceived by adolescents as a more convenient activity as it adapts to their own schedules and needs.
Digital Video as a Personalized Learning Assignment: A Qualitative Study of Student Authored Video Using the ICSDR Model

Science.gov (United States)

Campbell, Laurie O.; Cox, Thomas D.

2018-01-01

Students within this study followed the ICSDR (Identify, Conceptualize/Connect, Storyboard, Develop, Review/Reflect/Revise) development model to create digital video, as a personalized and active learning assignment. The participants, graduate students in education, indicated that following the ICSDR framework for student-authored video guided…
TECHNICAL NOTE: Portable audio electronics for impedance-based measurements in microfluidics

Science.gov (United States)

Wood, Paul; Sinton, David

2010-08-01

We demonstrate the use of audio electronics-based signals to perform on-chip electrochemical measurements. Cell phones and portable music players are examples of consumer electronics that are easily operated and are ubiquitous worldwide. Audio output (play) and input (record) signals are voltage based and contain frequency and amplitude information. A cell phone, laptop soundcard and two compact audio players are compared with respect to frequency response; the laptop soundcard provides the most uniform frequency response, while the cell phone performance is found to be insufficient. The audio signals in the common portable music players and laptop soundcard operate in the range of 20 Hz to 20 kHz and are found to be applicable, as voltage input and output signals, to impedance-based electrochemical measurements in microfluidic systems. Validated impedance-based measurements of concentration (0.1-50 mM), flow rate (2-120 µL min-1) and particle detection (32 µm diameter) are demonstrated. The prevailing, lossless, wave audio file format is found to be suitable for data transmission to and from external sources, such as a centralized lab, and the cost of all hardware (in addition to audio devices) is ~10 USD. The utility demonstrated here, in combination with the ubiquitous nature of portable audio electronics, presents new opportunities for impedance-based measurements in portable microfluidic systems.
Class-D audio amplifiers with negative feedback

OpenAIRE

Cox, Stephen M.; Candy, B. H.

2006-01-01

There are many different designs for audio amplifiers. Class-D, or switching, amplifiers generate their output signal in the form of a high-frequency square wave of variable duty cycle (ratio of on time to off time). The square-wave nature of the output allows a particularly efficient output stage, with minimal losses. The output is ultimately filtered to remove components of the spectrum above the audio range. Mathematical models are derived here for a variety of related class-D amplifier de...
A second-order class-D audio amplifier

OpenAIRE

Cox, Stephen M.; Tan, M.T.; Yu, J.

2011-01-01

Class-D audio amplifiers are particularly efficient, and this efficiency has led to their ubiquity in a wide range of modern electronic appliances. Their output takes the form of a high-frequency square wave whose duty cycle (ratio of on-time to off-time) is modulated at low frequency according to the audio signal. A mathematical model is developed here for a second-order class-D amplifier design (i.e., containing one second-order integrator) with negative feedback. We derive exact expression...
Documentary management of the sport audio-visual information in the generalist televisions

OpenAIRE

Jorge Caldera Serrano; Felipe Alonso

2007-01-01

The management of the sport audio-visual documentation of the Information Systems of the state, zonal and local chains is analyzed within the framework. For it it is made makes a route by the documentary chain that makes the sport audio-visual information with the purpose of being analyzing each one of the parameters, showing therefore a series of recommendations and norms for the preparation of the sport audio-visual registry. Evidently the audio-visual sport documentation difference i...
Multi Carrier Modulation Audio Power Amplifier with Programmable Logic

DEFF Research Database (Denmark)

Christiansen, Theis; Andersen, Toke Meyer; Knott, Arnold

2009-01-01

While switch-mode audio power amplifiers allow compact implementations and high output power levels due to their high power efficiency, they are very well known for creating electromagnetic interference (EMI) with other electronic equipment. To lower the EMI of switch-mode (class D) audio power a...
Perancangan Sistem Audio Mobil Berbasiskan Sistem Pakar dan Web

Directory of Open Access Journals (Sweden)

Djunaidi Santoso

2011-12-01

Full Text Available Designing car audio that fits user’s needs is a fun activity. However, the design often consumes more time and costly since it should be consulted to the experts several times. For easy access to information in designing a car audio system as well as error prevention, an car audio system based on expert system and web is designed for those who do not have sufficient time and expense to consult directly to experts. This system consists of tutorial modules designed using the HyperText Preprocessor (PHP and MySQL as database. This car audio system design is evaluated uses black box testing method which focuses on the functional needs of the application. Tests are performed by providing inputs and produce outputs corresponding to the function of each module. The test results prove the correspondence between input and output, which means that the program meet the initial goals of the design.
A Psychoacoustic-Based Multiple Audio Object Coding Approach via Intra-Object Sparsity

Directory of Open Access Journals (Sweden)

Maoshen Jia

2017-12-01

Full Text Available Rendering spatial sound scenes via audio objects has become popular in recent years, since it can provide more flexibility for different auditory scenarios, such as 3D movies, spatial audio communication and virtual classrooms. To facilitate high-quality bitrate-efficient distribution for spatial audio objects, an encoding scheme based on intra-object sparsity (approximate k-sparsity of the audio object itself is proposed in this paper. The statistical analysis is presented to validate the notion that the audio object has a stronger sparseness in the Modified Discrete Cosine Transform (MDCT domain than in the Short Time Fourier Transform (STFT domain. By exploiting intra-object sparsity in the MDCT domain, multiple simultaneously occurring audio objects are compressed into a mono downmix signal with side information. To ensure a balanced perception quality of audio objects, a Psychoacoustic-based time-frequency instants sorting algorithm and an energy equalized Number of Preserved Time-Frequency Bins (NPTF allocation strategy are proposed, which are employed in the underlying compression framework. The downmix signal can be further encoded via Scalar Quantized Vector Huffman Coding (SQVH technique at a desirable bitrate, and the side information is transmitted in a lossless manner. Both objective and subjective evaluations show that the proposed encoding scheme outperforms the Sparsity Analysis (SPA approach and Spatial Audio Object Coding (SAOC in cases where eight objects were jointly encoded.
Impact of audio-visual storytelling in simulation learning experiences of undergraduate nursing students.

Science.gov (United States)

Johnston, Sandra; Parker, Christina N; Fox, Amanda

2017-09-01

Use of high fidelity simulation has become increasingly popular in nursing education to the extent that it is now an integral component of most nursing programs. Anecdotal evidence suggests that students have difficulty engaging with simulation manikins due to their unrealistic appearance. Introduction of the manikin as a 'real patient' with the use of an audio-visual narrative may engage students in the simulated learning experience and impact on their learning. A paucity of literature currently exists on the use of audio-visual narratives to enhance simulated learning experiences. This study aimed to determine if viewing an audio-visual narrative during a simulation pre-brief altered undergraduate nursing student perceptions of the learning experience. A quasi-experimental post-test design was utilised. A convenience sample of final year baccalaureate nursing students at a large metropolitan university. Participants completed a modified version of the Student Satisfaction with Simulation Experiences survey. This 12-item questionnaire contained questions relating to the ability to transfer skills learned in simulation to the real clinical world, the realism of the simulation and the overall value of the learning experience. Descriptive statistics were used to summarise demographic information. Two tailed, independent group t-tests were used to determine statistical differences within the categories. Findings indicated that students reported high levels of value, realism and transferability in relation to the viewing of an audio-visual narrative. Statistically significant results (t=2.38, psimulation to clinical practice. The subgroups of age and gender although not significant indicated some interesting results. High satisfaction with simulation was indicated by all students in relation to value and realism. There was a significant finding in relation to transferability on knowledge and this is vital to quality educational outcomes. Copyright © 2017. Published by
BAT: An open-source, web-based audio events annotation tool

OpenAIRE

Blai Meléndez-Catalan, Emilio Molina, Emilia Gómez

2017-01-01

In this paper we present BAT (BMAT Annotation Tool), an open-source, web-based tool for the manual annotation of events in audio recordings developed at BMAT (Barcelona Music and Audio Technologies). The main feature of the tool is that it provides an easy way to annotate the salience of simultaneous sound sources. Additionally, it allows to define multiple ontologies to adapt to multiple tasks and offers the possibility to cross-annotate audio data. Moreover, it is easy to install and deploy...

On the Use of Memory Models in Audio Features

DEFF Research Database (Denmark)

Jensen, Karl Kristoffer

2011-01-01

Audio feature estimation is potentially improved by including higher- level models. One such model is the Short Term Memory (STM) model. A new paradigm of audio feature estimation is obtained by adding the influence of notes in the STM. These notes are identified when the perceptual spectral flux...
Audio Teleconferencing: Low Cost Technology for External Studies Networking.

Science.gov (United States)

Robertson, Bill

1987-01-01

This discussion of the benefits of audio teleconferencing for distance education programs and for business and government applications focuses on the recent experience of Canadian educational users. Four successful operating models and their costs are reviewed, and it is concluded that audio teleconferencing is cost efficient and educationally…
Automatic Organisation and Quality Analysis of User-Generated Content with Audio Fingerprinting

OpenAIRE

Cavaco, Sofia; Magalhaes, Joao; Mordido, Gonçalo

2018-01-01

The increase of the quantity of user-generated content experienced in social media has boosted the importance of analysing and organising the content by its quality. Here, we propose a method that uses audio fingerprinting to organise and infer the quality of user-generated audio content. The proposed method detects the overlapping segments between different audio clips to organise and cluster the data according to events, and to infer the audio quality of the samples. A test setup with conce...
Real-Time Audio Processing on the T-CREST Multicore Platform

DEFF Research Database (Denmark)

Ausin, Daniel Sanz; Pezzarossa, Luca; Schoeberl, Martin

2017-01-01

of the audio signal. This paper presents a real-time multicore audio processing system based on the T-CREST platform. T-CREST is a time-predictable multicore processor for real-time embedded systems. Multiple audio effect tasks have been implemented, which can be connected together in different configurations...... forming sequential and parallel effect chains, and using a network-onchip for intercommunication between processors. The evaluation of the system shows that real-time processing of multiple effect configurations is possible, and that the estimation and control of latency ensures real-time behavior.......Multicore platforms are nowadays widely used for audio processing applications, due to the improvement of computational power that they provide. However, some of these systems are not optimized for temporally constrained environments, which often leads to an undesired increase in the latency...
Girls, identities and agency in adolescents' digital literacy practices

Directory of Open Access Journals (Sweden)

Vassiliki Adampa

2012-03-01

Full Text Available This paper focuses on the ways girls use digital environments, like Word, PowerPoint and chatting programmes, for writing and communication purposes. By combining quantitative and qualitative methods of analysis and by adopting a critical discourse framework, we will explore the relationship between girls and new media, especially the ones related to digital writing, in terms of three interconnected variables. The first one is related to the role of the two most important socialisation institutions, home and school, at the present historical juncture, characterised by intense mobility and an expansion of traditional forms of literacy. The strategic choices of the girls' families and their schools' teaching practices contributed significantly to the formulation of their digital writing practices. The second variable is gender. Our data clearly show that a substantial number of girls were more inclined than their male peers to use word-processing and presentation software, performing, thus, the school discourses of 'diligent students'. The third key variable concerns the personality of the girls who filtered in their own unique ways their social experiences, overcame limitations, took initiatives and appropriated technologically-mediated writing media for personally meaningful ends that enhanced their school and/or entertainment Discourses.
The Effect of Audio and Animation in Multimedia Instruction

Science.gov (United States)

Koroghlanian, Carol; Klein, James D.

2004-01-01

This study investigated the effects of audio, animation, and spatial ability in a multimedia computer program for high school biology. Participants completed a multimedia program that presented content by way of text or audio with lean text. In addition, several instructional sequences were presented either with static illustrations or animations.…
Selected Audio-Visual Materials for Consumer Education. [New Version.

Science.gov (United States)

Johnston, William L.

Ninety-two films, filmstrips, multi-media kits, slides, and audio cassettes, produced between 1964 and 1974, are listed in this selective annotated bibliography on consumer education. The major portion of the bibliography is devoted to films and filmstrips. The main topics of the audio-visual materials include purchasing, advertising, money…
Audio-visual temporal recalibration can be constrained by content cues regardless of spatial overlap

Directory of Open Access Journals (Sweden)

Warrick eRoseboom

2013-04-01

Full Text Available It has now been well established that the point of subjective synchrony for audio and visual events can be shifted following exposure to asynchronous audio-visual presentations, an effect often referred to as temporal recalibration. Recently it was further demonstrated that it is possible to concurrently maintain two such recalibrated, and opposing, estimates of audio-visual temporal synchrony. However, it remains unclear precisely what defines a given audio-visual pair such that it is possible to maintain a temporal relationship distinct from other pairs. It has been suggested that spatial separation of the different audio-visual pairs is necessary to achieve multiple distinct audio-visual synchrony estimates. Here we investigated if this was necessarily true. Specifically, we examined whether it is possible to obtain two distinct temporal recalibrations for stimuli that differed only in featural content. Using both complex (audio visual speech; Experiment 1 and simple stimuli (high and low pitch audio matched with either vertically or horizontally oriented Gabors; Experiment 2 we found concurrent, and opposite, recalibrations despite there being no spatial difference in presentation location at any point throughout the experiment. This result supports the notion that the content of an audio-visual pair can be used to constrain distinct audio-visual synchrony estimates regardless of spatial overlap.
An integrated audio-visual impact tool for wind turbine installations

International Nuclear Information System (INIS)

Lymberopoulos, N.; Belessis, M.; Wood, M.; Voutsinas, S.

1996-01-01

An integrated software tool was developed for the design of wind parks that takes into account their visual and audio impact. The application is built on a powerful hardware platform and is fully operated through a graphic user interface. The topography, the wind turbines and the daylight conditions are realised digitally. The wind park can be animated in real time and the user can take virtual walks in it while the set-up of the park can be altered interactively. In parallel, the wind speed levels on the terrain, the emitted noise intensity, the annual energy output and the cash flow can be estimated at any stage of the session and prompt the user for rearrangements. The tool has been used to visually simulate existing wind parks in St. Breok, UK and Andros Island, Greece. The results lead to the conclusion that such a tool can assist to the public acceptance and licensing procedures of wind parks. (author)
Effect of Audio Coaching on Correlation of Abdominal Displacement With Lung Tumor Motion

International Nuclear Information System (INIS)

Nakamura, Mitsuhiro; Narita, Yuichiro; Matsuo, Yukinori; Narabayashi, Masaru; Nakata, Manabu; Sawada, Akira; Mizowaki, Takashi; Nagata, Yasushi; Hiraoka, Masahiro

2009-01-01

Purpose: To assess the effect of audio coaching on the time-dependent behavior of the correlation between abdominal motion and lung tumor motion and the corresponding lung tumor position mismatches. Methods and Materials: Six patients who had a lung tumor with a motion range >8 mm were enrolled in the present study. Breathing-synchronized fluoroscopy was performed initially without audio coaching, followed by fluoroscopy with recorded audio coaching for multiple days. Two different measurements, anteroposterior abdominal displacement using the real-time positioning management system and superoinferior (SI) lung tumor motion by X-ray fluoroscopy, were performed simultaneously. Their sequential images were recorded using one display system. The lung tumor position was automatically detected with a template matching technique. The relationship between the abdominal and lung tumor motion was analyzed with and without audio coaching. Results: The mean SI tumor displacement was 10.4 mm without audio coaching and increased to 23.0 mm with audio coaching (p < .01). The correlation coefficients ranged from 0.89 to 0.97 with free breathing. Applying audio coaching, the correlation coefficients improved significantly (range, 0.93-0.99; p < .01), and the SI lung tumor position mismatches became larger in 75% of all sessions. Conclusion: Audio coaching served to increase the degree of correlation and make it more reproducible. In addition, the phase shifts between tumor motion and abdominal displacement were improved; however, all patients breathed more deeply, and the SI lung tumor position mismatches became slightly larger with audio coaching than without audio coaching.
Auditory and audio-visual processing in patients with cochlear, auditory brainstem, and auditory midbrain implants: An EEG study.

Science.gov (United States)

Schierholz, Irina; Finke, Mareike; Kral, Andrej; Büchner, Andreas; Rach, Stefan; Lenarz, Thomas; Dengler, Reinhard; Sandmann, Pascale

2017-04-01

There is substantial variability in speech recognition ability across patients with cochlear implants (CIs), auditory brainstem implants (ABIs), and auditory midbrain implants (AMIs). To better understand how this variability is related to central processing differences, the current electroencephalography (EEG) study compared hearing abilities and auditory-cortex activation in patients with electrical stimulation at different sites of the auditory pathway. Three different groups of patients with auditory implants (Hannover Medical School; ABI: n = 6, CI: n = 6; AMI: n = 2) performed a speeded response task and a speech recognition test with auditory, visual, and audio-visual stimuli. Behavioral performance and cortical processing of auditory and audio-visual stimuli were compared between groups. ABI and AMI patients showed prolonged response times on auditory and audio-visual stimuli compared with NH listeners and CI patients. This was confirmed by prolonged N1 latencies and reduced N1 amplitudes in ABI and AMI patients. However, patients with central auditory implants showed a remarkable gain in performance when visual and auditory input was combined, in both speech and non-speech conditions, which was reflected by a strong visual modulation of auditory-cortex activation in these individuals. In sum, the results suggest that the behavioral improvement for audio-visual conditions in central auditory implant patients is based on enhanced audio-visual interactions in the auditory cortex. Their findings may provide important implications for the optimization of electrical stimulation and rehabilitation strategies in patients with central auditory prostheses. Hum Brain Mapp 38:2206-2225, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
All-Digital Time-Domain CMOS Smart Temperature Sensor with On-Chip Linearity Enhancement.

Science.gov (United States)

Chen, Chun-Chi; Chen, Chao-Lieh; Lin, Yi

2016-01-30

This paper proposes the first all-digital on-chip linearity enhancement technique for improving the accuracy of the time-domain complementary metal-oxide semiconductor (CMOS) smart temperature sensor. To facilitate on-chip application and intellectual property reuse, an all-digital time-domain smart temperature sensor was implemented using 90 nm Field Programmable Gate Arrays (FPGAs). Although the inverter-based temperature sensor has a smaller circuit area and lower complexity, two-point calibration must be used to achieve an acceptable inaccuracy. With the help of a calibration circuit, the influence of process variations was reduced greatly for one-point calibration support, reducing the test costs and time. However, the sensor response still exhibited a large curvature, which substantially affected the accuracy of the sensor. Thus, an on-chip linearity-enhanced circuit is proposed to linearize the curve and achieve a new linearity-enhanced output. The sensor was implemented on eight different Xilinx FPGA using 118 slices per sensor in each FPGA to demonstrate the benefits of the linearization. Compared with the unlinearized version, the maximal inaccuracy of the linearized version decreased from 5 °C to 2.5 °C after one-point calibration in a range of -20 °C to 100 °C. The sensor consumed 95 μW using 1 kSa/s. The proposed linearity enhancement technique significantly improves temperature sensing accuracy, avoiding costly curvature compensation while it is fully synthesizable for future Very Large Scale Integration (VLSI) system.
Four-quadrant flyback converter for direct audio power amplification

OpenAIRE

Ljusev, Petar; Andersen, Michael Andreas E.

2005-01-01

This paper presents a bidirectional, four-quadrant flyback converter for use in direct audio power amplification. When compared to the standard Class-D switching audio power amplifier with a separate power supply, the proposed four-quadrant flyback converter provides simple solution with better efficiency, higher level of integration and lower component count.
Selective attention modulates the direction of audio-visual temporal recalibration.

Science.gov (United States)

Ikumi, Nara; Soto-Faraco, Salvador

2014-01-01

Temporal recalibration of cross-modal synchrony has been proposed as a mechanism to compensate for timing differences between sensory modalities. However, far from the rich complexity of everyday life sensory environments, most studies to date have examined recalibration on isolated cross-modal pairings. Here, we hypothesize that selective attention might provide an effective filter to help resolve which stimuli are selected when multiple events compete for recalibration. We addressed this question by testing audio-visual recalibration following an adaptation phase where two opposing audio-visual asynchronies were present. The direction of voluntary visual attention, and therefore to one of the two possible asynchronies (flash leading or flash lagging), was manipulated using colour as a selection criterion. We found a shift in the point of subjective audio-visual simultaneity as a function of whether the observer had focused attention to audio-then-flash or to flash-then-audio groupings during the adaptation phase. A baseline adaptation condition revealed that this effect of endogenous attention was only effective toward the lagging flash. This hints at the role of exogenous capture and/or additional endogenous effects producing an asymmetry toward the leading flash. We conclude that selective attention helps promote selected audio-visual pairings to be combined and subsequently adjusted in time but, stimulus organization exerts a strong impact on recalibration. We tentatively hypothesize that the resolution of recalibration in complex scenarios involves the orchestration of top-down selection mechanisms and stimulus-driven processes.
Selective attention modulates the direction of audio-visual temporal recalibration.

Directory of Open Access Journals (Sweden)

Nara Ikumi

Full Text Available Temporal recalibration of cross-modal synchrony has been proposed as a mechanism to compensate for timing differences between sensory modalities. However, far from the rich complexity of everyday life sensory environments, most studies to date have examined recalibration on isolated cross-modal pairings. Here, we hypothesize that selective attention might provide an effective filter to help resolve which stimuli are selected when multiple events compete for recalibration. We addressed this question by testing audio-visual recalibration following an adaptation phase where two opposing audio-visual asynchronies were present. The direction of voluntary visual attention, and therefore to one of the two possible asynchronies (flash leading or flash lagging, was manipulated using colour as a selection criterion. We found a shift in the point of subjective audio-visual simultaneity as a function of whether the observer had focused attention to audio-then-flash or to flash-then-audio groupings during the adaptation phase. A baseline adaptation condition revealed that this effect of endogenous attention was only effective toward the lagging flash. This hints at the role of exogenous capture and/or additional endogenous effects producing an asymmetry toward the leading flash. We conclude that selective attention helps promote selected audio-visual pairings to be combined and subsequently adjusted in time but, stimulus organization exerts a strong impact on recalibration. We tentatively hypothesize that the resolution of recalibration in complex scenarios involves the orchestration of top-down selection mechanisms and stimulus-driven processes.
Resolution enhancement in digital holography by self-extrapolation of holograms.

Science.gov (United States)

Latychevskaia, Tatiana; Fink, Hans-Werner

2013-03-25

It is generally believed that the resolution in digital holography is limited by the size of the captured holographic record. Here, we present a method to circumvent this limit by self-extrapolating experimental holograms beyond the area that is actually captured. This is done by first padding the surroundings of the hologram and then conducting an iterative reconstruction procedure. The wavefront beyond the experimentally detected area is thus retrieved and the hologram reconstruction shows enhanced resolution. To demonstrate the power of this concept, we apply it to simulated as well as experimental holograms.
Digital innovations and emerging technologies for enhanced recovery programmes

DEFF Research Database (Denmark)

Michard, F; Gan, T J; Kehlet, H

2017-01-01

Enhanced recovery programmes (ERPs) are increasingly used to improve post-surgical recovery. However, compliance to various components of ERPs-a key determinant of success-remains sub-optimal. Emerging technologies have the potential to help patients and caregivers to improve compliance with ERPs...... of the above-mentioned ERP elements is omitted during the surgical journey.By optimizing compliance to the multiple components of ERPs, digital innovations, non-invasive techniques and wearable sensors have the potential to magnify the clinical and economic benefits of ERPs. Among the growing number...... of technical innovations, studies are needed to clarify which tools and solutions have real clinical value and are cost-effective....
Self-oscillating modulators for direct energy conversion audio power amplifiers

Energy Technology Data Exchange (ETDEWEB)

Ljusev, P.; Andersen, Michael A.E.

2005-07-01

Direct energy conversion audio power amplifier represents total integration of switching-mode power supply and Class D audio power amplifier into one compact stage, achieving high efficiency, high level of integration, low component count and eventually low cost. This paper presents how self-oscillating modulators can be used with the direct switching-mode audio power amplifier to improve its performance by providing fast hysteretic control with high power supply rejection ratio, open-loop stability and high bandwidth. Its operation is thoroughly analyzed and simulated waveforms of a prototype amplifier are presented. (au)
Rehabilitation of balance-impaired stroke patients through audio-visual biofeedback

DEFF Research Database (Denmark)

Gheorghe, Cristina; Nissen, Thomas; Juul Rosengreen Christensen, Daniel

2015-01-01

This study explored how audio-visual biofeedback influences physical balance of seven balance-impaired stroke patients, between 33–70 years-of-age. The setup included a bespoke balance board and a music rhythm game. The procedure was designed as follows: (1) a control group who performed a balance...... training exercise without any technological input, (2) a visual biofeedback group, performing via visual input, and (3) an audio-visual biofeedback group, performing via audio and visual input. Results retrieved from comparisons between the data sets (2) and (3) suggested superior postural stability...
Two-way digital communications

Science.gov (United States)

Glenn, William E.; Daly, Ed

1996-03-01

The communications industry has been rapidly converting from analog to digital communications for audio, video, and data. The initial applications have been concentrating on point-to-multipoint transmission. Currently, a new revolution is occurring in which two-way point-to-point transmission is a rapidly growing market. The system designs for video compression developed for point-to-multipoint transmission are unsuitable for this new market as well as for satellite based video encoding. A new system developed by the Space Communications Technology Center has been designed to address both of these newer applications. An update on the system performance and design will be given.

Field evaluation of personal digital assistant enabled by global positioning system : impact on quality of activity and diary data

NARCIS (Netherlands)

Bellemans, T.; Kochan, B.; Janssens, D.; Wets, G.; Timmermans, H.J.P.; Stopher, P.

2016-01-01

Tom Bellemans, Bruno Kochan, Davy Janssens, Geert Wets and Harry Timmermans (2008), ‘Field Evaluation of Personal Digital Assistant Enabled by Global Positioning System: Impact on Quality of Activity and Diary Data’, Transportation Research Record: Journal of the Transportation Research Board, No.
Professional and personal enhancement: a pragmatic approach in dental education.

Science.gov (United States)

Deivanayagam, Kandaswamy; K, Anbarasi

2016-06-01

Students of health education are often offended by the transitions and challenges they face while encountering diverse people, ideas and academic workloads. They may be offended because of reasons not only related to their societal background but also to their basic competence in managing transitions. In the Asian scenario, students enter the first year of professional education in their late teen age along with the definition of self which was created by their parents. There are different issues that arise in this age group that may positively shape or negatively affect the personalities of students. They need to achieve a sense of balance between personal and professional traits on their own. Several students are often unable to cultivate the expected required qualities, which leads to an abject state of mind and hinder their progress. We identified the most common personal and professional hurdles in the lives of dental students and we provided experiential solutions to overcome the hurdles by using a sociable approach through an integrated, continuing education program. Designing and implementing a cohesive, amalgamated and inspiring personal and professional enhancement action program for dental students. Feedback from students reflected that the needs and expectations of students vary with academic phase. In addition students expressed that this program series inculcated some positive skills, and overall, they are satisfied with the utility of the program. Personal and professional enhancement of students in accordance with individual needs as well as with expected requirements needs a committed administrative action plan. Our results in this context are encouraging and can be considered for application in dental institutions.
A novel digital workflow to manufacture personalized three-dimensional-printed hollow surgical obturators after maxillectomy.

Science.gov (United States)

Kortes, J; Dehnad, H; Kotte, A N T; Fennis, W M M; Rosenberg, A J W P

2018-04-07

Partial or complete resection of the maxilla during tumour surgery causes oronasal defects, leading to oral-maxillofacial dysfunction, for which the surgical obturator (SO) is an important treatment option. Traditional manufacturing of SOs is complex, time-consuming, and often results in inadequate fit and function. This technical note describes a novel digital workflow to design and manufacture a three-dimensional (3D)-printed hollow SO. Registered computed tomography and magnetic resonance imaging images are used for gross tumour delineation. The produced RTStruct set is exported as a stereolitography (STL) file and merged with a 3D model of the dental status. Based on these merged files, a personalized and hollow digital SO design is created, and 3D printed. Due to the proper fit of the prefabricated SO, a soft silicone lining material can be used during surgery to adapt the prosthesis to the oronasal defect, instead of putty materials that are not suitable for this purpose. An STL file of this final SO is created during surgery, based on a scan of the relined SO. The digital workflow results in a SO weight reduction, an increased fit, an up-to-date digital SO copy, and overall easier clinical handling. Copyright © 2018 International Association of Oral and Maxillofacial Surgeons. Published by Elsevier Ltd. All rights reserved.
Pragmatic Randomized, Controlled Trial of Patient Navigators and Enhanced Personal Health Records in CKD.

Science.gov (United States)

Navaneethan, Sankar D; Jolly, Stacey E; Schold, Jesse D; Arrigain, Susana; Nakhoul, Georges; Konig, Victoria; Hyland, Jennifer; Burrucker, Yvette K; Dann, Priscilla Davis; Tucky, Barbara H; Sharp, John; Nally, Joseph V

2017-09-07

Patient navigators and enhanced personal health records improve the quality of health care delivered in other disease states. We aimed to develop a navigator program for patients with CKD and an electronic health record-based enhanced personal health record to disseminate CKD stage-specific goals of care and education. We also conducted a pragmatic randomized clinical trial to compare the effect of a navigator program for patients with CKD with enhanced personal health record and compare their combination compared with usual care among patients with CKD stage 3b/4. Two hundred and nine patients from six outpatient clinics (in both primary care and nephrology settings) were randomized in a 2×2 factorial design into four-study groups: ( 1 ) enhanced personal health record only, ( 2 ) patient navigator only, ( 3 ) both, and ( 4 ) usual care (control) group. Primary outcome measure was the change in eGFR over a 2-year follow-up period. Secondary outcome measures included acquisition of appropriate CKD-related laboratory measures, specialty referrals, and hospitalization rates. Median age of the study population was 68 years old, and 75% were white. At study entry, 54% of patients were followed by nephrologists, and 88% were on renin-angiotensin system blockers. After a 2-year follow-up, rate of decline in eGFR was similar across the four groups ( P =0.19). Measurements of CKD-related laboratory parameters were not significantly different among the groups. Furthermore, referral for dialysis education and vascular access placement, emergency room visits, and hospitalization rates were not statistically significant different between the groups. We successfully developed a patient navigator program and an enhanced personal health record for the CKD population. However, there were no differences in eGFR decline and other outcomes among the study groups. Larger and long-term studies along with cost-effectiveness analyses are needed to evaluate the role of patient navigators
Balancing Audio

DEFF Research Database (Denmark)

Walther-Hansen, Mads

2016-01-01

is not thoroughly understood. In this paper I treat balance as a metaphor that we use to reason about several different actions in music production, such as adjusting levels, editing the frequency spectrum or the spatiality of the recording. This study is based on an exploration of a linguistic corpus of sound......This paper explores the concept of balance in music production and examines the role of conceptual metaphors in reasoning about audio editing. Balance may be the most central concept in record production, however, the way we cognitively understand and respond meaningfully to a mix requiring balance...
Four-quadrant flyback converter for direct audio power amplification

Energy Technology Data Exchange (ETDEWEB)

Ljusev, P.; Andersen, Michael A.E.

2005-07-01

This paper presents a bidirectional, four-quadrant yback converter for use in direct audio power amplication. When compared to the standard Class-D switching-mode audio power amplier with separate power supply, the proposed four-quadrant flyback converter provides simple and compact solution with high efciency, higher level of integration, lower component count, less board space and eventually lower cost. Both peak and average current-mode control for use with 4Q flyback power converters are described and compared. Integrated magnetics is presented which simplies the construction of the auxiliary power supplies for control biasing and isolated gate drives. The feasibility of the approach is proven on audio power amplier prototype for subwoofer applications. (au)
Using Network Oriented Research Assistant (NORA) Technology to Compare Digital Photographic With In-Person Assessment of Acne Vulgaris.

Science.gov (United States)

Singer, Hannah M; Almazan, Timothy; Craft, Noah; David, Consuelo V; Eells, Samantha; Erfe, Crisel; Lazzaro, Cynthia; Nguyen, Kathy; Preciado, Katy; Tan, Belinda; Patel, Vishal A

2018-02-01

Teledermatology has undergone exponential growth in the past 2 decades. Many technological innovations are becoming available without necessarily undergoing validation studies for specific dermatologic applications. To determine whether patient-taken photographs of acne using Network Oriented Research Assistant (NORA) result in similar lesion counts and Investigator's Global Assessment (IGA) findings compared with in-person examination findings. This pilot reliability study enrolled consecutive patients with acne vulgaris from a single general dermatology practice in Los Angeles, California, who were able to use NORA on an iPhone 6 to take self-photographs. Patients were enrolled from January 1 through March 31, 2016. Each individual underwent in-person and digital evaluation of his or her acne by the same dermatologist. A period of at least 1 week separated the in-person and digital assessments of acne. All participants were trained on how to use NORA on the iPhone 6 and take photographs of their face with the rear-facing camera. Reliability of patient-taken photographs with NORA for acne evaluation compared with in-person examination findings. Acne assessment measures included lesion count (total, inflammatory, noninflammatory, and cystic) and IGA for acne severity. A total of 69 patients (37 male [54%] and 32 female [46%]; mean [SD] age, 22.7 [7.7] years) enrolled in the study. The intraclass correlation coefficients of in-person and photograph-based acne evaluations indicated strong agreement. The intraclass correlation coefficient for total lesion count was 0.81; for the IGA, 0.75. Inflammatory lesion count, noninflammatory lesion count, and cyst count had intraclass correlation coefficients of 0.72, 0.72, and 0.82, respectively. This study found agreement between acne evaluations performed in person and from self-photographs with NORA. As a reliable telehealth technology for acne, NORA can be used as a teledermatology platform for dermatology research and can
Animation, audio, and spatial ability: Optimizing multimedia for scientific explanations

Science.gov (United States)

Koroghlanian, Carol May

This study investigated the effects of audio, animation and spatial ability in a computer based instructional program for biology. The program presented instructional material via text or audio with lean text and included eight instructional sequences presented either via static illustrations or animations. High school students enrolled in a biology course were blocked by spatial ability and randomly assigned to one of four treatments (Text-Static Illustration Audio-Static Illustration, Text-Animation, Audio-Animation). The study examined the effects of instructional mode (Text vs. Audio), illustration mode (Static Illustration vs. Animation) and spatial ability (Low vs. High) on practice and posttest achievement, attitude and time. Results for practice achievement indicated that high spatial ability participants achieved more than low spatial ability participants. Similar results for posttest achievement and spatial ability were not found. Participants in the Static Illustration treatments achieved the same as participants in the Animation treatments on both the practice and posttest. Likewise, participants in the Text treatments achieved the same as participants in the Audio treatments on both the practice and posttest. In terms of attitude, participants responded favorably to the computer based instructional program. They found the program interesting, felt the static illustrations or animations made the explanations easier to understand and concentrated on learning the material. Furthermore, participants in the Animation treatments felt the information was easier to understand than participants in the Static Illustration treatments. However, no difference for any attitude item was found for participants in the Text as compared to those in the Audio treatments. Significant differences were found by Spatial Ability for three attitude items concerning concentration and interest. In all three items, the low spatial ability participants responded more positively
Parametric Audio Based Decoder and Music Synthesizer for Mobile Applications

NARCIS (Netherlands)

Oomen, A.W.J.; Szczerba, M.Z.; Therssen, D.

2011-01-01

This paper reviews parametric audio coders and discusses novel technologies introduced in a low-complexity, low-power consumption audiodecoder and music synthesizer platform developed by the authors. Thedecoder uses parametric coding scheme based on the MPEG-4 Parametric Audio standard. In order to
Audio Feedback -- Better Feedback?

Science.gov (United States)

Voelkel, Susanne; Mello, Luciane V.

2014-01-01

National Student Survey (NSS) results show that many students are dissatisfied with the amount and quality of feedback they get for their work. This study reports on two case studies in which we tried to address these issues by introducing audio feedback to one undergraduate (UG) and one postgraduate (PG) class, respectively. In case study one…
Pythia: A Privacy-enhanced Personalized Contextual Suggestion System for Tourism

NARCIS (Netherlands)

Drosatos, G.; Efraimidis, P.S.; Arampatzis, A.; Stamatelatos, G.; Athanasiadis, I.N.

2015-01-01

We present Pythia, a privacy-enhanced non-invasive contextual suggestion system for tourists, with important architectural innovations. The system offers high quality personalized recommendations, non-invasive operation and protection of user privacy. A key feature of Pythia is the exploitation of
75 FR 17874 - Digital Audio Broadcasting Systems and Their Impact on the Terrestrial Radio Broadcast Service

Science.gov (United States)

2010-04-08

... power (ERP), and implements interference mitigation and remediation procedures to resolve promptly... ERP increase undertaken pursuant to the procedures adopted. The increase in FM hybrid digital ERP will... increases in FM digital ERP do not adversely affect existing FM analog operations. These rule changes...
Building technology platform aimed to develop service robot with embedded personality and enhanced communication with social environment

Directory of Open Access Journals (Sweden)

Aleksandar Rodić

2015-04-01

Full Text Available The paper is addressed to prototyping of technology platform aimed to develop of ambient-aware human-centric indoor service robot with attributes of emotional intelligence to enhance interaction with social environment. The robot consists of a wheel-based mobile platform with spinal (segmented torso, bi-manual manipulation system with multi-finger robot hands and robot head. Robot prototype was designed to see, hear, speak and use its multimodal interface for enhanced communication with humans. Robot is capable of demonstrating its affective and social behavior by using audio and video interface as well as body gestures. Robot is equipped with advanced perceptive system based on heterogeneous sensorial system, including laser range finder, ultrasonic distance sensors and proximity detectors, 3-axis inertial sensor (accelerometer and gyroscope, stereo vision system, 2 wide-range microphones, and 2 loudspeakers. The device is foreseen to operate autonomously but it may be also operated remotely from a host computer through wireless communication link as well as by use of a smart-phone based on advanced client-server architecture. Robot prototype has embedded attributes of artificial intelligence and utilizes advanced cognitive capabilities such as spatial reasoning, obstacle and collision avoidance, simultaneous localization and mapping, etc. Robot is designed in a manner to enable uploading of new or changing existing algorithms of emotional intelligence that should provide to robot human-like affective and social behavior. The key objective of the project presented in the paper regards to building advanced technology platform for research and development of personal robots aimed to use for different purpose, e.g. robot-entertainer, battler, robot for medical care, security robot, etc. In a word, the designed technology platform is expected to help in development human-centered service robots to be used at home, in the office, public institutions
Emotion-based Music Rretrieval on a Well-reduced Audio Feature Space

DEFF Research Database (Denmark)

Ruxanda, Maria Magdalena; Chua, Bee Yong; Nanopoulos, Alexandros

2009-01-01

-emotion. However, the real-time systems that retrieve music over large music databases, can achieve order of magnitude performance increase, if applying multidimensional indexing over a dimensionally reduced audio feature space. To meet this performance achievement, in this paper, extensive studies are conducted......Music expresses emotion. A number of audio extracted features have influence on the perceived emotional expression of music. These audio features generate a high-dimensional space, on which music similarity retrieval can be performed effectively, with respect to human perception of the music...... on a number of dimensionality reduction algorithms, including both classic and novel approaches. The paper clearly envisages which dimensionality reduction techniques on the considered audio feature space, can preserve in average the accuracy of the emotion-based music retrieval....
News video story segmentation method using fusion of audio-visual features

Science.gov (United States)

Wen, Jun; Wu, Ling-da; Zeng, Pu; Luan, Xi-dao; Xie, Yu-xiang

2007-11-01

News story segmentation is an important aspect for news video analysis. This paper presents a method for news video story segmentation. Different form prior works, which base on visual features transform, the proposed technique uses audio features as baseline and fuses visual features with it to refine the results. At first, it selects silence clips as audio features candidate points, and selects shot boundaries and anchor shots as two kinds of visual features candidate points. Then this paper selects audio feature candidates as cues and develops different fusion method, which effectively using diverse type visual candidates to refine audio candidates, to get story boundaries. Experiment results show that this method has high efficiency and adaptability to different kinds of news video.
Intersubjectivity in a digital genre: the Spanish indefinite pronoun uno (“one”) and person deixis in Yahoo Questions&Answers

OpenAIRE

Rasson, Marie; De Cock, Barbara; 14th International Pragmatics Conference

2015-01-01

In this paper, we study various mechanisms to create intersubjectivity in a digital genre, namely Yahoo Questions and Answers (YQA). More concretely, we focus on the Spanish indefinite strategy uno (“one”) and its interaction with deictic person pronouns. YQA aims to provide assistance to users, who can ask other users questions on topics of all types. The other users respond by giving advice - often by referring to their personal experience - or their opinion on a given issue (Placencia, 201...
Improving audio chord transcription by exploiting harmonic and metric knowledge

NARCIS (Netherlands)

de Haas, W.B.; Rodrigues Magalhães, J.P.; Wiering, F.

2012-01-01

We present a new system for chord transcription from polyphonic musical audio that uses domain-specific knowledge about tonal harmony and metrical position to improve chord transcription performance. Low-level pulse and spectral features are extracted from an audio source using the Vamp plugin
Self-oscillating modulators for direct energy conversion audio power amplifiers

DEFF Research Database (Denmark)

Ljusev, Petar; Andersen, Michael Andreas E.

2005-01-01

Direct energy conversion audio power amplifier represents total integration of switching-mode power supply and Class D audio power amplifier into one compact stage, achieving high efficiency, high level of integration, low component count and eventually low cost. This paper presents how self-oscillating...
Digital health increasing the impact with personalized design

NARCIS (Netherlands)

Empelen, P. van; Otten, W.; Molema, H.; Keijsers, J.; Mooij, R.

2016-01-01

Digital health is considered the ‘holy grail’ of effective and sustainable health(care). It uses the latest technology, apps and data to support and improve health. Digital health tools can benefit both patients and healthy individuals, with support and advice. But healthcare professionals,
Digital television: a new way to deliver information

Science.gov (United States)

Huang, Samson

1998-12-01

Digital television (DTV) is a new way to deliver video, audio, and other data. Why should TV be converted to digital? How does DTV work? What can we do with it? This paper provides some introduction about DTV, its history, and its roll-out plan. It then compares DTV with analog TV, and describes how DTV works. It also describes why the computer industry, as well as the consumer electronics industry, are both very interested I the DTV market. Next, it describes what Intel has done on DTV, including how we build a PC- based DTV, its test evaluation results, its new applications, and Intel's DTV station DMRL. This paper also describes remaining issues, our roadmap, vision, and future directions.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.