Items from 1 to 20 out of 31 results

chapter

Late reverberation reduction and blind reverberation time measurement for automatic speech recognition

Arkadiy Prodeus

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 634 - 639

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)

Development of automatic speech recognition (ASR) systems robust to late reverberation is an urgent task. It is well known that a late reverberation reduction algorithm used as an ASR pre-processor requires prior estimation of the reverberation time. Blind reverberation time measurements are less accurate than direct measurements based on a known room impulse response (RIR). As a result, it is naturally expect...

chapter

I-Vector based depression level estimation technique

Barkha Rani

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) > 2067 - 2071

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)

Depression is considered a psychosomatic state associated with soft biometric features. People suffering from depression always behave abnormally. Depression is a clinically proven disorder that can overwhelm a person and their ability to perform even a simple task. Soft biometrics provide important information about a person without being sufficient for their verification because they lack uniqueness...

chapter

Dual-channel speech separation by sub-segmental directional statistics

Zhaogui Ding, Weifeng Li, Qingmin Liao

2016 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) > 2287 - 2291

2016 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)

In this letter, we present a novel speech separation scheme using two microphones. We divide the inter-phase information into sub-segments and compute statistics over these directional segments. We then construct the objective function by convolving the statistical information with a low-pass filter. Using a gradient descent algorithm and an ideal binary mask, we obtain the separated speech signals. The method is valid...

chapter

Sound event classification with feature vector combination for automatic audio-based surveillance

Seunghyung Lee, Jinuk Park, Sangjun Park, Minsoo Hahn

2016 IEEE International Conference on Consumer Electronics (ICCE) > 147 - 148

2016 IEEE International Conference on Consumer Electronics (ICCE)

This paper deals with sound event classification for automatic audio-based surveillance. To improve performance, we propose a feature vector combination scheme that uses multiple feature vectors simultaneously. The performance is then evaluated using a combination of three segment-based features. The results show a significant improvement compared to the conventional method.

chapter

A Simple Voice Extraction Algorithm in a High-Noise Environment

Yichao Cao, Fujun Wang, Yaheng Zhang

2014 IEEE 17th International Conference on Computational Science and Engineering > 1201 - 1204

2014 IEEE 17th International Conference on Computational Science and Engineering (CSE)

This study elaborates on the implementation of a strong noise suppression algorithm for speech-related applications. Three different structures of the linear minimum mean square error estimator are presented with parameter estimation. The reference voice sample is replaced with a similar one without degrading the performance.

chapter

On semi-blind estimation of echo paths during double-talk based on nonstationarity

Zbynek Koldovsky, Jiri Malek, Michael Muller, Petr Tichavský

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 198 - 202

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC)

The estimation of a filter that determines an echo path is a difficult problem when double-talk is present. We use a Blind Source Separation model based on signals' nonstationarity with a partially known mixing matrix to estimate the filter during the double talk. A second-order approximation of a log-likelihood function is used to derive a quadratic criterion. Then, we propose two methods to estimate...

chapter

A discriminative learning approach to probabilistic acoustic source localization

Hendrik Kayser, Jorn Anemuller

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 99 - 103

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC)

Sound source localization algorithms commonly include assessment of inter-sensor (generalized) correlation functions to obtain direction-of-arrival estimates. Here, we present a classification-based method for source localization that uses discriminative support vector machine-learning of correlation patterns that are indicative of source presence or absence. Subsequent probabilistic modeling generates...

chapter

A quantitative comparison of blind C₅₀ estimators

P. Peso Parada, D. Sharma, J. Lainez, D. Barreda, et al.

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) > 298 - 302

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC)

The problem of blind estimation of the room acoustic clarity index C₅₀ from single-channel reverberant speech signals is presented in this paper. We analyze the performance of several machine learning methods for a regression task using 309 features derived from the speech signal and modeled with a Deep Belief Network (DBN), Classification And Regression Tree (CART) and Linear Regression (LR). These...

chapter

Self-localization of wireless acoustic sensors in meeting rooms

Mikko Parviainen, Pasi Pertila, Matti S. Hamalainen

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) > 152 - 156

2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)

This paper presents a passive acoustic self-localization and synchronization system, which estimates the positions of wireless acoustic sensors utilizing the signals emitted by the persons present in the same room. The system is designed to utilize common off-the-shelf devices such as mobile phones. Once devices are self-localized and synchronized, the system could be utilized by traditional array...

chapter

A comparative analysis on cepstrum, linear predictive coding and particle filtering based formant estimation methods

Mustafa Anil Resat, Halil Ibrahim Gokcimen, Umut Arioz

2014 22nd Signal Processing and Communications Applications Conference (SIU) > 365 - 368

2014 22nd Signal Processing and Communications Applications Conference (SIU)

Formants can describe basic properties of speech efficiently using very limited parameter sets; they have therefore found important use in many applications of speech processing, such as coding, recognition, synthesis and enhancement. Estimation of formants is harder than simply tracking the peaks of the spectrum; as the output of the vocal tract's spectral peaks are dependent on the shape...

chapter

Maximum-likelihood based 3D acoustical signature estimation

Banu Gunel

2014 22nd Signal Processing and Communications Applications Conference (SIU) > 786 - 789

2014 22nd Signal Processing and Communications Applications Conference (SIU)

An audio recording, made in a real environment, carries an acoustical signature which changes according to the acoustical characteristics of the environment and the recording positions. This signature which is similar to a 3D room impulse response contains the directions, levels and arrival times of the direct source and reflections. Although it is easy to obtain reverberant recordings by convolving...

chapter

Using reverberation time estimates in blind separation of acoustic sources

Diego Barreto Haddad, Mariane Rembold Petraglia, Paulo Bulkool Batalheiro

2013 IEEE Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE) > 153 - 157

2013 IEEE Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE)

Blind separation techniques of sound sources, designed to work with voice signals, present a performance highly dependent on the number of coefficients of the separation system. In general, different environments require different lengths of separation filters. This paper proposes the use of reverberation time information arising from lateral blind estimation techniques for tuning the degree of freedom...

chapter

An overview of informed audio source separation

Antoine Liutkus, Jean-Louis Durrieu, Laurent Daudet, Gael Richard

2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) > 1 - 4

2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)

Audio source separation consists in recovering different unknown signals called sources by filtering their observed mixtures. In music processing, most mixtures are stereophonic songs and the sources are the individual signals played by the instruments, e.g. bass, vocals, guitar, etc. Source separation is often achieved through a classical generalized Wiener filtering, which is controlled by parameters...

chapter

Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-armed Bandits

Alexey Tarasov, Sarah Jane Delany, Brian Mac Namee

2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing > 979 - 980

2012 International Conference on Privacy, Security, Risk and Trust (PASSAT)

Many application areas that use supervised machine learning make use of multiple raters to collect target ratings for training data. Usage of multiple raters, however, inevitably introduces the risk that a proportion of them will be unreliable. The presence of unreliable raters can prolong the rating process, make it more expensive and lead to inaccurate ratings. The dominant, "static" approach...

chapter

A compressed sensing approach to the simultaneous recording of multiple room impulse responses

Alexis Benichoux, Emmanuel Vincent, Remi Gribonval

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 285 - 288

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

We consider the estimation of multiple room impulse responses from the simultaneous recording of several known sources. Existing techniques are restricted to the case where the number of sources is at most equal to the number of sensors. We relax this assumption in the case where the sources are known. To this aim, we propose statistical models of the filters associated with convex log-likelihoods,...

chapter

On the estimation of low fundamental frequencies

Mads Groesboll Christensen

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 169 - 172

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

In this paper, we analyze the difficult problem of estimating low fundamental frequencies from periodic signals, like those produced by musical instruments. The problem arises when the fundamental frequency is low for a given number of samples as this causes the harmonics to overlap in the frequency domain. Moreover, we demonstrate how the performance of estimators can generally be improved by avoiding...

chapter

Soft frame margin estimation of Gaussian Mixture Models for speaker recognition with sparse training data

Yan Yin, Qi Li

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5268 - 5271

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Discriminative Training (DT) methods for acoustic modeling, such as MMI, MCE, and SVM, have been proved effective in speaker recognition. In this paper we propose a DT method for GMM using soft frame margin estimation. Unlike other DT methods such as MMI or MCE, the soft frame margin estimation attempts to enhance the generalization capability of GMM to unseen data in case the mismatch exists between...

chapter

Evaluating different confirmation strategies for speech-to-speech translation systems

David Stallard, Rohit Prasad, Shankar Ananthakrishnan, Fred Choi, et al.

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5218 - 5221

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Speech-to-speech translation systems have made a great deal of progress in recent years. But users of such systems still face the problem of not knowing whether the system has translated their utterance correctly. Various confirmation strategies can be used to address this problem. Some of these generate a confirmation utterance for the user to approve, such as reading back the ASR result, or performing...

chapter

Chapter 14: A Comparative Study of ICA Based Approaches for Separation of Components in Functional MRI Sequences

G.R. Rad, H.H. Larijani

2008 3rd International Conference on Geometric Modeling and Imaging > 87 - 91

3rd International Conference on Geometric Modeling and Imaging (GMAI 2008)

This paper presents a review of ICA based approaches that are used for separation of components in functional MRI sequences. In previous works, the FastICA and the Infomax algorithms are investigated in more detail; therefore, in this paper we focus on methods such as "radical ICA", "SDD ICA", "Erica" and "Evd" for separation purposes. This comparative study...

chapter

Improved Adaptive Fingerprint Binarization

Josef Ström Bartunek, Mikael Nilsson, Jörgen Nordberg, Ingvar Claesson

2008 Congress on Image and Signal Processing > 5 > 756 - 760

International Congress on Image and Signal Processing (CISP 2008)

In this paper, improvements to a previous work are presented. Removal of redundant artifacts in the fingerprint mask is introduced, enhancing the final result. The proposed method is an entirely adaptive process that adjusts to each fingerprint without any further supervision by the user. Hence, the algorithm is insensitive to the characteristics of the fingerprint sensor and the various physical appearances...

Keywords:
CONFERENCES
ESTIMATION
SPEECH

Keywords

SPEECH PROCESSING (16)
ACOUSTICS (15)
SIGNAL PROCESSING (14)
SIGNAL PROCESSING ALGORITHMS (14)
ALGORITHM DESIGN AND ANALYSIS (12)
NOISE (10)
SIGNAL TO NOISE RATIO (10)
ACCURACY (9)
ARTIFICIAL NEURAL NETWORKS (8)
COMPLEXITY THEORY (8)
COMPUTERS (8)
ELECTRONIC MAIL (8)
CORRELATION (7)
EQUATIONS (7)
MATHEMATICAL MODEL (7)
EDUCATIONAL INSTITUTIONS (6)
SIMULATION (6)
FILTERING ALGORITHMS (5)
FREQUENCY DOMAIN ANALYSIS (5)
INDEXES (5)
REAL TIME SYSTEMS (5)
TRAINING (5)
APPROXIMATION ALGORITHMS (4)
APPROXIMATION METHODS (4)
COMPUTATIONAL MODELING (4)
ENCODING (4)
FEATURE EXTRACTION (4)
FILTERING (4)
FREQUENCY ESTIMATION (4)
IMAGE PROCESSING (4)
IMAGE SEGMENTATION (4)
MICROPHONES (4)
PREDICTION ALGORITHMS (4)
REVIEWS (4)
ROBUSTNESS (4)
SOURCE SEPARATION (4)
TIME FREQUENCY ANALYSIS (4)
TRANSFORMS (4)
ADAPTIVE SYSTEMS (3)
ANALYTICAL MODELS (3)
ARRAY SIGNAL PROCESSING (3)
AUDITORY SYSTEM (3)
BACKGROUND NOISE (3)
BLIND SOURCE SEPARATION (3)
CIRCUITS AND SYSTEMS (3)
COMPUTER LANGUAGES (3)
COUPLINGS (3)
CYBERNETICS (3)
DATABASES (3)
DEGRADATION (3)
DELAY (3)
EIGENVALUES AND EIGENFUNCTIONS (3)
FAST FOURIER TRANSFORMS (3)
FILTERING THEORY (3)
FOURIER TRANSFORMS (3)
GAIN (3)
GAUSSIAN NOISE (3)
HARMONIC ANALYSIS (3)
HELIUM (3)
IEEE TRANSACTIONS ON IMAGE PROCESSING (3)
IMAGE RESOLUTION (3)
MACHINE LEARNING (3)
MANGANESE (3)
MATRIX DECOMPOSITION (3)
OPTIMIZATION (3)
PATTERN RECOGNITION (3)
PERFORMANCE EVALUATION (3)
SPEECH RECOGNITION (3)
TESTING (3)
USA COUNCILS (3)
VECTORS (3)
WAVELET TRANSFORMS (3)
WHITE NOISE (3)
ADAPTATION MODEL (2)
ADDITIVE NOISE (2)
BANDWIDTH (2)
BRAIN MODELING (2)
BRAIN MODELS (2)
CAMERAS (2)
CLASSIFICATION ALGORITHMS (2)
CLUSTERING ALGORITHMS (2)
COMPUTER VISION (2)
CONVERGENCE (2)
CONVOLUTION (2)
COVARIANCE MATRIX (2)
DATA MODELS (2)
DIGITAL SIGNAL PROCESSING (2)
DISCRETE FOURIER TRANSFORMS (2)
DISTANCE MEASUREMENT (2)
DISTORTION (2)
ENTROPY (2)
FREQUENCY MODULATION (2)
FREQUENCY RESPONSE (2)
GABOR FILTERS (2)
GRAPHICS (2)
HIDDEN MARKOV MODELS (2)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2)

INFONA - science communication portal
