Advanced search

From:

To:

Items from 1 to 20 out of 31 results

chapter

Cepstral noise subtraction for robust automatic speech recognition

Robert Rehr, Timo Gerkmann

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 375 - 378

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The robustness of speech recognizers towards noise can be increased by normalizing the statistical moments of the Mel-frequency cepstral coefficients (MFCCs), e. g. by using cepstral mean normalization (CMN) or cepstral mean and variance normalization (CMVN). The necessary statistics are estimated over a long time window and often, a complete utterance is chosen. Consequently, changes in the background...

chapter

On the fundamental limitations of spectral subtraction: An assessment by automatic speech recognition

Nicholas W. D. Evans, John S. Mason, Wei M. Liu, Benoit Fauve

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

Spectral subtraction is one of the earliest and longest standing, popular approaches to noise compensation and speech enhancement. A literature search reveals an abundance of recent research papers that report the successful application of spectral subtraction to noise robust automatic speech recognition (ASR). However, as with many alternative approaches, the benefits lessen as noise levels in the...

chapter

Extracting situational awareness from microblogs during disaster events

Anirban Sen, Koustav Rudra, Saptarshi Ghosh

2015 7th International Conference on Communication Systems and Networks (COMSNETS) > 1 - 6

2015 7th International Conference on Communication Systems and Networks (COMSNETS)

Microblogging sites such as Twitter and Weibo are increasingly being used to enhance situational awareness during various natural and man-made disaster events such as floods, earthquakes, and bomb blasts. During any such event, thousands of microblogs (tweets) are posted in short intervals of time. Typically, only a small fraction of these tweets contribute to situational awareness, while the majority...

chapter

An inter-speaker evaluation through simulation of electrolarynx control based on statistical F₀ prediction

Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, more

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 4

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

An electrolarynx is a device that artificially generates excitation sounds to produce electrolaryngeal (EL) speech. Although proficient laryngectomees can produce intelligible EL speech by using this device, it sounds quite unnatural due to the mechanical excitation. To address this issue, we have proposed several EL speech enhancement methods using statistical voice conversion and showed that statistical...

chapter

An improved pitch detection of speech combined with speech enhancement

Xin Xu, Tian-qi Zhang, Sui Shi, Ya-juan Zhang

2014 7th International Congress on Image and Signal Processing > 778 - 782

2014 7th International Congress on Image and Signal Processing (CISP)

For poor robustness issues of pitch detection of noisy speech, the improved pitch detection method combined with speech enhancement is proposed in this paper. Firstly, in order to reduce background noise and receive the clean speech relatively, we use the multi-band spectral subtraction and the masking properties of human auditory system to work on the noisy speech, and next use the energy and zero-crossing...

chapter

On existence of optimal boundary value between early reflections and late reverberation

Arkadiy Prodeus, Olga Ladoshko

2014 IEEE 34th International Scientific Conference on Electronics and Nanotechnology (ELNANO) > 442 - 446

2014 IEEE 34th International Conference on Electronics and Nanotechnology (ELNANO)

Enhancement of speech distorted by reverberation is issue of the day. The problem has been actively studied in the last decade. However, it is still extremely difficult to find clear recommendations on choice of boundary value between early reflections and late reverberation, optimal in sense of such criteria as speech recognition accuracy and speech quality. Another problem is getting of simple pre-processor...

chapter

Signal and feature domain enhancement approaches for robust speech recognition

Jinkyu Lee, Soonho Baek, Hong-Goo Kang

2011 8th International Conference on Information, Communications & Signal Processing > 1 - 4

2011 8th International Conference on Information, Communications & Signal Processing (ICICS)

This paper analyzes the impact of various preprocessing modules to improve the performance of automatic speech recognition system (ASR) in noisy environment. After choosing the state-of-the-art algorithms designed in the signal domain and feature domain, their performances in various noise conditions are thoroughly evaluated. Since the enhancement has been directly made to the features that are actually...

chapter

Cross-Channel Spectral Subtraction for meeting speech recognition

Yu Nasu, Koichi Shinoda, Sadaoki Furui

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4812 - 4815

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose Cross-Channel Spectral Subtraction (CCSS), a source separation method for recognizing meeting speech where one microphone is prepared for each speaker. The method quickly adapts to changes in transfer functions and uses spectral subtraction to suppress the speech of other speakers. Compared with conventional source separation methods based on independent component analysis (ICA) or that...

chapter

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

Iosif Mporas, Todor Ganchev, Otilia Kocsis, Nikos Fakotakis

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5176 - 5179

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We present a speech pre-processing scheme (SPPS) for robust speech recognition in the moving motorcycle environment. The SPPS is dynamically adapted during the run-time operation of the speech front-end, depending on short-time characteristics of the acoustic environment. In detail, the fast varying acoustic environment is modeled by GMM clusters based on which a selection function determines the...

chapter

An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5136 - 5139

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this study, we evaluate our proposed methods for enhancing alaryngeal speech based on statistical voice conversion techniques. Voice conversion based on a Gaussian mixture model has been applied to the conversion of alaryngeal speech into normal speech (AL-to-Speech). Moreover, one-to-many eigenvoice conversion (EVC) has also been applied to AL-to-Speech to enable the recovery of the original voice...

chapter

Improving speech intelligibility in an adverse condition using subband spectral subtraction method

Rashmi Makhijani, Urmila Shrawankar, V M Thakare

2011 International Conference on Communications and Signal Processing > 168 - 170

2011 International Conference on Communications and Signal Processing (ICCSP)

Many people have great difficulty in Understanding speech with background noise. Speech Enhancement plays a vital role in such situations. The background noise has to be removed from the noisy speech signal to increase the signal intelligibility and to reduce the listener fatigue. In this paper, a novel approach is used to enhance the perceived quality of the speech signal when the additive noise...

chapter

Detection of burst onset landmarks in speech using rate of change of spectral moments

A R Jayan, P S Rajath Bhat, P C Pandey

2011 National Conference on Communications (NCC) > 1 - 5

2011 National Conference on Communications (NCC)

Burst onset landmarks in the speech signal are transient segments with low energy and their accurate detection is important in applications involving landmark based speech modification, estimation of place of closure for speech training aids, and phoneme recognition. Rate of change measures of energy parameters from spectral bands with fixed boundaries are generally used for landmark detection. The...

chapter

Speech segmentation using a hypothesis test based on Random Matrix Theory

N Faraji, S M Ahadi, H Sheikhzadeh, A Moghaddamjoo

The 10th IEEE International Symposium on Signal Processing and Information Technology > 309 - 314

2010 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2010)

Speech segmentation to covariance-stationary regions is of interest, for example in subspace-based speech enhancement. However as the true covariance matrices of speech segments are unknown, it is usual to use their sample estimates. To check whether two sample covariance matrices have been drawn from the same distribution or not, we have used a test statistic previously proposed for image segmentation...

chapter

SURE-MSE speech enhancement for robust speech recognition

Nengheng Zheng, Xia Li, T Blu, Tan Lee

2010 7th International Symposium on Chinese Spoken Language Processing > 271 - 274

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

This paper presents a new approach to enhancing noisy (white Gaussian noise) speech signals for robust speech recognition. It is based on the minimization of an estimate of denoising MSE (known as SURE) and does not require any hypotheses on the original signal. The enhanced signal is obtained by thresholding coefficients in the DCT domain, with the parameters in the thresholding functions being specified...

chapter

Speech Emotion Analysis in Noisy Real-World Environment

A Tawari, M M Trivedi

2010 20th International Conference on Pattern Recognition > 4605 - 4608

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Automatic recognition of emotional states via speech signal has attracted increasing attention in recent years. A number of techniques have been proposed which are capable of providing reasonably high accuracy for controlled studio settings. However, their performance is considerably degraded when the speech signal is contaminated by noise. In this paper, we present a framework with adaptive noise...

chapter

A speech enhancement preprocessor for robust speech recognition

Hanwu Zhao, Xia Zou, Jia Yang

2010 Second International Conference on Communication Systems, Networks and Applications > 1 > 56 - 58

2010 Second International Conference on Communication Systems, Networks and Applications (ICCSNA 2010)

To improve the robustness of automatic speech recognition (ASR) system in adverse environments, speech enhancement preprocessor has been widely used recently to reduce the impact of noise. In this paper, an improved noise estimation approach is proposed in the enhancement preprocessor to keep enhancement performance with low distortion and complexity. First, the noisy speech is transformed into Bark...

chapter

Magnitude spectrum enhancement for robust speech recognition

Wen-hsiang Tu, Jeih-weih Hung

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4586 - 4589

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, an effective compensation scheme for the spectra of speech signals is proposed in order to improve their noise robustness. In this compensation scheme, named magnitude spectrum enhancement (MSE), a voice activity detection (VAD) process is first processed for the frame sequence of the utterance, and then the magnitude spectra of non-speech frames are set to be small while those of speech...

chapter

Voice activity detection using harmonic frequency components in likelihood ratio test

Lee Ngee Tan, B J Borgstrom, A Alwan

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4466 - 4469

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain reliable speech / non-speech decisions. In the proposed method, the likelihood ratio (LR) is calculated differently for voiced frames, as opposed to unvoiced frames: only DFT bins containing harmonic spectral peaks are selected for LR computation. To evaluate the new VAD's effectiveness in improving the noise-robustness...

chapter

Noise robust isolated word recognition

M G Sumithra, M S Ramya, K Thanuskodi

2010 International Conference on Communication and Computational Intelligence (INCOCCI) > 362 - 367

2010 International Conference on Communication and Computational Intelligence (INCOCCI)

In this paper, we suggest a noise robust isolated word speech recognition system which can be applied in various noise environments. In this method, Kalman filter is used to remove the background noise and to enhance the speech signal. The enhanced signal is integrated into the front end of Dynamic Time Warping (DTW) isolated word recognition in order to guarantee high performance and robust recognition...

chapter

One fuzzy retrieval algorithm for speech navigation system

Yanqing Sun, Yu Zhou, Qingwei Zhao, Yonghong Yan, more

2009 International Conference on Future BioMedical Information Engineering (FBIE) > 85 - 89

2009 International Conference on Future BioMedical Information Engineering (FBIE 2009)

In this paper, one fuzzy retrieval algorithm is designed to work with LVCSR in the speech navigation system. Inverted indexing as well as other searching skills are utilized to speed up the searching while keeping the performance. Several cell levels are tried instead of just using word. Easily reaching 90% sentence accuracy within normal database, this framework can also handle very large database,...

Keywords:
ACCURACY
SPEECH
SPEECH ENHANCEMENT

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (18)
NOISE (14)
SIGNAL TO NOISE RATIO (9)
FEATURE EXTRACTION (7)
NOISE MEASUREMENT (7)
DATABASES (6)
ESTIMATION (6)
SPEECH PROCESSING (6)
TRAINING (5)
ACOUSTICS (4)
AUDITORY SYSTEM (4)
AUTOMATIC SPEECH RECOGNITION (4)
BACKGROUND NOISE (4)
COMPUTATIONAL MODELING (4)
HIDDEN MARKOV MODELS (4)
MEL FREQUENCY CEPSTRAL COEFFICIENT (4)
NOISE REDUCTION (4)
ADAPTATION MODEL (3)
ADDITIVE NOISE (3)
ARRAYS (3)
CEPSTRAL ANALYSIS (3)
DATA MINING (3)
FILTERING (3)
MICROPHONES (3)
NOISE ROBUSTNESS (3)
ROBUST SPEECH RECOGNITION (3)
ROBUSTNESS (3)
SIGNAL DENOISING (3)
SIGNAL PROCESSING (3)
SPECTRAL ANALYSIS (3)
SPEECH SIGNAL (3)
ALGORITHM DESIGN AND ANALYSIS (2)
ANALYTICAL MODELS (2)
APPROXIMATION ALGORITHMS (2)
ARRAY SIGNAL PROCESSING (2)
AWGN (2)
BEAMFORMING (2)
COMPUTERS (2)
CONFERENCES (2)
CONVOLUTION (2)
CORRELATION (2)
COVARIANCE MATRIX (2)
DEGRADATION (2)
DISCRETE FOURIER TRANSFORMS (2)
DISTANCE MEASUREMENT (2)
DISTORTION (2)
ELECTRONIC MAIL (2)
EMOTION RECOGNITION (2)
EQUATIONS (2)
FILTER BANK (2)
GAUSSIAN NOISE (2)
GAUSSIAN PROCESSES (2)
INTERFERENCE SUPPRESSION (2)
MASKING THRESHOLD (2)
MATHEMATICAL MODEL (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
MICROPHONE ARRAYS (2)
MOBILE COMMUNICATION (2)
NATURAL LANGUAGE PROCESSING (2)
NAVIGATION (2)
OPTIMIZED PRODUCTION TECHNOLOGY (2)
SIGNAL PROCESSING ALGORITHMS (2)
SIGNAL RESOLUTION (2)
SPEAKER RECOGNITION (2)
SPECTRAL SUBTRACTION (2)
SYSTEM-ON-A-CHIP (2)
TRANSFER FUNCTIONS (2)
USA COUNCILS (2)
VOICE ACTIVITY DETECTION (2)
WHITE NOISE (2)
A PRIORI SNR ESTIMATOR (1)
ACF (1)
ACOUSTIC DISTORTION (1)
ACOUSTIC LANDMARK DETECTION (1)
ACOUSTIC LANDMARK DETECTION TECHNIQUE (1)
ACOUSTIC MEASUREMENTS (1)
ACOUSTIC NOISE (1)
ACOUSTIC SIGNAL DETECTION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTIVE ARRAYS (1)
ADAPTIVE NOISE CANCELLATION (1)
ADAPTIVE NOISE ESTIMATION (1)
AFFECTIVE COMPUTING (1)
ALARYNGEAL SPEECH (1)
AMDF (1)
AND RANKING (1)
APPROXIMATION METHODS (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASR (1)
ATTENUATION (1)
AUDIO CODING (1)
AURORA 2 (1)
AURORA-2 NOISY DIGIT DATABASE (1)
AURORA2 DATABASE (1)
AUTOMATED DETECTION (1)
AUTOMATIC RECOGNITION (1)
BACKGROUND NOISE REMOVAL (1)
more

INFONA - science communication portal

Advanced search

Advanced search

Cepstral noise subtraction for robust automatic speech recognition

On the fundamental limitations of spectral subtraction: An assessment by automatic speech recognition

Extracting situational awareness from microblogs during disaster events

An inter-speaker evaluation through simulation of electrolarynx control based on statistical F₀ prediction

An improved pitch detection of speech combined with speech enhancement

On existence of optimal boundary value between early reflections and late reverberation

Signal and feature domain enhancement approaches for robust speech recognition

Cross-Channel Spectral Subtraction for meeting speech recognition

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

Improving speech intelligibility in an adverse condition using subband spectral subtraction method

Detection of burst onset landmarks in speech using rate of change of spectral moments

Speech segmentation using a hypothesis test based on Random Matrix Theory

SURE-MSE speech enhancement for robust speech recognition

Speech Emotion Analysis in Noisy Real-World Environment

A speech enhancement preprocessor for robust speech recognition

Magnitude spectrum enhancement for robust speech recognition

Voice activity detection using harmonic frequency components in likelihood ratio test

Noise robust isolated word recognition

One fuzzy retrieval algorithm for speech navigation system

Filter options

Publication date

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options