Search results

chapter

Speech enhancement using a critical point based Wiener Filter

Meihui Lu, Xuan Zhou, Nabih Jaber, Kun Hua, more

2017 Advances in Wireless and Optical Communications (RTUWO) > 175 - 179

2017 Advances in Wireless and Optical Communications (RTUWO)

This paper presents an new approach to noise reduction for voice communication over Bluetooth technology. In the literature, several authors have compared the performance of different filtering techniques, such as the well-known Spectral Subtraction (SS), and Wiener Filter (WF) using simulated data, whereas this research uses real-time data samples collected from cars subjected to a noisy environment...

chapter

Robust features for automatic estimation of physical parameters from speech

Kalluri Shareef Babu, Deepu Vijayasenan

TENCON 2017 - 2017 IEEE Region 10 Conference > 1515 - 1519

TENCON 2017 - 2017 IEEE Region 10 Conference

Estimating speaker's physical parameters like height, weight and shoulder size can assist in voice forensics by providing additional knowledge about the speaker. In this work, statistics of the components of background GMM are employed as features in estimating the physical parameters. These features improved the performance of height and shoulder size estimation as compared to our earlier attempt...

chapter

PSD estimation of multiple sound sources in a reverberant room using a spherical microphone array

Abdullah Fahim, Prasanga N. Samarasinghe, Thushara D. Abhayapala

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 76 - 80

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

We propose an efficient method to estimate source power spectral densities (PSDs) in a multi-source reverberant environment using a spherical microphone array. The proposed method utilizes the spatial correlation between the spherical harmonics (SH) coefficients of a sound field to estimate source PSDs. The use of the spatial cross-correlation of the SH coefficients allows us to employ the method...

chapter

Performance study of three different sparse adaptive filtering algorithms for echo cancellation in long acoustic impulse responses

A. Tedjani, A. Benallal

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B) > 1 - 7

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B)

In this paper, the problem of echo cancellation in long acoustic impulse responses (AIRs) is highlighted. Three of the mostly-used recent NLMS-based sparse adaptive filtering algorithms are presented; and their performances in the context of acoustic echo cancellation (AEC) are studied and compared. The algorithms of interest include the improved proportionate normalized least mean square (IPNLMS),...

chapter

A convex optimization approach for time-frequency mask estimation

Feng Bao, Waleed H. Abdulla

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 31 - 35

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

In this paper, we propose a new time-frequency mask method for computational auditory scene analysis (CASA) based on convex optimization of the binary mask. In the proposed method, the pitch estimation and segment segregation in conventional CASA are completely replaced by the convex optimization of speech power. Considering the cross-correlation between the power spectra of noisy speech and noise...

chapter

Amplitude and phase dereverberation of harmonic signals

Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 294 - 298

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

While most dereverberation methods focus on how to estimate the magnitude of an anechoic signal in the time-frequency domain, we propose a method which also takes the phase into account. By applying a harmonic model to the anechoic signal, we derive a formulation to compute the amplitude and phase of each harmonic. These parameters are then estimated by our method in presence of reverberation. As...

chapter

Vocal dysperiodicities estimation using fractional order long-term linear prediction

Asma Kissoum, Abdellah Kacha

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B) > 1 - 5

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B)

Objective markers obtained from acoustic analysis of speech are of great importance for clinical evaluation of voice disorders because they are non-invasive and provide a severity index of the disorder which allows clinicians to monitor the progress of patients and documents quantitatively the degree of perceived hoarseness. The object of the present study is to introduce a fractional order long-term...

chapter

Enhancing speech rate estimation techniques to improve dysarthria diagnosis

James Nathaniel Carmichael

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) > 309 - 313

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)

This report discusses the implementation of a computerized algorithm specifically designed to measure the syllables-per-minute rate of abnormal speech typically produced by persons suffering from an articulatory disorder known as dysarthria. This speech rate measurement application — which can also serve as a diagnostic tool in itself — has been integrated into the computerised Frenchay Dysarthria...

chapter

A DNN regression approach to speech enhancement by artificial bandwidth extension

Johannes Abel, Tim Fingscheidt

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 219 - 223

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Artificial speech bandwidth extension (ABE) is an extremely effective means for speech enhancement at the receiver side of a narrowband telephony call. First approaches have been seen incorporating deep neural networks (DNNs) into the estimation of the upper band speech representation. In this paper we propose a regression-based DNN ABE being trained and tested on acoustically different speech databases,...

chapter

Noise power spectral density estimation for binaural noise reduction exploiting direction of arrival estimates

Daniel Marquardt, Simon Doclo

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 234 - 238

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Noise reduction algorithms for head-mounted assistive listening devices are crucial to improve speech quality and intelligibility in background noise. For binaural hearing devices with one microphone per device, the noise power spectral density (PSD) is commonly estimated using various assumptions about the acoustic scenario. Since these methods lack robustness if the underlying assumptions are not...

chapter

Broadband doa estimation using convolutional neural networks trained with noise signals

Soumitro Chakrabarty, Emanuel A. P. Habets

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 136 - 140

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

A convolution neural network (CNN) based classification method for broadband DOA estimation is proposed, where the phase component of the short-time Fourier transform coefficients of the received microphone signals are directly fed into the CNN and the features required for DOA estimation are learned during training. Since only the phase component of the input is used, the CNN can be trained with...

chapter

Multiple fundamental frequencies estimation approaches based on multi-scale product analysis

Jihen Zeremdini, Mohamed Anouar Ben Messaoud, Aicha Bouzid

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 55 - 58

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)

This paper describes three methods for multiple fundamental frequencies estimation based on the multi-scale product analysis. The three methods use the autocorrelation of the multi-scale product analysis for the target pitch estimation. For the intrusion pitch, each one has its techniques. The first one uses the classic comb filtering. The second method employs the rectangular comb filter followed...

chapter

Dictionary learning for pitch estimation in speech signals

Feng Huang, Peter Balazs

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

This paper presents an automatic approach for parameter training for a sparsity-based pitch estimation method that has been previously published. For this pitch estimation method, the harmonic dictionary is a key parameter that needs to be carefully prepared beforehand. In the original method, extensive human supervision and involvement are required to construct and label the dictionary. In this study,...

chapter

Comparison of glottal closure instant estimation algorithms for singing voices in Indian context

K. S. Gokul Krishnan, D. Govind

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1447 - 1452

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Glottal closure instant (GCI) is an important feature in many speech processing applications. Many algorithms have been proposed for GCI estimation from speech signals. The objective of the proposed work is to provide a comprehensive analysis of the performance of various GCI estimation algorithms for singing voice in Indian context. GCI estimation algorithms such as Dynamic Programming Phase Slope...

chapter

Robust pitch estimation in distant speech signals collected from vehicle

Dipesh Mudatkar, S. Adarsh, D. Govind

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1784 - 1791

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Due to significant signal attenuation, the speech signals collected at different distances show degradation in the estimation of speech parameters. Therefore the work presented in this paper proposes an alternate method for improving the F₀ parameter estimation from distant speech (DS) signals which are collected through microphones at various distances. The proposed method achieves improved F₀ estimation...

chapter

Overdetermined blind source separation using approximate joint diagonalization

Taiki Asamizu, Shinya Saito, Kunio Oishi, Toshihiro Furukawa

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 168 - 171

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

Blind separation of mixtures has been achieved by approximate joint diagonalization (AJD) approaches. This paper presents an approach for overdetermined blind source separation (BSS) using AJD. The approach is based on an alternative minimization of the indirect and direct least-squares criteria to the diagonal matrices in the first phase and to the mixing matrix in the second phase, respectively...

chapter

Multi-channel estimation of power spectral density matrix using inter-frame and inter-banc information

Raziyeh Ranjbaryan, Hamid Reza Abutalebi

2017 25th European Signal Processing Conference (EUSIPCO) > 1634 - 1638

2017 25th European Signal Processing Conference (EUSIPCO)

In this paper, we address the estimation of power spectral density (PSD) matrix. The accurate estimation of PSD matrix plays an important role in many speech enhancement methods. In traditional PSD estimation methods, only the information of previous frames is employed through a forgetting factor. In the current research, we consider the correlation of inter-band components and incorporate their information...

chapter

Real time noise suppression in social settings comprising a mixture of non-stationary anc transient noise

Pei Chee Yong, Sven Nordholm

2017 25th European Signal Processing Conference (EUSIPCO) > 588 - 592

2017 25th European Signal Processing Conference (EUSIPCO)

Hearable is a recently emerging term that describes a wireless earpiece that enhances the user's listening experience in various acoustic environment. Another important feature of hearable devices is their capability to improve speech communication in difficult social settings, which usually consist of a mixture of different non-stationary noise. In this paper, we present techniques to suppress a...

chapter

Noise estimation with an inverse comb filter in non-stationary noise environments

Tetsuya Shimamura, Fumiya Kato

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 156 - 159

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

We propose a new noise estimation method using only the current frame of noisy speech. The proposed method utilizes an inverse comb filter for noisy speech to suppress the power of speech, and estimates the noise from the resulting spectrum. It is shown by experiments that the spectral subtraction combined with the proposed noise estimation method is superior to the conventional speech enhancement...

chapter

Comparison of different filter approaches for the online frequency analysis of speech signals

Andreas Rauh, Susann Tiede, Cornelia Klenke

2017 22nd International Conference on Methods and Models in Automation and Robotics (MMAR) > 605 - 610

2017 22nd International Conference on Methods and Models in Automation and Robotics (MMAR)

The fundamental building block of spoken languages is a list of phonemes from which syllables and, hence, also words are formed. A systematic distinction between these phonemes becomes possible by the characteristic frequency components that are included in each sound. On the one hand, voiced phonemes are characterized by several sharp frequency components. On the other hand, wide, typically blurred...

INFONA - science communication portal

Search results

Speech enhancement using a critical point based Wiener Filter

Robust features for automatic estimation of physical parameters from speech

PSD estimation of multiple sound sources in a reverberant room using a spherical microphone array

Performance study of three different sparse adaptive filtering algorithms for echo cancellation in long acoustic impulse responses

A convex optimization approach for time-frequency mask estimation

Amplitude and phase dereverberation of harmonic signals

Vocal dysperiodicities estimation using fractional order long-term linear prediction

Enhancing speech rate estimation techniques to improve dysarthria diagnosis

A DNN regression approach to speech enhancement by artificial bandwidth extension

Noise power spectral density estimation for binaural noise reduction exploiting direction of arrival estimates

Broadband doa estimation using convolutional neural networks trained with noise signals

Multiple fundamental frequencies estimation approaches based on multi-scale product analysis

Dictionary learning for pitch estimation in speech signals

Comparison of glottal closure instant estimation algorithms for singing voices in Indian context

Robust pitch estimation in distant speech signals collected from vehicle

Overdetermined blind source separation using approximate joint diagonalization

Multi-channel estimation of power spectral density matrix using inter-frame and inter-banc information

Real time noise suppression in social settings comprising a mixture of non-stationary anc transient noise

Noise estimation with an inverse comb filter in non-stationary noise environments

Comparison of different filter approaches for the online frequency analysis of speech signals

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options