Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

For this year's 36th edition of ICASSP, we received 2946 submissions, which is probably an all-time high: it represents an increase of 5% over last year and 12% over two years ago. The overall acceptance rate was 49%. Distributed over the various technical areas, as covered by the Signal Processing Society Technical Committees (TCs), the submission statistics are as follows:

chapter

General chair's message

Petr Tichavsky, Honza Cernocky, Ales Prochazka

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > xvii - xviii

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The organizing committee of ICASSP 2011 is delighted to welcome you to the 36th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), which is being held at the Prague Congress Centre, May 22–27, 2011. This is the flagship conference for the IEEE Signal Processing Society. In 1997, ICASSP was held in Munich, Germany, and now, in 2011, ICASSP is back in Central Europe....

chapter

Future SPS conferences

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > xxii

chapter

Combining HMM-based melody extraction and NMF-based soft masking for separating voice and accompaniment from monaural audio

Yun Wang, Zhijian Ou

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1 - 4

Modern monaural voice and accompaniment separation systems usually consist of two main modules: melody extraction and time-frequency masking. A main distinction between different separation systems lies in what approaches are used for the two modules. Popular techniques for melody extraction include hidden Markov models (HMMs) and non-negative matrix factorization (NMF), and masking includes hard...

chapter

Adaptation of source-specific dictionaries in Non-Negative Matrix Factorization for source separation

Xabier Jaureguiberry, Pierre Leveau, Simon Maller, Juan Jose Burred

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5 - 8

This paper concerns the adaptation of spectrum dictionaries in audio source separation with supervised learning. Supposing that samples of the audio sources to separate are available, a filter adaptation in the frequency domain is proposed in the context of Non-Negative Matrix Factorization with the Itakura-Saito divergence. The algorithm is able to retrieve the acoustical filter applied to the sources...

chapter

An acoustically-motivated spatial prior for under-determined reverberant source separation

Ngoc Q. K. Duong, Emmanuel Vincent, Remi Gribonval

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 9 - 12

We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random vector with full-rank spatial covariance matrix. We introduce an inverse Wishart prior over the covariance matrices, whose mean is given by the theory of statistical room acoustics and whose variance...

chapter

Resolving FD-BSS permutation for arbitrary array in presence of spatial aliasing

Jani Even, Norihiro Hagita

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 13 - 16

This paper presents a novel method for solving the permutation problem inherent to frequency domain blind signal separation of multiple simultaneous speakers. Like conventional methods, the proposed method exploits the direction of arrival (DOA) of the different speakers to resolve the permutation. However, it is designed to exploit the information from pairs of microphones that are usually discarded because...

chapter

A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics

Gautham J. Mysore, Paris Smaragdis

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 17 - 20

We present a semi-supervised source separation methodology to denoise speech by modeling speech as one source and noise as the other source. We model speech using the recently proposed non-negative hidden Markov model, which uses multiple non-negative dictionaries and a Markov chain to jointly model spectral structure and temporal dynamics of speech. We perform separation of the speech and noise using...

chapter

Itakura-Saito nonnegative matrix factorization with group sparsity

Augustin Lefevre, Francis Bach, Cedric Fevotte

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 21 - 24

We propose an unsupervised inference procedure for audio source separation. Components in nonnegative matrix factorization (NMF) are grouped automatically into audio sources via a penalized maximum likelihood approach. The penalty term we introduce favors sparsity at the group level, and is motivated by the assumption that the local amplitudes of the sources are independent. Our algorithm extends multiplicative...

chapter

Multipitch estimation by joint modeling of harmonic and transient sounds

Jun Wu, Emmanuel Vincent, Stanislaw Andrzej Raczynski, Takuya Nishimoto, et al.

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 25 - 28

Multipitch estimation techniques are widely used for music transcription and acquisition of musical data from digital signals. In this paper, we propose a flexible harmonic temporal timbre model to decompose the spectral energy of the signal in the time-frequency domain into individual pitched notes. Each note is modeled with a 2-dimensional Gaussian mixture. Unlike previous approaches, the proposed...

chapter

Frequency selective pitch transposition of audio signals

Sascha Disch, Bernd Edler

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 29 - 32

Modern music production often uses pre-recorded pieces of audio, so-called samples, taken from a huge sample database. Consequently, there is an increasing demand to extensively adapt these samples to their intended new musical environment in a flexible way. Such an application, for instance, retroactively changes the key mode of audio recordings, e.g., from a major key to a minor key by a frequency...

chapter

Improving melody extraction using Probabilistic Latent Component Analysis

Jinyu Han, Ching-Wei Chen

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 33 - 36

We propose a new approach for automatic melody extraction from polyphonic audio, based on Probabilistic Latent Component Analysis (PLCA). An audio signal is first divided into vocal and non-vocal segments using a trained Gaussian Mixture Model (GMM) classifier. A statistical model of the non-vocal segments of the signal is then learned adaptively from this particular input music by PLCA. This model...
