Advanced search

From:

To:

Items from 1 to 20 out of 93 results

chapter

Vowel-non vowel classification of speech using an MLP and rules

John Sirigos, Vassilis Darsinos, Nikos Fakotakis, George Kokkinakis

1996 8th European Signal Processing Conference (EUSIPCO 1996) > 1 - 4

1996 8th European Signal Processing Conference (EUSIPCO 1996)

In this paper we present a high precision speaker independent vowel/non vowel classifier based on a simple feed forward MLP (Multi Layer Perceptron) and several rules. RASTA-PLP analysis of the speech signal resulting to mel-cepstral coefficients and a formant tracking method are used in order to provide the feature vectors for the MLP. To train and test the system we used a part of the TIMIT database...

chapter

Text-constrained speaker verification using fuzzy C means vector quantization

Debnath Saswati, Soni Badal, Das Pradip K.

2015 International Conference on Communications and Signal Processing (ICCSP) > 1511 - 1515

2015 International Conference on Communications and Signal Processing (ICCSP)

The most successful approach to speech and speaker recognition is to treat the speech signal as a stochastic pattern and to use a statistical pattern recognition technique for matching utterances. This paper attempts to study the performance of Text dependent speaker verification system using Delta-Delta Mel Frequency Cepstral Coefficients (MFCC-Δ-Δ) feature vector and Fuzzy C means (FCM) speaker...

chapter

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Khan Suhail Ahmad, Anil S. Thosar, Jagannath H. Nirmal, Vinay S. Pande

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This paper motivates the use of combination of mel frequency cepstral coefficients (MFCC) and its delta derivatives (DMFCC and DDMFCC) calculated using mel spaced Gaussian filter banks for text independent speaker recognition. MFCC modeled on the human auditory system shows robustness against noise and session changes and hence has become synonymous with speaker recognition. Our main aim is to test...

chapter

A reliable speaker verification system based on LPCC and DTW

Rekha Nair, Nirmala Salam

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Human voice can serve as a password/key for access to various services. This voice is used for verifying speaker in speaker verification system based on the features extracted from the voice signal. In automated speaker verification the speaker's voice signal is processed to extract speaker-specific information which is used to generate voiceprint also known as a template that cannot be replicated...

chapter

Variational learning and inference algorithms for extended Gaussian mixture model

Xin Wei, Jianxin Chen, Lei Wang, Jingwu Cui, more

2014 IEEE/CIC International Conference on Communications in China (ICCC) > 236 - 240

2014 IEEE/CIC International Conference on Communications in China (ICCC)

In this paper, in order to properly evaluate the relative importance of priors and observed data in the Bayesian framework, we propose an extended Gaussian mixture model (EGMM) and design the corresponding learning inference algorithms. First, we define the likelihood function of the EGMM and then propose the variational learning algorithm for this EGMM. Moreover, the proposed model and approach are...

chapter

Comparative Analysis of Prosodic Features and Linear Predictive Coefficients for Speaker Recognition Using Machine Learning Technique

Varinder Singh Baidwan, Shruti Gujral

2014 International Conference on Devices, Circuits and Communications (ICDCCom) > 1 - 8

2014 International Conference on Devices, Circuits and Communications (ICDCCom)

Speaker recognition is a biometric identification method that uses different features of individual's voice for automatically identifying a speaker among a population. Two different features set for text dependent speaker recognition. A comparison is performed between Linear Predictive Coefficients (LPC) and Prosodic Features (F0, F1, F2, and F3) along with Radial Basis Function Network (RBFN) for...

chapter

Experimental framework for mel-scaled LP based Bangla speech recognition

Umme Muslima, M. Babul Islam

16th Int'l Conf. Computer and Information Technology > 56 - 59

2013 16th International Conference on Computer and Information Technology (ICCIT)

This paper deals with the recognition process of Bangla speech. The used database consists of two sets of data - one is for training containing 3824 utterances of Bangla digit sequences of 25 male and 25 female speakers and the other one is test dataset containing 1985 utterances of 26 male and 26 female speakers. The test set is subdivided into four groups such as clean1, clean2, clean3 and clean4...

chapter

Spectral matching based voice activity detector for improved speaker recognition

K. T. Sreekumar, Kuruvachan K. George, K. Arunraj, C. Santhosh Kumar

2014 International Conference on Power Signals Control and Computations (EPSCICON) > 1 - 4

2014 International Conference on Power Signals Control and Computations (EPSCICON)

For spoken language processing applications like speaker recognition/verification, not only that the silence segments do not contribute any speaker specific information, but also it dilutes the already available information content in the speech segments in the audio data. It has been experimentally studied that removing silence segments with the help of a voice activity detector(VAD) from the utterance...

chapter

A Map-Reduce based fast speaker recognition

Fei Wang, Mingqing Liao

2013 9th International Conference on Information, Communications & Signal Processing > 1 - 5

2013 9th International Conference on Information, Communications & Signal Processing (ICICS)

In text-independent speaker identification, there are a large number of likelihood computations, especial in large population. To speed up the recognition, we proposed a lightweight algorithm called CBF (Codebook Filtering). CBF provides two phase of speaker pruning to accelerate the speaker recognition. To make CBF could process large population, this paper implements CBF on Map-Reduce framework...

chapter

GMM and i-vector based speaker verification using speaker-specific-text for short utterances

B. Bharathi, T. Nagarajan

2013 IEEE International Conference of IEEE Region 10 (TENCON 2013) > 1 - 4

TENCON 2013 - 2013 IEEE Region 10 Conference

In speaker recognition tasks, one of the reasons for reduced accuracy is due to closely resembling speakers in the acoustic space. In order to increase the discriminative power of the classifier, the system must be able to use only the unique features of a given speaker with respect to his/her acoustically resembling speaker. This paper proposes a technique to reduce the confusion errors, by finding...

chapter

Vocal source features for bilingual speaker identification

Jianglin Wang, Michael T. Johnson

2013 IEEE China Summit and International Conference on Signal and Information Processing > 170 - 173

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

This paper introduces the use of two new features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC) and Glottal Flow Cepstrum Coefficients (GLFCC), to capture speaker-specific characteristics from their vocal excitation patterns. Results on a cross-lingual speaker identification task taken from the NIST 2004 SRE demonstrate that these RPCC and GLFCC features are significantly...

chapter

Speaker Recognition System: Vulnerable and Challenges

Naufal Alee, Phaklen Ehkan, R. Badlishah Ahmad, Shahrul Nizam Yaakob, more

2013 International Conference on Information Science and Applications (ICISA) > 1 - 4

2013 International Conference on Information Science and Applications (ICISA)

Recently speaker recognition system became high interesting by researchers for both software and hardware solutions. Different technologies have been adopted to implement speaker recognition system that has performance with optimal time response with acceptable accuracy. Research progresses are going on to provide highly durable and precise recognition system that can be embedded into critical implementation...

chapter

Speaker identification in a multimodal interface

Juraj Kacur, Mario Varga, Gregor Rozinaj

Proceedings ELMAR-2013 > 191 - 194

2013 55th International Symposium ELMAR

The article presents the development of a speaker identification system as one part of the multimodal interface for the HBB-NEXT project. A short introduction to a speaker identification problem in the context of HBB-NEXT project is given. Then we focus on the design, optimization and method selection process in order to realize a real time, text independent speaker identification application, namely:...

chapter

Single-channel speaker-pair identification: A new approach based on automatic frame selection

Ramji Srinivasan, Ji Ming, Danny Crookes

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4369 - 4372

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Given single-channel recordings of simultaneous speakers, we may need to identify the individual speakers for separating their voices. In this paper, we consider the problem of identifying two simultaneous speakers based on single-channel data, i.e., speakerpair identification. We model the problem as identifying speakers using noisy speech with partial temporal corruption, which corresponds to the...

chapter

Multidirectional Local Feature for Speaker Recognition

Awais Mahmood, Mansour Alsulaiman, Ghulam Muhammad

2012 Third International Conference on Intelligent Systems Modelling and Simulation > 308 - 311

2012 3rd International Conference on Intelligent Systems, Modelling and Simulation (ISMS)

This paper proposes a new feature extraction method called multi-directional local feature (MDLF) to apply on an automatic speaker recognition system. To obtain MDLF, a linear regression is applied on FFT signal in four different directions which are horizontal (time axis), vertical (frequency axis), diagonal 45 degree (time-frequency) and diagonal 135 degree (time-frequency). In the experiments,...

chapter

Personalized voice command systems in multi modal user interface

Evelyn Kurniawati, Luca Celetto, Nicola Capovilla, Sapna George

2012 IEEE International Conference on Emerging Signal Processing Applications > 45 - 47

2012 IEEE International Conference on Emerging Signal Processing Applications (ESPA 2012)

The goal of this paper is to describe the voice command system as part of the multi modal user interface for residential application project demoed at CES 2012. The application is a 3D TV panel which can be controlled through face recognition, gesture, and speech. The speech interface is invoked using activation keyword, and terminated in similar fashion with de-activation keyword. Speaker recognition...

chapter

Speaker identification in smart environments with multilayer perceptron

Jasmina Novakovic

2011 19thTelecommunications Forum (TELFOR) Proceedings of Papers > 1418 - 1421

2011 19th Telecommunications Forum Telfor (TELFOR)

This paper presents reliability of MLP in speaker identification using characteristics extracted from their voices. Classification accuracy depends on speaking condition and varies up to 23% depending on the selected speaking condition. Results of simulation experiment show that MLP is effective in speaker identification, especially in the case of retelling and synchronous speech where we achieved...

chapter

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

Ruiling Luo, Wenqing Cai, Min Chen, Zhongling Han

2011 3rd International Workshop on Intelligent Systems and Applications > 1 - 4

2011 3rd International Workshop on Intelligent Systems and Applications (ISA)

SVM is a novel type of statistical learning method that has been successfully used in speaker recognition. However, training SVM consumes long computing time and large storage space with all training examples. This paper proposes an improved sparse least-squares support vector machine (LS-SVM) for speaker identification. Firstly KPCA is exploited to reduce the dimension of input vectors and to denoise...

chapter

User verification: Matching the uploaders of videos across accounts

Howard Lei, Jaeyoung Choi, Adam Janin, Gerald Friedland

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2404 - 2407

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This article presents an attempt to link the uploaders of videos based on the audio track of the videos. Using a subset of the MediaEval [10] Placing Task's Flickr video set, which is labeled with the uploader's name, we conducted an experiment with a similar setup as a typical NIST speaker recognition evaluation run. Based on the assumption that the audio might be matched in various ways (speaker,...

chapter

Simplification and optimization of i-vector extraction

Ondrej Glembek, Lukas Burget, Pavel Matejka, Martin Karafiat, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4516 - 4519

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces some simplifications to the i-vector speaker recognition systems. I-vector extraction as well as training of the i-vector extractor can be an expensive task both in terms of memory and speed. Under certain assumptions, the formulas for i-vector extraction—also used in i-vector extractor training—can be simplified and lead to a faster and memory more efficient code. The first...

Keywords:
ACCURACY
SPEECH
SPEAKER RECOGNITION

Publication date

Set your own date range

INFONA - science communication portal

Advanced search

Advanced search

Vowel-non vowel classification of speech using an MLP and rules

Text-constrained speaker verification using fuzzy C means vector quantization

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

A reliable speaker verification system based on LPCC and DTW

Variational learning and inference algorithms for extended Gaussian mixture model

Comparative Analysis of Prosodic Features and Linear Predictive Coefficients for Speaker Recognition Using Machine Learning Technique

Experimental framework for mel-scaled LP based Bangla speech recognition

Spectral matching based voice activity detector for improved speaker recognition

A Map-Reduce based fast speaker recognition

GMM and i-vector based speaker verification using speaker-specific-text for short utterances

Vocal source features for bilingual speaker identification

Speaker Recognition System: Vulnerable and Challenges

Speaker identification in a multimodal interface

Single-channel speaker-pair identification: A new approach based on automatic frame selection

Multidirectional Local Feature for Speaker Recognition

Personalized voice command systems in multi modal user interface

Speaker identification in smart environments with multilayer perceptron

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

User verification: Matching the uploaders of videos across accounts

Simplification and optimization of i-vector extraction

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options