Items from 1 to 20 out of 36 results

chapter

Noise robust speech recognition system using Mel cepstral and genetic algorithm

Garg Mamta, Arora Ajat Shatru, Gupta Savita

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 3151 - 3155

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

This paper suggests a technique based on MFCC analysis of audio signals, with application to speech classification. The proposed work uses multi-resolution (wavelet) analysis and spectral-analysis-based features for feature extraction. The proposed approach uses a number of features, such as Mel Frequency Cepstral Coefficients (MFCC) and FFT coefficients, combined with wavelet-based features. In addition, accuracy...

chapter

Feature extraction analysis on Indonesian speech recognition system

Untari N. Wisesty, Adiwijaya, Widi Astuti

2015 3rd International Conference on Information and Communication Technology (ICoICT) > 54 - 58

2015 3rd International Conference on Information and Communication Technology (ICoICT)

Speech recognition is widely applied to speech-to-text and speech-to-emotion tasks in order to make gadgets and computers easier to use, or to help people with hearing disabilities. Feature extraction is one of the significant steps affecting the performance of speech recognition. Therefore, proper selection is needed. In this paper, we analyze feature extraction that can have good performance for Indonesian speech...

chapter

Text-constrained speaker verification using fuzzy C means vector quantization

Debnath Saswati, Soni Badal, Das Pradip K.

2015 International Conference on Communications and Signal Processing (ICCSP) > 1511 - 1515

2015 International Conference on Communications and Signal Processing (ICCSP)

The most successful approach to speech and speaker recognition is to treat the speech signal as a stochastic pattern and to use a statistical pattern recognition technique for matching utterances. This paper attempts to study the performance of a text-dependent speaker verification system using the Delta-Delta Mel Frequency Cepstral Coefficients (MFCC-Δ-Δ) feature vector and Fuzzy C means (FCM) speaker...

chapter

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Khan Suhail Ahmad, Anil S. Thosar, Jagannath H. Nirmal, Vinay S. Pande

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This paper motivates the use of a combination of mel frequency cepstral coefficients (MFCC) and their delta derivatives (DMFCC and DDMFCC), calculated using mel-spaced Gaussian filter banks, for text-independent speaker recognition. MFCC, modeled on the human auditory system, shows robustness against noise and session changes and hence has become synonymous with speaker recognition. Our main aim is to test...

chapter

Neural response based phoneme classification under noisy condition

Md. Shariful Alam, Wissam A. Jassim, Muhammad S.A. Zilany

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) > 175 - 179

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

Human listeners are capable of recognizing speech in noisy environments, while most traditional speech recognition methods do not perform well in the presence of noise. Unlike traditional Mel-frequency cepstral coefficient (MFCC)-based methods, this study proposes a phoneme classification technique using the neural responses of a physiologically based computational model of the auditory periphery...

chapter

Analyzing the Impact of MFCC and LDA for the Development of Isolated Pashto Spoken Numbers ASR

Tanzeela, Arbab Waseem Abbas, Zakir Ali, Burhan Uddin

2014 12th International Conference on Frontiers of Information Technology > 350 - 354

2014 12th International Conference on Frontiers of Information Technology (FIT)

This paper presents an analysis of speaker-independent isolated Pashto spoken numbers for automatic speech recognition. Initially a database was developed; it encompasses isolated Pashto numbers from sefer (0) to sul (100). Fifty speakers (25 male and 25 female, of different ages) who can frequently speak the Yousafzai dialect were selected for recording. The recording has...

chapter

A new direct access framework for speaker identification system

Hery Heryanto, Saiful Akbar, Benhard Sitohang

2014 International Conference on Data and Software Engineering (ICODSE) > 1 - 5

2014 International Conference on Data and Software Engineering (ICODSE)

We present in this paper a new Direct Access Framework (DAF) for a speaker identification system, which identifies a speaker based on original characteristics of the human voice. The direct access method is a process that identifies an object based on parts of the object itself; these parts are called original characteristics. The proposed framework consists of two parts, the enrolment process and the identification process...

chapter

Inter comparison of classification techniques for vowel speech imagery using EEG sensors

Anaum Riaz, Sana Akhtar, Shanza Iftikhar, Amir Ali Khan, et al.

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 712 - 717

2014 2nd International Conference on Systems and Informatics (ICSAI)

The use of Electroencephalography (EEG) in the domain of Brain Computer Interface is now commonplace. EEG for imagined speech reproduction and observation of brain response to audio stimuli are active areas of research. In this paper, we consider the case of imagined and mouthed non-audible speech recorded with EEG electrodes. We analyze different feature extraction techniques such as Mel Frequency...

chapter

Gender specific emotion recognition through speech signals

Vinay, Shilpi Gupta, Anu Mehra

2014 International Conference on Signal Processing and Integrated Networks (SPIN) > 727 - 733

2014 International Conference on Signal Processing and Integrated Networks (SPIN)

This paper proposes an emotion recognition system that recognizes a person's emotional state from the speech signal. The aim of the proposed solution is to improve the interaction between humans and computers. The emotion recognition system must be capable of recognizing at least six basic emotions (happiness, anger, surprise, disgust, fear, sadness) and the neutral state. The proposed system...

chapter

Speaker identification in a multimodal interface

Juraj Kacur, Mario Varga, Gregor Rozinaj

Proceedings ELMAR-2013 > 191 - 194

2013 55th International Symposium ELMAR

The article presents the development of a speaker identification system as one part of the multimodal interface for the HBB-NEXT project. A short introduction to a speaker identification problem in the context of HBB-NEXT project is given. Then we focus on the design, optimization and method selection process in order to realize a real time, text independent speaker identification application, namely:...

chapter

Automatic recognition of major language families in India

Debapriya Sengupta, Goutam Saha

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI) > 1 - 4

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)

India is a vast country with a large number of languages. Some of these languages descend from a single mother language, giving rise to a language family. The major official languages in India fall under two language families, namely Indo-European and Dravidian. In this paper, we discuss a system that takes a speech file as input and identifies the language family to which it belongs...

chapter

Frequency Shift Detection of Speech with GMMs and SVMs

Hua Xing, Philipos C. Loizou

2012 IEEE Workshop on Signal Processing Systems > 215 - 219

2012 IEEE Workshop on Signal Processing Systems (SiPS)

In certain situations, speech might be shifted in the frequency domain amid the presence of noise. To be able to compensate for the spectral shift, it is important to know the amount of frequency shift present. A method based on Mel-frequency-cepstral-coefficient (MFCC) and Gaussian Mixture model (GMM) super vector is proposed for detecting frequency shifts in speech. MFCC or LFCC is extracted to...

chapter

Fourier-Bessel cepstral coefficients for robust speech recognition

Chetana Prakash, Suryakanth V. Gangashetty

2012 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2012 International Conference on Signal Processing and Communications (SPCOM)

In this paper we propose Fourier-Bessel cepstral coefficients (FBCC) features for robust speech recognition. The Fourier-Bessel representation of the speech signal is obtained using Bessel functions as a basis set. The FBCC are extracted from zeroth-order Bessel coefficients, taking into account the perceptual characteristics of the human auditory system. Recognition accuracy is measured using the CMU...

chapter

Noise classification using Gaussian Mixture Models

Hitesh Anand Gupta, Vinay M Varma

2012 1st International Conference on Recent Advances in Information Technology (RAIT) > 821 - 825

2012 1st International Conference on Recent Advances in Information Technology (RAIT)

Gaussian Mixture Models (GMMs) have been proven effective in modeling speech and other acoustic signals. In this study, we have used GMMs to model different noise sources, viz. subway, babble, car and exhibition. The expectation-maximization algorithm has been implemented to fit the model. Further, we present the ‘threshold’ method, which uses the energy coefficient of the Mel-Frequency Cepstral Coefficients...

chapter

Multidirectional Local Feature for Speaker Recognition

Awais Mahmood, Mansour Alsulaiman, Ghulam Muhammad

2012 Third International Conference on Intelligent Systems Modelling and Simulation > 308 - 311

2012 3rd International Conference on Intelligent Systems, Modelling and Simulation (ISMS)

This paper proposes a new feature extraction method called multi-directional local feature (MDLF) to apply on an automatic speaker recognition system. To obtain MDLF, a linear regression is applied on FFT signal in four different directions which are horizontal (time axis), vertical (frequency axis), diagonal 45 degree (time-frequency) and diagonal 135 degree (time-frequency). In the experiments,...

chapter

Speaker identification system based on a web interface

Juraj Kacur, Ivan Lapin, Juraj Durajka, Gregor Rozinaj

Proceedings ELMAR-2012 > 191 - 194

2012 54th International Symposium ELMAR

We present a new web-based application designed for human-computer interfaces that currently supports a speaker identification module. It is based on Java EE and Spring Framework and is designed to be invoked by users through their Internet browsers. Due to a flexible design, various feature extraction methods, signal processing and classification algorithms can be easily implemented and used in different...

chapter

Feature extraction using fusion MFCC for continuous Marathi speech recognition

Santosh Gaikwad, Bharti Gawali, Pravin Yannawar, Suresh Mehrotra

2011 Annual IEEE India Conference > 1 - 5

2011 Annual IEEE India Conference (INDICON)

This paper presents the performance of feature extraction techniques for speech recognition, for the classification of speech represented by a particular continuous sentence model. The goal of this study is to present independent as well as comparative performances of popular appearance-based feature extraction techniques, i.e. Linear Discriminant Analysis and Mel Frequency Cepstrum Coefficient. Mel...

chapter

Spoken Arabic digits recognition based on wavelet neural networks

Xiaohui Hu, Lvjun Zhan, Yun Xue, Weixing Zhou, et al.

2011 IEEE International Conference on Systems, Man, and Cybernetics > 1481 - 1485

2011 IEEE International Conference on Systems, Man and Cybernetics - SMC

The paper describes a novel method for discrete speech recognition, applied to spoken Arabic digit recognition, by means of a wavelet neural network in which the Morlet wavelet is introduced to the hidden layer. Features are extracted from the speech signal by means of Mel Frequency Cepstral Coefficients (MFCCs), followed by vector quantization (VQ). The experimental results obtained on a spoken Arabic digit dataset proved...

chapter

Speaker identification by K-nearest neighbors: Application of PCA and LDA prior to KNN

Juraj Kacur, Radoslav Vargic, Pavol Mulinka

2011 18th International Conference on Systems, Signals and Image Processing > 1 - 4

2011 18th International Conference on Systems, Signals and Image Processing (IWSSIP)

This article presents the task of speaker identification in a closed group. It discusses the main steps of the identification process, ranging from the proper speech features to the classification methods and statistical signal processing. However, its main focus is on tuning the final system using the KNN classification method by setting the number of neighbors and reducing the feature vector dimension...

chapter

Effect of MFCC normalization on vector quantization based speaker identification

M H Shirali-Shahreza, Sajad Shirali-Shahreza

The 10th IEEE International Symposium on Signal Processing and Information Technology > 250 - 253

2010 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2010)

Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One such pre-processing step is creating delta and delta-delta coefficients and appending them to the MFCCs to create the feature vector. Another is coefficient mean normalization. In this paper, the effect of these...

Keywords:
ACCURACY
SPEECH
MFCC


INFONA - science communication portal
