Advanced search

From:

To:

Items from 1 to 20 out of 25 results

chapter

Neural response based phoneme classification under noisy condition

Md.Shariful Alam, Wissam A. Jassim, Muhammad S.A. Zilany

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) > 175 - 179

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

Human listeners are capable of recognizing speech in noisy environment, while most of the traditional speech recognition methods do not perform well in the presence of noise. Unlike traditional Mel-frequency cepstral coefficient (MFCC)-based method, this study proposes a phoneme classification technique using the neural responses of a physiologically-based computational model of the auditory periphery...

chapter

Voice pathology detection using auto-correlation of different filters bank

Ahmed Al-nasheri, Zulfiqar Ali, Ghulam Muhammad, Mansour Alsulaiman

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 50 - 55

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

This paper investigates the contribution of frequency bands for automatic voice pathology detection. First, the input voice signal is passed through a number of time-domain band-pass filters. The center frequencies are spaced on an octave scale. Each filter output is then divided into overlapping frames. Auto-correlation function is applied to each block to find the first largest peak, in areas other...

chapter

Significance of CV transition and steady vowel regions for language identification

Dipanjan Nandi, Arup Kumar Dutta, K. Sreenivasa Rao

2014 Seventh International Conference on Contemporary Computing (IC3) > 513 - 517

2014 Seventh International Conference on Contemporary Computing (IC3)

The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for language identification (LID) task. The language-specific vocal tract information represented by Mel-frequency cepstral coefficients (MFCCs), extracted from the CV transition and steady vowel regions for LID task. The duration of CV transition and steady vowel regions are varied to analyze...

chapter

Experiments on automatic language identification for philippine languages using acoustic Gaussian Mixture Models

Ann Franchesca Laguna, Rowena Cristina Guevara

2014 IEEE REGION 10 SYMPOSIUM > 657 - 662

2014 IEEE Region 10 Symposium

A Philippine LID system has not been previously created because of the limited amount of recorded speech data. This research initiates the LID research using the Philippine Language Database (PLD) collected by the Digital Signal Processing Laboratory of the University of the Philippines Diliman (DSP-UPD). Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP), Shifted Delta...

chapter

Speaker identification in a multimodal interface

Juraj Kacur, Mario Varga, Gregor Rozinaj

Proceedings ELMAR-2013 > 191 - 194

2013 55th International Symposium ELMAR

The article presents the development of a speaker identification system as one part of the multimodal interface for the HBB-NEXT project. A short introduction to a speaker identification problem in the context of HBB-NEXT project is given. Then we focus on the design, optimization and method selection process in order to realize a real time, text independent speaker identification application, namely:...

chapter

Automatic recognition of major language families in India

Debapriya Sengupta, Goutam Saha

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI) > 1 - 4

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)

India is a vast country with a large number of languages. Among these some languages descend from a single mother language giving rise to a language family. The major official languages in India fall under two language families namely Indo-European and Dravidian. In this paper, we have discussed about a system which takes speech file as input and identifies the language family to which it belongs...

chapter

Frequency Shift Detection of Speech with GMMs AND SVMs

Hua Xing, Philipos C. Loizou

2012 IEEE Workshop on Signal Processing Systems > 215 - 219

2012 IEEE Workshop on Signal Processing Systems (SiPS)

In certain situations, speech might be shifted in the frequency domain amid the presence of noise. To be able to compensate for the spectral shift, it is important to know the amount of frequency shift present. A method based on Mel-frequency-cepstral-coefficient (MFCC) and Gaussian Mixture model (GMM) super vector is proposed for detecting frequency shifts in speech. MFCC or LFCC is extracted to...

chapter

Noise classification using Gaussian Mixture Models

Hitesh Anand Gupta, Vinay M Varma

2012 1st International Conference on Recent Advances in Information Technology (RAIT) > 821 - 825

2012 1st International Conference on Recent Advances in Information Technology (RAIT)

Gaussian Mixture Models (GMMs) have been proven effective in modeling speech and other acoustic signals. In this study, we have used GMMs to model different noise sources, viz. subway, babble, car and exhibition. Expectation maximization algorithm has been implemented to fit the model. Further, we present the ‘threshold’ method which uses the energy coefficient of the Mel - Frequency Cepstral Coefficients...

chapter

Multidirectional Local Feature for Speaker Recognition

Awais Mahmood, Mansour Alsulaiman, Ghulam Muhammad

2012 Third International Conference on Intelligent Systems Modelling and Simulation > 308 - 311

2012 3rd International Conference on Intelligent Systems, Modelling and Simulation (ISMS)

This paper proposes a new feature extraction method called multi-directional local feature (MDLF) to apply on an automatic speaker recognition system. To obtain MDLF, a linear regression is applied on FFT signal in four different directions which are horizontal (time axis), vertical (frequency axis), diagonal 45 degree (time-frequency) and diagonal 135 degree (time-frequency). In the experiments,...

chapter

Study of automatic biosounds detection and classification using SVM and GMM

Bor Jenq Chua, Xue Jun Li, Huy Dat Tran

2011 IEEE/NIH Life Science Systems and Applications Workshop (LiSSA) > 155 - 158

2011 IEEE/NIH 5th Life Science Systems and Applications Workshop (LiSSA)

Ambulatory devices can be used to detect heart diseases and save lives in critical time. These devices are based on sound classification that usually adopts a suitable data mining algorithm. This paper investigates the performance of Support Vector Machine (SVM) and Gaussian Mixture Model (GMM) classifiers in classifying sound samples. SVM classifier makes use of a linearly separable hyperplane to...

chapter

Comparison of features extracted using time-frequency and frequency-time analysis approach for text-independent speaker identification

N Sen, T Basu, S Chakroborty

2011 National Conference on Communications (NCC) > 1 - 5

2011 National Conference on Communications (NCC)

This paper compares the feature sets extracted using time-frequency analysis approach and frequency-time analysis approach for text-independent speaker identification. Mel-frequency cepstral coefficient (MFCC) feature set and Inverted Mel-frequency cepstral coefficient (IMFCC) feature set are extracted using time-frequency analysis approach. Temporal energy subband cepstral coefficient (TESBCC) feature...

chapter

Towards better GMM-based acoustic modeling for spoken language identification

Fahime Ghasemian, Mohammad Mahdi Homayounpour

2011 19th Iranian Conference on Electrical Engineering > 1 - 4

2011 19th Iranian Conference on Electrical Engineering (ICEE)

Gaussian Mixture Model (GMM) is a widely used, simple and effective modeling approach for spoken language identification. Traditionally EM algorithm is used to train this model. In this paper we propose a new method named WA-GMM (Weight Adapted GMM) for estimating the weights of GMM Gaussian components using bag-of-unigram and Support Vector Machine (SVM): SVM weights which are trained on bag-of-unigram...

chapter

Performance improvement in automatic gender identification using hierarchical clustering

M A Keyvanrad, M M Homayounpour

2010 5th International Symposium on Telecommunications > 900 - 903

2010 5th International Symposium on Telecommunications (IST)

In this paper a hierarchical structure is proposed for automatic gender identification (AGI). In this structure two clustering techniques are used. The first technique is divisive clustering for dividing speakers from each gender to some classes of speakers. The second clustering technique is agglomerative clustering for creating a hierarchical structure. Feature reduction is done by SOAP feature...

chapter

New features extracted from Nyquist filter bank for text-independent speaker identification

N Sen, T K Basu, H A Patil

2010 Annual IEEE India Conference (INDICON) > 1 - 5

2010 Annual IEEE India Conference (INDICON 2010)

This paper introduces the use of a new method of feature extraction based on frequency-time analysis approach for text-independent speaker identification. The impetus for this new feature extraction technique comes from the filter bank summation method of STFT using Nyquist filter bank. The focus of this work is on applications which yield higher identification accuracy without increasing the computational...

chapter

Excited commentator speech detection with unsupervised model adaptation for soccer highlight extraction

Yi Sun, Zhijian Ou, Wei Hu, Yimin Zhang

2010 International Conference on Audio, Language and Image Processing > 747 - 751

2010 International Conference on Audio, Language and Image Processing (ICALIP)

Soccer highlight detection is an active research topic in recent years. In this paper, we present our effort to detect an important audio keyword - excited commentator speech, which contributes to a state-of-the-art soccer highlight extraction system. We propose an approach of using statistical classifier based on Gaussian mixture models (GMMs) with unsupervised model adaptation. The excited speech...

chapter

Tone pronunciation quality scoring of Mandarin multi-syllable words

Junbo Zhang, Hemin Wu, Yonghong Yan

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 545 - 548

2010 10th International Conference on Signal Processing (ICSP 2010)

This paper discusses tone pronunciation scoring for Mandarin multi-syllable words in Computer Assisted Language Learning (CALL) System. A commonly used tone evaluation method is using GMM to model various pitch sequence. Because the pattern of pitch sequence will change a lot in the multisyllable context, tone models trained on mono-tone database will not have good performance on multi-syllable speech...

chapter

Visualized Feature Fusion and Style Evaluation for Musical Genre Analysis

Qingjun Yao, Haifeng Li, Jiayin Sun, Lin Ma

2010 First International Conference on Pervasive Computing, Signal Processing and Applications > 883 - 886

2010 First International Conference on Pervasive Computing, Signal Processing and Applications (PCSPA 2010)

Different kinds of features in time domain, spectral domain and cepstral domain are used for musical genre classification. In this paper, through the fusion of short-term timbral features and long-term rhythmic feature, we propose a novel method where: musical genre vector is constructed using the likelihood ratio of GMM (Gaussian Mixture Model) and radar chart is applied to provide visualized style...

chapter

Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, more

2010 20th International Conference on Pattern Recognition > 4565 - 4568

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, we consider speaker identification for the co-channel scenario in which speech mixture from speakers is recorded by one microphone only. The goal is to identify both of the speakers from their mixed signal. High recognition accuracies have already been reported when an accurately estimated signal-to-signal ratio (SSR) is available. In this paper, we approach the problem without estimating...

chapter

Significant improvement in the closed set text-independent speaker identification using features extracted from Nyquist filter bank

N Sen, T K Basu, H A Patil

2010 5th International Conference on Industrial and Information Systems > 303 - 308

2010 5th International Conference on Industrial and Information Systems (ICIIS 2010)

This paper introduces the use of a new method of feature extraction for robust text-independent speaker identification. The focus of this work is on applications which yield higher identification accuracy without increasing the computational effort. The impetus for this new feature extraction technique comes from a new transformation which is based on the Nyquist filter bank. We have proposed this...

chapter

Research on Adaptive Speaker Identification Based on GMM

Yuhuan Zhou, Jinming Wang, Xiongwei Zhang

2009 International Forum on Computer Science-Technology and Applications > 2 > 330 - 332

2009 International Forum on Computer Science-Technology and Applications (IFCSTA 2009)

In this paper, an adaptive speaker identification method combined with the human behavioral trait based on Gaussian mixture model (GMM) is constructed. The method can automatically select different length of speech for different speakers in identification process according to the feedback probability estimation, so it can guarantee identification accuracy without reducing, and to reduce the identification...

Keywords:
ACCURACY
SPEECH
GMM

Publication date

Set your own date range

Publication type

book (24)
article (1)

Keywords

FEATURE EXTRACTION (17)
MEL FREQUENCY CEPSTRAL COEFFICIENT (12)
SPEAKER RECOGNITION (11)
GAUSSIAN PROCESSES (10)
TRAINING (10)
MFCC (9)
SPEECH RECOGNITION (9)
GAUSSIAN MIXTURE MODEL (7)
SPEAKER IDENTIFICATION (7)
SPEECH PROCESSING (7)
SUPPORT VECTOR MACHINES (6)
SVM (5)
ADAPTATION MODEL (4)
CEPSTRAL ANALYSIS (4)
COMPUTATIONAL MODELING (4)
CLASSIFICATION ALGORITHMS (3)
DATABASES (3)
FILTER BANK (3)
GAUSSIAN MIXTURE MODELS (3)
GAUSSIAN MIXTURE SPEAKER MODEL (3)
HIDDEN MARKOV MODELS (3)
NYQUIST FILTER (3)
SIGNAL CLASSIFICATION (3)
ACOUSTICS (2)
ALGORITHM DESIGN AND ANALYSIS (2)
AUDIO SIGNAL PROCESSING (2)
CHANNEL BANK FILTERS (2)
CLUSTERING ALGORITHMS (2)
CONFERENCES (2)
EQUATIONS (2)
FILTER BANKS (2)
GAIN (2)
LANGUAGE IDENTIFICATION (2)
MAP ADAPTATION (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (2)
MUSIC (2)
NOISE (2)
NOISE MEASUREMENT (2)
PATTERN CLASSIFICATION (2)
PATTERN CLUSTERING (2)
POLYCOST DATABASE (2)
PROBABILITY DENSITY FUNCTION (2)
ROBUSTNESS (2)
STATISTICAL ANALYSIS (2)
TEXT-INDEPENDENT SPEAKER IDENTIFICATION (2)
TIME FREQUENCY ANALYSIS (2)
VECTOR QUANTIZATION (2)
ACOUSTIC (1)
ACOUSTIC ANALYSIS (1)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTATION (1)
ADAPTATION MODELS (1)
ADAPTIVE SPEAKER IDENTIFICATION (1)
ADOLESCENTS (1)
AGGLOMERATIVE CLUSTERING (1)
AMBULATORY DEVICE (1)
ARABIC SPEAKER RECOGNITION (1)
ARTIFICIAL NEURAL NETWORKS (1)
ATTENUATION (1)
AUDIO CLASSIFICATION (1)
AUDIO KEYWORD-EXCITED COMMENTATOR SPEECH (1)
AUDITORY NERVE MODEL (1)
AUTO-CORRELATION (1)
AUTOMATIC BIOSOUND DETECTION (1)
AUTOMATIC GENDER IDENTIFICATION (1)
BAG-OF-UNIGRAM (1)
BAND-PASS FILTERS (1)
BEAT HISTOGRAM (1)
BIOSOUND CLASSIFICATION (1)
BOOKS (1)
CALL (1)
CARDIOLOGY (1)
CHARACTER RECOGNITION (1)
CLASSIFICATION (1)
CLASSIFICATION RUN TIME (1)
CLINICAL DEPRESSION (1)
CLOSED SET TEXT-INDEPENDENT SPEAKER IDENTIFICATION (1)
CLUSTERING (1)
CLUSTERING METHODS (1)
CO-CHANNEL SPEECH (1)
COCHANNEL SPEECH SIGNALS (1)
COGNITION (1)
COMPANIES (1)
COMPUTER ASSISTED LANGUAGE LEARNING SYSTEM (1)
COMPUTERS (1)
CORRELATION (1)
CV TRANSITION REGION (1)
DATA MINING (1)
DATA MINING ALGORITHM (1)
DATA MODELS (1)
DECISION MAKING (1)
DECISION TREE RULE (1)
DECISION TREES (1)
DIGITAL FILTERS (1)
DIGITAL SIGNAL PROCESSING (1)
DISCRETE FOURIER TRANSFORMS (1)
more

INFONA - science communication portal

Advanced search

Advanced search

Neural response based phoneme classification under noisy condition

Voice pathology detection using auto-correlation of different filters bank

Significance of CV transition and steady vowel regions for language identification

Experiments on automatic language identification for philippine languages using acoustic Gaussian Mixture Models

Speaker identification in a multimodal interface

Automatic recognition of major language families in India

Frequency Shift Detection of Speech with GMMs AND SVMs

Noise classification using Gaussian Mixture Models

Multidirectional Local Feature for Speaker Recognition

Study of automatic biosounds detection and classification using SVM and GMM

Comparison of features extracted using time-frequency and frequency-time analysis approach for text-independent speaker identification

Towards better GMM-based acoustic modeling for spoken language identification

Performance improvement in automatic gender identification using hierarchical clustering

New features extracted from Nyquist filter bank for text-independent speaker identification

Excited commentator speech detection with unsupervised model adaptation for soccer highlight extraction

Tone pronunciation quality scoring of Mandarin multi-syllable words

Visualized Feature Fusion and Style Evaluation for Musical Genre Analysis

Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

Significant improvement in the closed set text-independent speaker identification using features extracted from Nyquist filter bank

Research on Adaptive Speaker Identification Based on GMM

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options