Items from 1 to 20 out of 39 results

chapter

Part-of-speech labeling for Reuters database

R. Cretulescu, A. David, D. Morariu, L. Vintan

2015 19th International Conference on System Theory, Control and Computing (ICSTCC) > 117 - 122

2015 19th International Conference on System Theory, Control and Computing (ICSTCC)

Although the Vector Space Model used for document representation in information retrieval systems integrates only a small quantity of knowledge, it continues to be used because of its computational cost, execution speed, and simplicity. We try to improve this document representation by adding syntactic information such as parts of speech. In this paper, we have evaluated three different tagging algorithms...

chapter

Analyzing the Impact of MFCC and LDA for the Development of Isolated Pashto Spoken Numbers ASR

Tanzeela, Arbab Waseem Abbas, Zakir Ali, Burhan Uddin

2014 12th International Conference on Frontiers of Information Technology > 350 - 354

2014 12th International Conference on Frontiers of Information Technology (FIT)

This paper presents an analysis of speaker-independent isolated Pashto spoken numbers for automatic speech recognition. First, a database was developed that encompasses isolated Pashto numbers from sefer (0) to sul (100). Fifty speakers (25 male and 25 female, of different ages) who regularly speak the Yousafzai dialect were selected for recording. The recording has...

chapter

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, et al.

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) > 1 - 5

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture was that of the Oregon Graduate Institute's (OGI) Center for Spoken Language and Learning (CSLU). It utilizes a hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature...

chapter

Investigating the impact of phonetic cross language modeling on Arabic and English speech recognition

Yousef A. Alotaibi, Ali H. Meftah, Sid-Ahmed Selouani

2014 9th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP) > 585 - 590

2014 9th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP)

The lack of speech resources in the Arabic language is one of the most important obstacles facing speech researchers. Previously, we designed two Arabic and English automatic speech recognition systems (ASR) using two corpora: TIMIT for English language and West Point for Arabic language. Cross-language experiments were conducted using the two systems, and the results were determined with respect...

chapter

The case for sampling on very large file systems

George Goldberg, Danny Harnik, Dmitry Sotnikov

2014 30th Symposium on Mass Storage Systems and Technologies (MSST) > 1 - 11

2014 30th Symposium on Mass Storage Systems and Technologies (MSST)

Sampling has long been a prominent tool in statistics and analytics, first and foremost when very large amounts of data are involved. In the realm of very large file systems (and hierarchical data stores in general), however, sampling has mostly been ignored and for several good reasons. Mainly, running sampling in such an environment introduces technical challenges that make the entire sampling process...

chapter

Using Adaboost Algorithm along with Artificial neural networks for efficient human emotion recognition from speech

Jasdeep Singh Bhalla, Anmol Aggarwal

2013 International Conference on Control, Automation, Robotics and Embedded Systems (CARE) > 1 - 6

2013 International Conference on Control, Automation, Robotics and Embedded Systems (CARE)

Emotion recognition from speech has emerged as the most significant research area in the field of affective computing. In this paper, two emotional speech datasets have been analyzed based on gender distinction (male and female speech). This paper introduces a new approach to speech emotion recognition based on the AdaBoost classification algorithm. Artificial neural network has been...

chapter

Isolated digit recognition for Malayalam- An application perspective

Renjith S., Aju Joseph, Anish Babu K.K.

2013 International Conference on Control Communication and Computing (ICCC) > 190 - 193

2013 International Conference on Control Communication and Computing (ICCC)

Speech recognition is one of the promising technologies of the future. Voice user interfaces play an important role in many real-world applications. This paper presents speaker-independent isolated digit recognition for the Malayalam language and reveals some application areas of digit recognition. Mel-Frequency Cepstral Coefficient (MFCC) is used as feature and Hidden Markov Model (HMM) is used as the...

chapter

GMM and i-vector based speaker verification using speaker-specific-text for short utterances

B. Bharathi, T. Nagarajan

2013 IEEE International Conference of IEEE Region 10 (TENCON 2013) > 1 - 4

TENCON 2013 - 2013 IEEE Region 10 Conference

In speaker recognition tasks, one of the reasons for reduced accuracy is the presence of closely resembling speakers in the acoustic space. In order to increase the discriminative power of the classifier, the system must be able to use only the unique features of a given speaker with respect to his/her acoustically resembling speaker. This paper proposes a technique to reduce the confusion errors by finding...

chapter

Reduction of confusion pairs on different rates of speech in Telugu language

N. Usha Rani, P. N. Girija

2013 15th International Conference on Advanced Computing Technologies (ICACT) > 1 - 4

2013 15th International Conference on Advanced Computing Technologies (ICACT)

Research in the speech recognition area has made considerable progress with the tremendous growth of technology. Speech rate is one of the important factors that affect speech recognition accuracy. In the present work, training is performed on different speech rates (normal, slow, and fast), and testing is also done on different rates of speech. Error rate will increase when the major...

chapter

Survey of Automated Speaker Identification Methods

Maxim Sidorov, Alexander Schmitt, Sergey Zablotskiy, Wolfgang Minker

2013 9th International Conference on Intelligent Environments > 236 - 239

2013 9th International Conference on Intelligent Environments (IE)

In this paper we present an overview of state-of-the-art approaches for speaker identification. Due to the increased number of dialogue system applications the interest in that field has grown significantly in recent years. Nevertheless, there are many open issues in the field of automatic speaker identification. Among them the choice of the appropriate speech signal features and machine learning...

chapter

Effects of carriers on Mandarin tone categorical perception

Dazuo Wang, Xiuxiu Wang, Gang Peng

2012 8th International Symposium on Chinese Spoken Language Processing > 417 - 421

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This study investigated the effects of three different carriers on Mandarin tone perception. Three tone continua were constructed: Modified speech, synthesized speech, and nonspeech. Identification tests were conducted for the two speech continua, while discrimination tests were conducted for all the three continua. Results showed that category boundary position differed significantly between the...

chapter

Comparison of different multiclass SVM methods for speaker independent phoneme recognition

M. Cutajar, E. Gatt, I. Grech, O. Casha, et al.

2012 5th International Symposium on Communications, Control and Signal Processing > 1 - 5

2012 5th International Symposium on Communications, Control and Signal Processing (ISCCSP)

Four multiclass Support Vector Machine (SVM) methods were designed for the task of speaker-independent phoneme recognition. These are the All-at-once, One-against-all, One-against-one, and the Directed Acyclic Graph SVM (DAGSVM). The Discrete Wavelet Transform (DWT) 8 frequency band power percentages are used for feature extraction. All tests were carried out on the TIMIT database. Comparable recognition...

chapter

A personalized emotion recognition system using an unsupervised feature adaptation scheme

Tauhidur Rahman, Carlos Busso

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5117 - 5120

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

A personalized emotion recognition system aims to tune the model to recognize the expressive behaviors of a targeted person. Such a system can play an important role in various domains including call center and health care applications. Adapting any general emotion recognition system for a particular individual requires speech samples and prior knowledge about their emotional content. These assumptions...

chapter

Early prediction of major depression in adolescents using glottal wave characteristics and Teager Energy parameters

Kuan Ee Brian Ooi, Lu-Shih Alex Low, Margaret Lech, Nicholas Allen

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4613 - 4616

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Previous studies of an automated detection of Major Depression in adolescents based on acoustic speech analysis identified the glottal and the Teager Energy features as the strongest correlates of depression. This study investigates the effectiveness of these features in an early prediction of Major Depression in adolescents using a fully automated speech analysis and classification system. The prediction...

article

Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification

Sungrack Yun, Chang D. Yoo

IEEE Transactions on Audio, Speech, and Language Processing > 2012 > 20 > 2 > 585 - 598

This paper considers a learning framework for speech emotion classification using a discriminant function based on Gaussian mixture models (GMMs). The GMM parameter set is estimated by margin scaling with a loss function to reduce the risk of predicting emotions with high loss. Here, the loss function is defined as a function of a distance metric using the Watson and Tellegen's emotion model. Margin...

chapter

The Effect of Speaker and Noise Type on the Accuracy of Estimated Speech Intelligibility Using Objective Measures

Kazuhiro Kondo

2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 294 - 297

2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

Previously, we compared several objective measures to estimate the subjective speech intelligibility scores of the Japanese Diagnostic Rhyme Test (DRT). PESQ-derived MOS, segmental SNR (SNRseg), frequency-weighted segmental SNR (fwSNRseg), and composite measures were tested. We mapped these measures to their corresponding intelligibility scores using quadratic equations trained on one speaker and one...

chapter

Study of automatic biosounds detection and classification using SVM and GMM

Bor Jenq Chua, Xue Jun Li, Huy Dat Tran

2011 IEEE/NIH Life Science Systems and Applications Workshop (LiSSA) > 155 - 158

2011 IEEE/NIH 5th Life Science Systems and Applications Workshop (LiSSA)

Ambulatory devices can be used to detect heart diseases and save lives in critical time. These devices are based on sound classification that usually adopts a suitable data mining algorithm. This paper investigates the performance of Support Vector Machine (SVM) and Gaussian Mixture Model (GMM) classifiers in classifying sound samples. SVM classifier makes use of a linearly separable hyperplane to...

chapter

Speaker identification using utterances correspond to speaker-specific-text

B Bharathi, P Vijayalakshmi, T Nagarajan

IEEE Technology Students' Symposium > 171 - 174

2011 IEEE Students' Technology Symposium (TechSym)

In speaker recognition tasks, the main reason for reduced accuracy is the presence of closely resembling speakers in the acoustic space. The conventional GMM-based modelling technique captures unique features along with common features among various classes. Further, it ignores knowledge of the phonetic content of the speech. In order to increase the discriminative power of the classifier, the system must be able...

chapter

Generalized cyclic transformations in speaker-independent speech recognition

F. Muller, E. Belilovsky, A. Mertins

2009 IEEE Workshop on Automatic Speech Recognition & Understanding > 211 - 215

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

A feature extraction method is presented that is robust against vocal tract length changes. It uses the generalized cyclic transformations primarily used within the field of pattern recognition. In matching training and testing conditions, the resulting accuracies are comparable to those of MFCCs. However, in mismatching training and testing conditions with respect to the mean vocal tract length...

chapter

A Speaker Identification System Using MFCC Features with VQ Technique

A. Zulfiqar, A. Muhammad, A.M.M. Enriquez

2009 Third International Symposium on Intelligent Information Technology Application > 3 > 115 - 118

2009 Third International Symposium on Intelligent Information Technology Application

The performance of speaker identification systems has improved due to recent advances in speech processing techniques, but there is still a need for improvement in terms of text-independent speaker identification and suitable modelling techniques for voice feature vectors. It becomes difficult for a person to recognize a voice when uncontrollable noise is added to it. In this paper, feature vectors from...

Keywords:
ACCURACY
SPEECH
TESTING

Publication type

book (36)
article (3)

Keywords

TRAINING (21)
SPEECH RECOGNITION (15)
FEATURE EXTRACTION (13)
HIDDEN MARKOV MODELS (13)
ACOUSTICS (11)
DATABASES (10)
COMPUTERS (9)
SPEECH PROCESSING (9)
MEL FREQUENCY CEPSTRAL COEFFICIENT (8)
SIGNAL PROCESSING (8)
ARTIFICIAL NEURAL NETWORKS (7)
COMPLEXITY THEORY (7)
DATA MINING (7)
ESTIMATION (7)
SPEAKER RECOGNITION (7)
ALGORITHM DESIGN AND ANALYSIS (6)
CONFERENCES (6)
EQUATIONS (6)
TRAINING DATA (6)
TRANSFORMS (6)
ANALYTICAL MODELS (5)
CLASSIFICATION ALGORITHMS (5)
MATHEMATICAL MODEL (5)
NOISE (5)
PATTERN RECOGNITION (5)
REAL TIME SYSTEMS (5)
SIGNAL PROCESSING ALGORITHMS (5)
SPEAKER IDENTIFICATION (5)
COMPUTATIONAL MODELING (4)
CORRELATION (4)
DATA MODELS (4)
EDUCATIONAL INSTITUTIONS (4)
EIGENVALUES AND EIGENFUNCTIONS (4)
NATURAL LANGUAGE PROCESSING (4)
PERIODIC STRUCTURES (4)
PRESSES (4)
ROBUSTNESS (4)
SUPPORT VECTOR MACHINE CLASSIFICATION (4)
SUPPORT VECTOR MACHINES (4)
CEPSTRAL ANALYSIS (3)
DETECTION ALGORITHMS (3)
DETECTORS (3)
ELECTRONIC MAIL (3)
IMAGE RECOGNITION (3)
INSTRUMENTS (3)
LABORATORIES (3)
MACHINE LEARNING (3)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (3)
OBJECT RECOGNITION (3)
OPTIMIZATION (3)
PRINCIPAL COMPONENT ANALYSIS (3)
REVIEWS (3)
SIGNAL CLASSIFICATION (3)
SIGNAL TO NOISE RATIO (3)
USA COUNCILS (3)
VISUALIZATION (3)
WAVELET TRANSFORMS (3)
ADAPTATION MODEL (2)
BAYESIAN METHODS (2)
BIOLOGICAL SYSTEM MODELING (2)
BRIGHTNESS (2)
CARDIOLOGY (2)
CIRCUITS AND SYSTEMS (2)
COMPUTED TOMOGRAPHY (2)
COMPUTER ARCHITECTURE (2)
COMPUTER SCIENCE (2)
COMPUTER VISION (2)
CONTEXT MODELING (2)
CONVOLUTION (2)
COVARIANCE MATRIX (2)
DATA SELECTION (2)
DEGRADATION (2)
EMOTION RECOGNITION (2)
ENCODING (2)
ENTROPY (2)
FACE RECOGNITION (2)
FILTERING THEORY (2)
FOURIER TRANSFORMS (2)
FREQUENCY MODULATION (2)
GABOR FILTERS (2)
GALLIUM NITRIDE (2)
GAUSSIAN MIXTURE MODEL (2)
GAUSSIAN MIXTURE MODELS (2)
GAUSSIAN PROCESSES (2)
HARMONIC ANALYSIS (2)
IMAGE COLOR ANALYSIS (2)
IMAGE PROCESSING (2)
IMAGING (2)
IMPEDANCE MATCHING (2)
KERNEL (2)
LANGUAGE IDENTIFICATION (2)
LDA (2)
MACHINE LEARNING ALGORITHMS (2)
MACHINE VISION (2)
MATERIALS (2)
MAXIMUM LIKELIHOOD DETECTION (2)
MFCC (2)

INFONA - science communication portal
