Advanced search

From:

To:

Items from 1 to 20 out of 39 results

chapter

Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification

Noraziahtulhidayu Kamarudin, S.A.R Al-Haddad, Shaiful Jahari Hashim, Mohammad Ali Nematollahi, more

2014 IEEE Student Conference on Research and Development > 1 - 6

2014 IEEE Student Conference on Research and Development (SCOReD)

This paper presents the process of Quranic Accent Automatic Identification. Recent feature extraction technique that is used for Quranic verse rule identification/Tajweed include Mel Frequency Cepstral Coefficients (MFCC) which prone to additive noise and may reduce the classification result. Therefore, to improve the performance of MFCC with addition of Spectral Centroid features and is proposed...

chapter

A Novel pattern recognition model for real-time voice data input

Yogesh Kumar Sen, R. K. Chaurasiya, Shrish Verma

2014 5th International Conference - Confluence The Next Generation Information Technology Summit (Confluence) > 715 - 718

2014 5th International Conference- Confluence The Next Generation Information Technology Summit

The classical front end analysis in speech recognition is a spectral analysis which parameterizes the speech signal into feature vectors. This paper proposes a voice recognition model that is able to automatically classify and recognize a voice signal with background noise. The model uses the concept of spectrogram, pitch period, short time energy, zero crossing rate, mel frequency scale and cepestral...

chapter

Speaker Recognition System: Vulnerable and Challenges

Naufal Alee, Phaklen Ehkan, R. Badlishah Ahmad, Shahrul Nizam Yaakob, more

2013 International Conference on Information Science and Applications (ICISA) > 1 - 4

2013 International Conference on Information Science and Applications (ICISA)

Recently speaker recognition system became high interesting by researchers for both software and hardware solutions. Different technologies have been adopted to implement speaker recognition system that has performance with optimal time response with acceptable accuracy. Research progresses are going on to provide highly durable and precise recognition system that can be embedded into critical implementation...

chapter

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

Ruiling Luo, Wenqing Cai, Min Chen, Zhongling Han

2011 3rd International Workshop on Intelligent Systems and Applications > 1 - 4

2011 3rd International Workshop on Intelligent Systems and Applications (ISA)

SVM is a novel type of statistical learning method that has been successfully used in speaker recognition. However, training SVM consumes long computing time and large storage space with all training examples. This paper proposes an improved sparse least-squares support vector machine (LS-SVM) for speaker identification. Firstly KPCA is exploited to reduce the dimension of input vectors and to denoise...

article

Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions

J Dennis, H D Tran, Haizhou Li

IEEE Signal Processing Letters > 2011 > 18 > 2 > 130 - 133

In this letter, we present a novel feature extraction method for sound event classification, based on the visual signature extracted from the sound's time-frequency representation. The motivation stems from the fact that spectrograms form recognisable images, that can be identified by a human reader, with perception enhanced by pseudo-coloration of the image. The signal processing in our method is...

chapter

Performance improvement in automatic gender identification using hierarchical clustering

M A Keyvanrad, M M Homayounpour

2010 5th International Symposium on Telecommunications > 900 - 903

2010 5th International Symposium on Telecommunications (IST)

In this paper a hierarchical structure is proposed for automatic gender identification (AGI). In this structure two clustering techniques are used. The first technique is divisive clustering for dividing speakers from each gender to some classes of speakers. The second clustering technique is agglomerative clustering for creating a hierarchical structure. Feature reduction is done by SOAP feature...

chapter

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

N Farra, E Challita, R A Assi, H Hajj

2010 IEEE International Conference on Data Mining Workshops > 1114 - 1119

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In this work, we investigate sentiment mining of Arabic text at both the sentence level and the document level. Existing research in Arabic sentiment mining remains very limited. For sentence-level classification, we investigate two approaches. The first is a novel grammatical approach that employs the use of a general structure for the Arabic sentence. The second approach is based on the semantic...

chapter

Automatic lexical stress detection for Chinese learners' of English

Jin-Yu Chen, Lan Wang

2010 7th International Symposium on Chinese Spoken Language Processing > 407 - 411

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

This paper investigates lexical stress detection for Chinese learners of English, where a combined differential acoustic feature is developed to represent the lexical stress of polysyllabic words in continuous speech. The use of frame-averaged feature and the contextual information intra-word can be input to the classifiers without normalization. The word-based stress detection method proposed in...

chapter

Emotions analysis of speech for call classification

Esraa Ali Hassan, Neamat El Gayar, M Ghanem Moustafa

2010 10th International Conference on Intelligent Systems Design and Applications > 242 - 247

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

Most existing research in the area of emotions recognition has focused on short segments or utterances of speech. In this paper we propose a machine learning system for classifying the overall sentiment of long conversations as being Positive or Negative. Our system has three main phases, first it divides a call into short segments, second it applies machine learning to recognize the emotion for each...

chapter

Factor Analysis and Majority Voting Based Speech Emotion Recogntion

Lu Xu, Mingxing Xu, Dali Yang

2010 International Conference on Intelligent System Design and Engineering Application > 1 > 716 - 720

2010 International Conference on Intelligent System Design and Engineering Application (ISDEA 2010)

There are some problems to be resolved for speech emotion recognition, such as the dimension of feature sets is usually too high and the redundancy among various features is relatively stronger. Considering these problems, the factor analysis and majority voting based speech emotion recognition was proposed. How to extract emotional factors from global statistical features and GMM super vectors was...

chapter

Framewise Phone Classification Using Weighted Fuzzy Classification Rules

Omid Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

2010 20th International Conference on Pattern Recognition > 4186 - 4189

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Our aim in this paper is to propose a rule-weight learning algorithm in fuzzy rule-based classifiers. The proposed algorithm is presented in two modes: first, all training examples are assumed to be equally important and the algorithm attempts to minimize the error-rate of the classifier on the training data by adjusting the weight of each fuzzy rule in the rule-base, and second, a weight is assigned...

chapter

A Study of Voice Source and Vocal Tract Filter Based Features in Cognitive Load Classification

Phu Ngoc Le, Julien Epps, Eric H C Choi, Eliathamby Ambikairajah

2010 20th International Conference on Pattern Recognition > 4516 - 4519

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Speech has been recognized as an attractive method for the measurement of cognitive load. Previous approaches have used mel frequency cepstral coefficients (MFCCs) as discriminative features to classify cognitive load. The MFCCs contain information from both the voice source and the vocal tract, so that the individual contributions of each to cognitive load variation are unclear. This paper aims to...

chapter

Classification of four affective modes in online songs and speeches

Chien Hung Chen, Ping Tsung Lu, Oscal T.-C Chen

The 19th Annual Wireless and Optical Communications Conference (WOCC 2010) > 1 - 4

2010 19th Annual Wireless and Optical Communications Conference (WOCC 2010)

The amount of multimedia sources from websites is extremely growing up every day. How to effectively search data and to find out what we need becomes a critical issue. In this work, four affective modes of exciting/happy, angry, sad and calm in songs and speeches are investigated. A song clip is partitioned into the main and refrain parts each of which is analyzed by the tempo, normalized intensity...

chapter

Classification of Audio Data Using a Centroid Neural Network

Dong-Chul Park

2010 International Conference on Information Science and Applications > 1 - 6

2010 International Conference on Information Science and Applications (ICISA 2010)

The automatic classification of audio data is an effective way to organize a large-scale audio data files. In this paper, an automatic content-based audio classification model using Centroid Neural Networks (CNN) with a Divergence Measure is proposed. The Divergence-based Centroid Neural Network (DCNN) algorithm, which employs the divergence measure as its distance measure, is used for clustering...

chapter

Error corrective classifier fusion for spoken Language Recognition

Omid Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 1994 - 1997

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

A number of effective classification algorithms have been developed for spoken language recognition, and it has been a common practice in the NIST Language Recognition Evaluations (LREs) that an information fusion is applied to boost the performance of the recognition system. This paper investigates the fusion of multiple output scores generated using different classifiers that complement to further...

chapter

Automatic acquisition device identification from speech recordings

D Garcia-Romero, C Y Espy-Wilson

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 1806 - 1809

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper we present a study on the automatic identification of acquisition devices when only access to the output speech recordings is possible. A statistical characterization of the frequency response of the device contextualized by the speech content is proposed. In particular, the intrinsic characteristics of the device are captured by a template, constructed by appending together the means...

chapter

Combining regression and classification methods for improving automatic speaker age recognition

C van Heerden, E Barnard, M Davel, C van der Walt, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5174 - 5177

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

We present a novel approach to automatic speaker age classification, which combines regression and classification to achieve competitive classification accuracy on telephone speech. Support vector machine regression is used to generate finer age estimates, which are combined with the posterior probabilities of well-trained discriminative gender classifiers to predict both the age and gender of a speaker...

chapter

Instrument identification in polyphonic music signals based on individual partials

Jayme Garcia Arnal Barbedo, George Tzanetakis

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 401 - 404

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

A new approach to instrument identification based on individual partials is presented. It makes identification possible even when the concurrently played instrument sounds have a high degree of spectral overlapping. A pairwise comparison scheme which emphasizes the specific differences between each pair of instruments is used for classification. Finally, the proposed method only requires a single...

chapter

Dimensionality reduction methods for HMM phonetic recognition

Hongbing Hu, Stephen A Zahorian

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4854 - 4857

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper presents two nonlinear feature dimensionality reduction methods based on neural networks for a HMM-based phone recognition system. The neural networks are trained as feature classifiers to reduce feature dimensionality as well as maximize discrimination among speech features. The outputs of different network layers are used for obtaining transformed features. Moreover, the training of the...

chapter

Partitioned Feature-based Classifier model

Dong-Chul Park

2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 412 - 417

2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2009)

The Partitioned Feature-based Classifier (PFC) is proposed in this paper. PFC does not use entire feature vectors extracted from the original data at once to classify each datum, but use only groups of features related to each feature vector to classify data separately. In the training stage, the contribution rate calculated from each feature vector group is drawn throughout the accuracy of each feature...

Keywords:
ACCURACY
SPEECH
PATTERN CLASSIFICATION

Publication date

Set your own date range

INFONA - science communication portal

Advanced search

Advanced search

Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification

A Novel pattern recognition model for real-time voice data input

Speaker Recognition System: Vulnerable and Challenges

The Application of Improved Sparse Least-Squares Support Vector Machine in Speaker Identification

Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions

Performance improvement in automatic gender identification using hierarchical clustering

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

Automatic lexical stress detection for Chinese learners' of English

Emotions analysis of speech for call classification

Factor Analysis and Majority Voting Based Speech Emotion Recogntion

Framewise Phone Classification Using Weighted Fuzzy Classification Rules

A Study of Voice Source and Vocal Tract Filter Based Features in Cognitive Load Classification

Classification of four affective modes in online songs and speeches

Classification of Audio Data Using a Centroid Neural Network

Error corrective classifier fusion for spoken Language Recognition

Automatic acquisition device identification from speech recordings

Combining regression and classification methods for improving automatic speaker age recognition

Instrument identification in polyphonic music signals based on individual partials

Dimensionality reduction methods for HMM phonetic recognition

Partitioned Feature-based Classifier model

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options