Advanced search

From:

To:

Items from 1 to 20 out of 25 results

chapter

Voice pathology detection by fuzzy logic

Daria Panek, Andrzej Skalski, Janusz Gajda

2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) Proceedings > 289 - 293

2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)

In this paper an efficient feature extraction methods and fuzzy logic based disorder assessment technique were used to investigate voice signals of patients suffering from functional dysphonia, hyperfunctional dysphonia, vocal cord paralysis and laryngitis. In this work, a vector made up from 28 acoustic parameters was an input for Principal Component Analysis, kernel Principal Component Analysis...

chapter

Multi-shift principal component analysis based primary component extraction for spatial audio reproduction

Jianjun He, Woon-Seng Gan

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 350 - 354

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In spatial audio analysis-synthesis, one of the key issues is to decompose a signal into primary and ambient components based on their spatial features. Principal component analysis (PCA) has been widely employed in primary component extraction, and shifted PCA (SPCA) is employed to enhance the primary extraction for input signals involving the inter-channel time difference. However, SPCA generally...

chapter

EEG dimensionality reduction in automatic identification of synonymy

Emilio Parisotto, Youness Aliyari Ghassabeh, Siamak Freydoonnejad, Frank Rudzicz

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 847 - 851

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recent work has demonstrated the feasibility of extracting semantic categories directly from cortical measures (e.g., electroencephalography, EEG) during receptive tasks. Here, we automatically classify speech stimuli as either synonymous or non-synonymous with a prior prime in a speech-receptive task given only EEG data with up to 86.84% accuracy. An analysis of variance reveals no significant difference...

chapter

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Khan Suhail Ahmad, Anil S. Thosar, Jagannath H. Nirmal, Vinay S. Pande

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This paper motivates the use of combination of mel frequency cepstral coefficients (MFCC) and its delta derivatives (DMFCC and DDMFCC) calculated using mel spaced Gaussian filter banks for text independent speaker recognition. MFCC modeled on the human auditory system shows robustness against noise and session changes and hence has become synonymous with speaker recognition. Our main aim is to test...

chapter

Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification

Noraziahtulhidayu Kamarudin, S.A.R Al-Haddad, Shaiful Jahari Hashim, Mohammad Ali Nematollahi, more

2014 IEEE Student Conference on Research and Development > 1 - 6

2014 IEEE Student Conference on Research and Development (SCOReD)

This paper presents the process of Quranic Accent Automatic Identification. Recent feature extraction technique that is used for Quranic verse rule identification/Tajweed include Mel Frequency Cepstral Coefficients (MFCC) which prone to additive noise and may reduce the classification result. Therefore, to improve the performance of MFCC with addition of Spectral Centroid features and is proposed...

chapter

Combination of PCA and SVM for diagnosis of Parkinson's disease

Mohammad Shahbakhti, Danial Taherifar, Zahra Zareei

2013 2nd International Conference on Advances in Biomedical Engineering > 137 - 140

2013 2nd International Conference on Advances in Biomedical Engineering (ICABME)

Parkinson's disease (PD) is a neurodegenerative brain disorder that occurs when approximately 60% to 80% of the dopamine-producing cells are damaged. PD is the second common neurodegenerative disorder after Alzheimer. PD could be diagnosed by various signals such as EEG, gait and speech. Approximately, 90 percent of people with PD suffer from speech disorder, thus it might be considered as the easiest...

article

Reduce the dimensions of emotional features by principal component analysis for speech emotion recognition

Changqin Quan, Dongyu Wan, Bin Zhang, Fuji Ren

Proceedings of the 02013 IEEE/SICE International Symposium on System... > 2013 > 222 - 226

2013 IEEE/SICE International Symposium on System Integration (SII)

in this paper, the principal component analysis (PCA) is applied to speech emotion recognition for improving the accuracy of the system. The traditional prosodic features like pitch-related features and formant-related features are extracted from the Berlin speech database [7] and a Chinese database. These collected feature data is processed by PCA to remove the irrelevant information. After that,...

chapter

Parkinson's disease feature subset selection based on voice samples

Zahari Abu Bakar, Nur Farahiah Ibrahim, Rohilah Sahak, Nooritawati Md Tahir

2012 International Symposium on Computer Applications and Industrial Electronics (ISCAIE) > 163 - 166

2012 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)

In this study, semi automation prediction of PD is investigated based on twenty two features of voice samples extracted from 147 subjects. Firstly, the original features of voice are used for recognition of PD or otherwise with MLP as classifier and Levenberg Marquardt and Scaled Conjugate Gradient as training algorithm. Next, to identify the number of significant features amongst the original attributes,...

chapter

Dimensionality Reduction for Emotional Speech Recognition

Pouria Fewzee, Fakhri Karray

2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing > 532 - 537

2012 International Conference on Privacy, Security, Risk and Trust (PASSAT)

The number of speech features that are introduced to emotional speech recognition exceeds some thousands and this makes dimensionality reduction an inevitable part of an emotional speech recognition system. The elastic net, the greedy feature selection, and the supervised principal component analysis are three recently developed dimensionality reduction algorithms that we have considered their application...

chapter

A novel approach to identify problematic call center conversations

Meghna Abhishek Pandharipande, Sunil Kumar Kopparapu

2012 Ninth International Conference on Computer Science and Software Engineering (JCSSE) > 1 - 5

2012 International Joint Conference on Computer Science and Software Engineering (JCSSE)

Voice based call centers enable customers to query for information by speaking to agents in the call center. Most often these call conversations are recorded for analysis with the intent of trying to identify things that can help improve the performance of the call center to serve the customer better. Today the recorded conversations are analyzed by humans by listening to call conversations, which...

chapter

Relevant mRMR features for visual speech recognition

Preety Singh, V. Laxmi, M.S. Gaur

2012 International Conference on Recent Advances in Computing and Software Systems > 148 - 153

2012 International Conference on Recent Advances in Computing and Software Systems (RACSS)

To improve the accuracy of visual speech recognition systems, forming a subset of relevant visual features, from a large set of extracted visual cues, is of fundamental importance. In this paper, two feature selection techniques, Principal Component Analysis (PCA) and a relatively recent method, Minimum Redundancy Maximum Relevance (mRMR), are separately applied on the extracted visual features. Prominent...

chapter

An acoustic analysis of shared enjoyment in ECA interactions of children with autism

Theodora Chaspari, Emily Mower Provost, Athanasios Katsamanis, Shrikanth Narayanan

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4485 - 4488

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

The quality of shared enjoyment in interactions is a key aspect related to Autism Spectrum Disorders (ASD). This paper discusses two types of enjoyment: the first refers to humorous events and is associated with one's positive affective state and the second is used to facilitate social interactions between people. These types of shared enjoyment are objectively specified by their proximity to a voiced...

chapter

Toward multi-command auditory brain computer interfacing using speech stimuli

Shuho Yoshimoto, Yoshikazu Washizawa, Toshihisa Tanaka, Hiroshi Higashi, more

Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 4

2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Brain-computer interfaces (BCIs) based on event-related potentials (ERP) are promising tools to communicate with patients suffering from some severe disabled diseases. ERP is evoked by various stimuli such as auditory, olfactory, and visual stimuli. Some auditory based BCIs with certain synthetic tone have been proposed, however, it is still challenging to increase the number of commands in auditory-based...

article

Principal factor analysis and SVM based effective speaker recognition

P Rama Koteswara Rao, Y Srinivasa Rao, D Vijaya Kumar

02012 Third International Conference on Computing, Communication and... > 2012 > 1 - 7

2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT 2012)

Speaker recognition is important for successful development of speech recognizers in various real world applications. In this paper, the speaker recognizer was developed using sizable collection of various speakers both male as well as female with pitch strength as the feature. We proposed Principal Factor Analysis (PFA) technique for dimensionality reduction for accurate speaker recognition system...

chapter

Speaker identification by K-nearest neighbors: Application of PCA and LDA prior to KNN

Juraj Kacur, Radoslav Vargic, Pavol Mulinka

2011 18th International Conference on Systems, Signals and Image Processing > 1 - 4

2011 18th International Conference on Systems, Signals and Image Processing (IWSSIP)

This article presents the task of speaker identification in a closed group. It discusses main steps of the identification process ranging from the proper speech features to the classification methods and statistical signal processing. However, its main focus is on tuning the final system using KNN classification method by setting up the number of neighbors, and reducing the feature vector dimension...

chapter

Spoken emotion recognition using local Fisher discriminant analysis

Shiqing Zhang, Bicheng Lei, Aihua Chen, Caiming Chen, more

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 538 - 540

2010 10th International Conference on Signal Processing (ICSP 2010)

Spoken emotion recognition is an interesting and challenging subject. In this paper, a new feature extraction method based on local Fisher discriminant analysis (LFDA) is proposed for spoken emotion recognition. LFDA is used to extract the low-dimensional discriminant embedded feature data from high-dimensional emotional speech features on spoken emotion recognition tasks. The performance of LFDA...

chapter

Dimensionality reduction methods for HMM phonetic recognition

Hongbing Hu, Stephen A Zahorian

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4854 - 4857

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper presents two nonlinear feature dimensionality reduction methods based on neural networks for a HMM-based phone recognition system. The neural networks are trained as feature classifiers to reduce feature dimensionality as well as maximize discrimination among speech features. The outputs of different network layers are used for obtaining transformed features. Moreover, the training of the...

chapter

Using local features based face experts in multimodal biometrics identification systems

O. Toygar, C. Ergun, H. Altincay

2009 Fifth International Conference on Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control > 1 - 4

2009 Fifth International Conference on Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control

Using local features generally provides higher accuracies compared to a global feature vector in face identification. In this study, taking into account the fact that better multimodal systems generally include individually good experts, multimodal identification using speech and local feature based face experts is studied. Both spPCA and mPCA are considered for this purpose. Experiments on XM2VTS...

chapter

Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum

K. Laskowski, Qin Jin

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4541 - 4544

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In recent years, the field of automatic speaker identification has begun to exploit high-level sources of speaker-discriminative information, in addition to traditional models of spectral shape. These sources include pronunciation models, prosodic dynamics, pitch, pause, and duration features, phone streams, and conversational interaction. As part of this broader thrust, we explore a new frame-level...

chapter

A Novel Reduction Method for Text-Independent Speaker Identification

Yan Wang, Xueyan Liu, Yujuan Xing, Ming Li

2008 Fourth International Conference on Natural Computation > 4 > 66 - 70

2008 Fourth International Conference on Natural Computation (ICNC)

SVM is a novel statistical learning method that has been successfully applied in speaker recognition. However, Extractive feature vectors from the speech are overlapped and noisy is included in the original data space, these problems can lead to experience difficulties, training complication during training SVM, and the result will be reduced during the recognition phase. In this paper, a novel method...

Keywords:
ACCURACY
SPEECH
PRINCIPAL COMPONENT ANALYSIS

Publication date

Set your own date range

Publication type

book (23)
article (2)

Keywords

FEATURE EXTRACTION (17)
SPEECH RECOGNITION (11)
TRAINING (8)
DATABASES (6)
SPEAKER RECOGNITION (6)
ACOUSTICS (5)
ARTIFICIAL NEURAL NETWORKS (5)
MEL FREQUENCY CEPSTRAL COEFFICIENT (5)
VECTORS (5)
COMPUTERS (4)
COVARIANCE MATRIX (4)
DATA MINING (4)
FACE RECOGNITION (4)
HIDDEN MARKOV MODELS (4)
PATTERN RECOGNITION (4)
ROBUSTNESS (4)
SIGNAL PROCESSING (4)
SPEAKER IDENTIFICATION (4)
TRANSFORMS (4)
ALGORITHM DESIGN AND ANALYSIS (3)
CLASSIFICATION ALGORITHMS (3)
CONFERENCES (3)
DATA MODELS (3)
DIMENSIONALITY REDUCTION (3)
EIGENVALUES AND EIGENFUNCTIONS (3)
EMOTION RECOGNITION (3)
GALLIUM NITRIDE (3)
PCA (3)
REAL TIME SYSTEMS (3)
SUPPORT VECTOR MACHINES (3)
SVM (3)
TESTING (3)
TRAINING DATA (3)
ACOUSTIC MEASUREMENTS (2)
ADAPTATION MODEL (2)
COMPLEXITY THEORY (2)
COMPUTATIONAL MODELING (2)
COMPUTER ARCHITECTURE (2)
CONVERGENCE (2)
COST FUNCTION (2)
EDUCATIONAL INSTITUTIONS (2)
ELECTROENCEPHALOGRAPHY (2)
ELECTRONIC MAIL (2)
EQUATIONS (2)
FREQUENCY MEASUREMENT (2)
IMAGE ANALYSIS (2)
IMAGE PROCESSING (2)
IMAGE RECOGNITION (2)
MACHINE LEARNING (2)
MATHEMATICAL MODEL (2)
MFCC (2)
MICROPHONES (2)
NOISE (2)
PARKINSON'S DISEASE (2)
PATTERN CLASSIFICATION (2)
PERIODIC STRUCTURES (2)
PRESSES (2)
PROBABILITY (2)
PSYCHOLOGY (2)
SIGNAL PROCESSING ALGORITHMS (2)
SPEECH FEATURE (2)
SPEECH PROCESSING (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
USA COUNCILS (2)
VISUALIZATION (2)
WRITING (2)
ACOUSTIC TRANSDUCERS (1)
ACTIVE CONTOURS (1)
ACTIVE SHAPE MODEL (1)
ADDITIVE NOISE (1)
ANALYTICAL MODELS (1)
ARTIFICIAL INTELLIGENCE (1)
ASSOCIATIVE MEMORY (1)
ATTENUATION (1)
AUDITORY SYSTEM (1)
AUTHENTICATION (1)
AUTISM (1)
BANCA DATABASES (1)
BIOLOGICAL NEURAL NETWORKS (1)
BIOMEDICAL MONITORING (1)
BIOMEDICAL ULTRASONICS (1)
BIOMETRICS (1)
BIOMETRICS (ACCESS CONTROL) (1)
BLIND SOURCE SEPARATION (1)
BRIDGE CRACK DETECTION (1)
BRIDGES (1)
BRIDGES (STRUCTURES) (1)
BRIGHTNESS (1)
BUSINESS (1)
CALIBRATION (1)
CALL CONVERSATIONS (1)
CEPSTRAL ANALYSIS (1)
CEPSTRAL FEATURE (1)
CHEMICAL ENGINEERING (1)
CIVIL ENGINEERING (1)
CLUSTERING ALGORITHMS (1)
CLUSTERING METHODS (1)
more

INFONA - science communication portal

Advanced search

Advanced search

Voice pathology detection by fuzzy logic

Multi-shift principal component analysis based primary component extraction for spatial audio reproduction

EEG dimensionality reduction in automatic identification of synonymy

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification

Combination of PCA and SVM for diagnosis of Parkinson's disease

Reduce the dimensions of emotional features by principal component analysis for speech emotion recognition

Parkinson's disease feature subset selection based on voice samples

Dimensionality Reduction for Emotional Speech Recognition

A novel approach to identify problematic call center conversations

Relevant mRMR features for visual speech recognition

An acoustic analysis of shared enjoyment in ECA interactions of children with autism

Toward multi-command auditory brain computer interfacing using speech stimuli

Principal factor analysis and SVM based effective speaker recognition

Speaker identification by K-nearest neighbors: Application of PCA and LDA prior to KNN

Spoken emotion recognition using local Fisher discriminant analysis

Dimensionality reduction methods for HMM phonetic recognition

Using local features based face experts in multimodal biometrics identification systems

Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum

A Novel Reduction Method for Text-Independent Speaker Identification

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options