Items from 1 to 20 out of 25 results

chapter

Voice pathology detection with MDVP parameters using Arabic voice pathology database

Ahmed Al-nasheri, Zulfiqar Ali, Ghulam Muhammad, Mansour Alsulaiman, more

2015 5th National Symposium on Information Technology: Towards New Smart World (NSITNSW) > 1 - 5

2015 5th National Symposium on Information Technology: Towards New Smart World (NSITNSW)

This paper investigates the use of MultiDimensional Voice Program (MDVP) parameters to automatically detect voice pathology in Arabic voice pathology database (AVPD). MDVP parameters are very popular among the physician / clinician to detect voice pathology; however, MDVP is a commercial software. AVPD is a newly developed speech database designed to suit a wide range of experiments in the field of...

chapter

Classification of emotions from speech using implicit features

Mohit Srivastava, Anupam Agarwal

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 6

2014 9th International Conference on Industrial and Information Systems (ICIIS)

Human computer interaction with the time has extended its branches to many different other fields like engineering, cognition, medical etc. Speech analysis has also become an important area of concern. People involved are using this mode for the interaction with the machines to bridge the gap between physical and digital world. Speech emotion recognition has become an integral subfield in the domain...

chapter

Voice pathology detection using auto-correlation of different filters bank

Ahmed Al-nasheri, Zulfiqar Ali, Ghulam Muhammad, Mansour Alsulaiman

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 50 - 55

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

This paper investigates the contribution of frequency bands for automatic voice pathology detection. First, the input voice signal is passed through a number of time-domain band-pass filters. The center frequencies are spaced on an octave scale. Each filter output is then divided into overlapping frames. Auto-correlation function is applied to each block to find the first largest peak, in areas other...

chapter

Inter comparison of classification techniques for vowel speech imagery using EEG sensors

Anaum Riaz, Sana Akhtar, Shanza Iftikhar, Amir Ali Khan, more

The 2014 2nd International Conference on Systems and Informatics (ICSAI 2014) > 712 - 717

2014 2nd International Conference on Systems and Informatics (ICSAI)

The use of Electroencephalography (EEG) in the domain of Brain Computer Interface is now commonplace. EEG for imagined speech reproduction and observation of brain response to audio stimuli are active areas of research. In this paper, we consider the case of imagined and mouthed non-audible speech recorded with EEG electrodes. We analyze different feature extraction techniques such as Mel Frequency...

chapter

Speech emotion recognition

S. Lalitha, Abhishek Madhavan, Bharath Bhushan, Srinivas Saketh

2014 International Conference on Advances in Electronics Computers and Communications > 1 - 4

2014 International Conference on Advances in Electronics, Computers and Communications (ICAECC)

In the past decade a lot of research has gone into Automatic Speech Emotion Recognition (SER). The primary objective of SER is to improve the man-machine interface. It can also be used to monitor the psychophysiological state of a person in lie detectors. In recent times, speech emotion recognition has also found applications in medicine and forensics. In this paper 7 emotions are recognized using pitch...

chapter

Performance comparison of heterogeneous classifiers for detection of Parkinson's disease using voice disorder (dysphonia)

Mohammad S Islam, Imtiaz Parvez, Hai Deng, Parijat Goswami

2014 International Conference on Informatics, Electronics & Vision (ICIEV) > 1 - 7

2014 International Conference on Informatics, Electronics & Vision (ICIEV)

Speech signal processing and its recognition system have gained a lot of attention from last few years due to its widespread application. In this study, we have conducted a comparative analysis for effective detection of Parkinson's disease using various machine learning classifiers from voice disorder known as dysphonia. To investigate robust detection process, three independent classifier topologies...

chapter

The challenges of SVM optimization using Adaboost on a phoneme recognition problem

Rimah Amami, Dorra Ben Ayed, Noureddine Ellouze

2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom) > 463 - 468

2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom)

The use of digital technology is growing at a very fast pace, which has led to the emergence of systems based on cognitive infocommunications. The expansion of this sector requires combining methods in order to ensure robustness in cognitive systems.

chapter

Classification of emotional speech units in call centre interactions

Dimitrios Galanis, Sotiris Karabetsos, Maria Koutsombogera, Harris Papageorgiou, more

2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom) > 403 - 406

2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom)

Detecting emotional traits in call centre interactions can be beneficial to the quality management of the services provided, since this reveals the positioning of both speakers, i.e. satisfaction or frustration and anger on the customers' side, and stress detection, disappointment mitigation or failure to provide the requested service on the operators' side. This paper describes a machine learning...

chapter

Combination of PCA and SVM for diagnosis of Parkinson's disease

Mohammad Shahbakhti, Danial Taherifar, Zahra Zareei

2013 2nd International Conference on Advances in Biomedical Engineering > 137 - 140

2013 2nd International Conference on Advances in Biomedical Engineering (ICABME)

Parkinson's disease (PD) is a neurodegenerative brain disorder that occurs when approximately 60% to 80% of the dopamine-producing cells are damaged. PD is the second common neurodegenerative disorder after Alzheimer. PD could be diagnosed by various signals such as EEG, gait and speech. Approximately, 90 percent of people with PD suffer from speech disorder, thus it might be considered as the easiest...

chapter

Frequency Shift Detection of Speech with GMMs AND SVMs

Hua Xing, Philipos C. Loizou

2012 IEEE Workshop on Signal Processing Systems > 215 - 219

2012 IEEE Workshop on Signal Processing Systems (SiPS)

In certain situations, speech might be shifted in the frequency domain amid the presence of noise. To be able to compensate for the spectral shift, it is important to know the amount of frequency shift present. A method based on Mel-frequency-cepstral-coefficient (MFCC) and Gaussian Mixture model (GMM) super vector is proposed for detecting frequency shifts in speech. MFCC or LFCC is extracted to...

chapter

A Music Retrieval System Using Melody and Lyric

Zhiyuan Guo, Qiang Wang, Gang Liu, Jun Guo, more

2012 IEEE International Conference on Multimedia and Expo Workshops > 343 - 348

2012 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

Using melody and/or lyric to query a music retrieval system is convenient for users but challenging for developers. This paper proposes efficient schemes for realizing key algorithms in such a kind of system. Specifically, we characterize our system by adding lyric to query as follows: A Support Vector Machine (SVM) is employed to distinguish humming queries from singing queries, For a singing query,...

article

Principal factor analysis and SVM based effective speaker recognition

P Rama Koteswara Rao, Y Srinivasa Rao, D Vijaya Kumar

2012 Third International Conference on Computing, Communication and... > 2012 > 1 - 7

2012 Third International Conference on Computing, Communication and Networking Technologies (ICCCNT 2012)

Speaker recognition is important for successful development of speech recognizers in various real world applications. In this paper, the speaker recognizer was developed using sizable collection of various speakers both male as well as female with pitch strength as the feature. We proposed Principal Factor Analysis (PFA) technique for dimensionality reduction for accurate speaker recognition system...

chapter

SVM binary decision tree architecture for multi-class audio classification

Jozef Vavrek, Anton Cizmar, Jozef Juhar

Proceedings ELMAR-2012 > 202 - 206

2012 54th International Symposium ELMAR

The paper presents the support vector machine binary decision tree scheme (SVM-BDT) used for broadcast news (BN) audio classification. The SVM-BDT architecture was designed to solve multi-class discrimination problem of considered acoustic events: pure speech, speech with music, speech with environment sound, music, and environment sound. Its performance was investigated by using Mel-frequency cepstral...

chapter

An SVM based classification approach to speech separation

Kun Han, DeLiang Wang

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4632 - 4635

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Monaural speech separation is a very challenging task. CASA-based systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classification approach to monaural separation problem. Our feature set consists of pitch-based features and amplitude modulation spectrum features, which can discriminate both voiced and unvoiced speech from nonspeech interference...

chapter

Study of automatic biosounds detection and classification using SVM and GMM

Bor Jenq Chua, Xue Jun Li, Huy Dat Tran

2011 IEEE/NIH Life Science Systems and Applications Workshop (LiSSA) > 155 - 158

2011 IEEE/NIH 5th Life Science Systems and Applications Workshop (LiSSA)

Ambulatory devices can be used to detect heart diseases and save lives in critical time. These devices are based on sound classification that usually adopts a suitable data mining algorithm. This paper investigates the performance of Support Vector Machine (SVM) and Gaussian Mixture Model (GMM) classifiers in classifying sound samples. SVM classifier makes use of a linearly separable hyperplane to...

chapter

Towards better GMM-based acoustic modeling for spoken language identification

Fahime Ghasemian, Mohammad Mahdi Homayounpour

2011 19th Iranian Conference on Electrical Engineering > 1 - 4

2011 19th Iranian Conference on Electrical Engineering (ICEE)

Gaussian Mixture Model (GMM) is a widely used, simple and effective modeling approach for spoken language identification. Traditionally EM algorithm is used to train this model. In this paper we propose a new method named WA-GMM (Weight Adapted GMM) for estimating the weights of GMM Gaussian components using bag-of-unigram and Support Vector Machine (SVM): SVM weights which are trained on bag-of-unigram...

chapter

Performance improvement in automatic gender identification using hierarchical clustering

M A Keyvanrad, M M Homayounpour

2010 5th International Symposium on Telecommunications > 900 - 903

2010 5th International Symposium on Telecommunications (IST)

In this paper a hierarchical structure is proposed for automatic gender identification (AGI). In this structure two clustering techniques are used. The first technique is divisive clustering for dividing speakers from each gender to some classes of speakers. The second clustering technique is agglomerative clustering for creating a hierarchical structure. Feature reduction is done by SOAP feature...

chapter

Multi-layered features with SVM for Chinese accent identification

Jue Hou, Yi Liu, T F Zheng, J Olsen, more

2010 International Conference on Audio, Language and Image Processing > 25 - 30

2010 International Conference on Audio, Language and Image Processing (ICALIP)

In this paper, we propose an approach of multi-layered feature combination associated with support vector machine (SVM) for Chinese accent identification. The multi-layered features include both segmental and suprasegmental information, such as MFCC and pitch contour, to capture the diversity of variations in Chinese accented speech. The pitch contour is estimated using cubic polynomial method to...

chapter

Automatic lexical stress detection for Chinese learners' of English

Jin-Yu Chen, Lan Wang

2010 7th International Symposium on Chinese Spoken Language Processing > 407 - 411

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

This paper investigates lexical stress detection for Chinese learners of English, where a combined differential acoustic feature is developed to represent the lexical stress of polysyllabic words in continuous speech. The use of frame-averaged feature and the contextual information intra-word can be input to the classifiers without normalization. The word-based stress detection method proposed in...

chapter

Combining Support Vector Machines, Border Revised Rules and Transformation-based Error-driven Learning for Chinese Chunking

Wei Yuan, Zhang Ling-yu, Zhang Ya-xuan, He Lu, more

2010 International Conference on Artificial Intelligence and Computational Intelligence > 1 > 383 - 387

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

In research work, we found that grammatical information in the Modern Chinese Grammar Information Dictionary is very effective to revise chunk border. So the Modern Chinese Grammar Information Dictionary used to extract the chunk Border Revised Rules (BRR). In this paper, a new method of chunking is proposed--combined with BRR and TBL, SVM used for chunking. We reduced the number of SVM feature vector,...

Keywords:
ACCURACY
SPEECH
SVM

Publication type

book (24)
article (1)

Keywords

SUPPORT VECTOR MACHINES (23)
TRAINING (15)
FEATURE EXTRACTION (13)
SPEECH RECOGNITION (11)
SUPPORT VECTOR MACHINE (7)
GMM (5)
SPEECH PROCESSING (5)
HIDDEN MARKOV MODELS (4)
MEL FREQUENCY CEPSTRAL COEFFICIENT (4)
ACOUSTICS (3)
CLASSIFICATION ALGORITHMS (3)
DATABASES (3)
KERNEL (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
MFCC (3)
PRINCIPAL COMPONENT ANALYSIS (3)
SPEAKER RECOGNITION (3)
TAGGING (3)
DICTIONARIES (2)
GRAMMARS (2)
HMM (2)
NATURAL LANGUAGE PROCESSING (2)
NATURAL LANGUAGES (2)
PARKINSON'S DISEASE (2)
PATHOLOGY (2)
PATTERN CLASSIFICATION (2)
PCA (2)
TEXT ANALYSIS (2)
VOICE PATHOLOGY DETECTION (2)
ADABOOST (1)
ADAPTATION MODELS (1)
AGGLOMERATIVE CLUSTERING (1)
AMBULATORY DEVICE (1)
ANN (1)
AR (1)
ARABIC PART-OF-SPEECH TAGGER (1)
ARTIFICIAL INTELLIGENCE (1)
AUTO-CORRELATION (1)
AUTOMATIC BIOSOUND DETECTION (1)
AUTOMATIC GENDER IDENTIFICATION (1)
AUTOMATIC LEXICAL STRESS DETECTION (1)
AVPD (1)
BAG-OF-UNIGRAM (1)
BAND-PASS FILTERS (1)
BERLIN DATABASE (1)
BIOSOUND CLASSIFICATION (1)
BN AUDIO STREAM (1)
BOOSTING (1)
BORDER REVISED RULE (1)
C4.5 (1)
CALL CENTRE INTERACTIONS (1)
CARDIOLOGY (1)
CEPSTRAL ANALYSIS (1)
CHINESE ACCENTED SPEECH (1)
CHINESE ACCENTED SPEECH IDENTIFICATION (1)
CHINESE CHUNKING (1)
CHINESE LEARNER (1)
CHUNKING (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHM (1)
CLASSIFICATION RUN TIME (1)
CLASSIFIER (1)
CLUSTERING (1)
COMBINED DIFFERENTIAL ACOUSTIC FEATURES (1)
COMPUTATIONAL MODELING (1)
CONFERENCES (1)
CONTEXTUAL INFORMATION INTRAWORD (1)
CONTINUOUS SPEECH (1)
CUBIC POLYNOMIAL METHOD (1)
DATA MINING (1)
DATA MINING ALGORITHM (1)
DATA MODELS (1)
DECISION TREES (1)
DICTIONARY OF MODERN CHINESE GRAMMAR INFORMATION (1)
DIMENSIONALITY REDUCTION (1)
DISCRETE COSINE TRANSFORMS (1)
DISEASES (1)
DISTANCE MEASURE (1)
DSS (1)
DTAK (1)
DYNAMIC TIME ALIGNMENT KERNEL (1)
DYSPHONIA MEASUREMENTS (1)
ELECTRODES (1)
ELECTROENCEPHALOGRAPHY (1)
EMOTION CLASSIFICATION (1)
EMOTION RECOGNITION (1)
EMOTIONAL SPEECH (1)
EMOTIONS (1)
ENCODING (1)
ENGLISH (1)
ERROR CORRECTING OUTPUT CODES (1)
ERROR CORRECTION CODES (1)
EXTRACTIVE FEATURE VECTORS (1)
FBANN (1)
FEATURE SELECTION (1)
FEATURE VECTOR (1)
FILTER BANKS (1)