Depression detection using the speech signal is becoming an attractive topic because it is fast, convenient and non-invasive. Many studies have aimed at improving depression classification performance. This study investigated the application of ensemble learners in depression detection and compared three speaking styles (interview, reading and picture description) in ensembles. A speech dataset collected from...
To improve the precision of speech emotion recognition, this paper proposes a novel speech emotion recognition approach based on the Gaussian Kernel Nonlinear Proximal Support Vector Machine (PSVM) to recognize four basic human emotions (anger, joy, sadness, surprise). First, the speech signal is preprocessed, including sampling, quantization, pre-emphasis, framing, windowing and endpoint...
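The preprocessing steps named in this abstract (pre-emphasis, framing, windowing) are standard and can be sketched as follows; the frame length, hop size and pre-emphasis coefficient below are common illustrative values, not ones taken from the paper.

```python
import numpy as np

def preprocess(signal, frame_len=400, hop=160, alpha=0.97):
    """Pre-emphasize a speech signal, then split it into
    overlapping Hamming-windowed frames."""
    # Pre-emphasis: y[n] = x[n] - alpha * x[n-1]
    emphasized = np.append(signal[0], signal[1:] - alpha * signal[:-1])
    # Slice the signal into overlapping frames of frame_len samples
    n_frames = 1 + (len(emphasized) - frame_len) // hop
    frames = np.stack([emphasized[i * hop:i * hop + frame_len]
                       for i in range(n_frames)])
    # Taper each frame with a Hamming window to reduce spectral leakage
    return frames * np.hamming(frame_len)

frames = preprocess(np.random.randn(16000))  # 1 s of audio at 16 kHz
print(frames.shape)  # (98, 400)
```

Endpoint detection (trimming leading/trailing silence) would typically follow, e.g. by thresholding per-frame energy.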
Steganographic systems are used for the transmission of hidden data in an original signal. The article describes an algorithm for hidden data transmission using the speech signal as a carrier. The echo method is used for data embedding. In order to improve the decoding efficiency of the embedded data, a voicing-correction procedure and an informed-coding mechanism were developed and implemented...
In this paper, we present a novel spectrum mapping method — Continuous Frequency Warping and Magnitude Scaling (CFWMS) for voice conversion under the Joint Density Gaussian Mixture Model (JDGMM) framework. JDGMM is a mature clustering technique that models the joint probability density of speech signals from paired speakers. The conventional JDGMM-based approaches morph the spectral features via least...
Speech emotion recognition is a challenging problem, with identifying efficient features being of particular concern. This paper has two components. First, it presents an empirical study that evaluated four feature reduction methods, chi-square, gain ratio, RELIEF-F, and kernel principal component analysis (KPCA), at the utterance level using a support vector machine (SVM) as a classifier. KPCA had the...
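Two of the feature reduction methods named above (chi-square selection and KPCA) can be compared in a few lines with scikit-learn; the data below is random placeholder input, and the dimensions are illustrative, not the paper's settings.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.decomposition import KernelPCA
from sklearn.svm import SVC
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.random((200, 50))      # placeholder utterance-level features (non-negative, as chi2 requires)
y = rng.integers(0, 4, 200)    # placeholder 4-class emotion labels

# Reduce to 10 features, then classify with an RBF-kernel SVM
chi2_pipe = make_pipeline(SelectKBest(chi2, k=10), SVC(kernel="rbf"))
kpca_pipe = make_pipeline(KernelPCA(n_components=10, kernel="rbf"), SVC(kernel="rbf"))

for name, pipe in [("chi2", chi2_pipe), ("kpca", kpca_pipe)]:
    scores = cross_val_score(pipe, X, y, cv=5)
    print(name, scores.mean())
```

On real features the cross-validated accuracies would reveal which reduction method best preserves emotion-discriminative information.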
The kernel function plays an important role in the classification of support vector machines (SVM). In order to solve the problem that a single SVM kernel function cannot achieve optimal learning ability and generalization ability in recognition classification at the same time, here we present a new combined kernel function by analyzing and comparing the characteristics of various kernel functions...
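A common way to build such a combined kernel, which may or may not match this paper's construction, is a convex combination of a local kernel (RBF, strong learning ability) and a global kernel (polynomial, strong generalization); the weight and kernel parameters below are illustrative.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import rbf_kernel, polynomial_kernel

def combined_kernel(X, Y, w=0.6, gamma=0.5, degree=3):
    """Convex combination of an RBF kernel (local fitting)
    and a polynomial kernel (global generalization).
    A convex combination of valid kernels is itself a valid kernel."""
    return (w * rbf_kernel(X, Y, gamma=gamma)
            + (1 - w) * polynomial_kernel(X, Y, degree=degree))

rng = np.random.default_rng(1)
X = rng.random((100, 8))
y = (X.sum(axis=1) > 4).astype(int)

# scikit-learn accepts a callable kernel that returns the Gram matrix
clf = SVC(kernel=combined_kernel).fit(X, y)
print(clf.score(X, y))
```

The mixing weight `w` trades off the two kernels and would itself be tuned, e.g. by cross-validation.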
Accurately recognizing speaker emotion and age/gender from speech can provide better user experience for many spoken dialogue systems. In this study, we propose to use deep neural networks (DNNs) to encode each utterance into a fixed-length vector by pooling the activations of the last hidden layer over time. The feature encoding process is designed to be jointly trained with the utterance-level classifier...
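The pooling step described here, collapsing variable-length sequences of last-hidden-layer activations into one fixed-length vector, can be sketched with simple mean pooling (the abstract does not specify which pooling function is used, so averaging is an assumption):

```python
import numpy as np

def encode_utterance(hidden_activations):
    """Pool frame-level hidden activations (T x D) over time into a
    fixed-length D-dimensional utterance vector by averaging."""
    return hidden_activations.mean(axis=0)

# Utterances of different lengths map to vectors of the same size,
# so a single utterance-level classifier can consume them
short = np.random.randn(120, 256)   # 120 frames, 256 hidden units
long_ = np.random.randn(700, 256)   # 700 frames, same hidden width
print(encode_utterance(short).shape, encode_utterance(long_).shape)  # (256,) (256,)
```

Because mean pooling is differentiable, gradients from the utterance-level classifier can flow back through it, which is what makes the joint training described in the abstract possible.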
Developing cross-corpus, cross-domain, and cross-language emotion recognition algorithms has become more prevalent recently, to ensure the wide applicability of robust emotion recognizers. In this work, we propose a computational framework for fusing multiple emotion perspectives by integrating cross-lingual emotion information. By assuming that each data point is ‘perceived’ not only by a main perspective...
This work presents a novel approach to audio event recognition. The approach develops a weighted kernel Fisher sparse analysis method based on multiple maps. The proposed method consists of map extraction and kernel-weighted Fisher sparse analysis. Two maps are first extracted from each audio file, i.e. a scale-frequency map and a damping-frequency map. The scale and frequency of the Gabor atoms are...
Classification of human behavior is a key step to developing closed-loop Deep Brain Stimulation (DBS) systems, which may decrease the power consumption and side effects of the existing systems. Recent studies have shown that the Local Field Potential (LFP) signals from both Subthalamic Nuclei (STN) of the brain can be used to recognize human behavior. Since the DBS leads implanted in each STN can...
Kernel Additive Modeling (KAM) is a recent, promising framework for the separation of underdetermined convolutive mixtures of audio signals. The principle of this method is to estimate the short-term Power Spectral Densities (PSDs) of the sources directly from the mixture by taking advantage of redundant features in the PSD of each source, such as periodicity or smoothness. The separation itself is...
The use of non-verbal vocal input (NVVI) as a hands-free trigger approach has proven to be valuable in previous work [7]. Nevertheless, BlowClick's original detection method is vulnerable to false positives and, thus, is limited in its potential use, e.g., together with acoustic feedback for the trigger. Therefore, we extend the existing approach by adding common machine learning methods. We found...
Parkinson's disease has become a serious problem among the elderly, and there is currently no precise method to diagnose it. Given the significance and difficulty of recognizing Parkinson's disease, the measurement of patients' voices is regarded as one of the best non-invasive ways to identify true patients. The Support Vector Machine is one of the most effective tools for classification in machine...
The i-vector space feature has recently been proven very efficient in the speaker recognition field. In this paper, we assess using the i-vector approach for emotional speaker recognition in order to boost the performance, which is deteriorated by emotions. The key idea of the i-vector algorithm is to represent each speaker by a fixed-length, low-dimensional feature vector. The concatenation of these...
In this paper, we focus on the classification of neutral and stressed speech. The parameters representing airflow patterns in the physiological system are obtained using a physical model. Speech features were modeled using Gaussian Mixture Models (GMM) and Support Vector Machines (SVM). A comparison is made of different classifiers to determine their performance in stressed speech classification. Results...
This paper proposes a speech/music classification system based on i-vectors. An analysis of two classification methods, namely the cosine distance score (CDS) and the support vector machine (SVM), is performed. Two session compensation methods, within-class covariance normalization (WCCN) and linear discriminant analysis (LDA), are also discussed. The proposed systems yield better results compared...
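The cosine distance score mentioned above is simply the cosine similarity between a test i-vector and each class model i-vector, with the class of the higher score winning. A minimal sketch, using tiny hand-made 3-dimensional vectors in place of real (typically 400-600 dimensional) i-vectors:

```python
import numpy as np

def cosine_distance_score(w_test, w_model):
    """Cosine similarity between a test i-vector and a class model
    i-vector; higher means the test is closer to that class."""
    return float(w_test @ w_model /
                 (np.linalg.norm(w_test) * np.linalg.norm(w_model)))

# Hypothetical class models and a test vector (illustrative values only)
speech_model = np.array([1.0, 0.2, 0.1])
music_model  = np.array([0.1, 1.0, 0.8])
test_ivec    = np.array([0.9, 0.3, 0.1])

# Decide speech vs. music by the higher score
scores = {"speech": cosine_distance_score(test_ivec, speech_model),
          "music":  cosine_distance_score(test_ivec, music_model)}
print(max(scores, key=scores.get))  # speech
```

In a full system, WCCN or LDA would be applied to the i-vectors before scoring to suppress session variability.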
Users interact with mobile apps with certain intents such as finding a restaurant. Some intents and their corresponding activities are complex and may involve multiple apps; for example, a restaurant app, a messenger app and a calendar app may be needed to plan a dinner with friends. However, activities may be quite personal and third-party developers would not be building apps to specifically handle...
Does a hearing-impaired individual's speech reflect his hearing loss, and if it does, can the nature of hearing loss be inferred from his speech? To investigate these questions, at least four hours of speech data were recorded from each of 37 adult individuals, both male and female, belonging to four classes: 7 normal, and 30 severely-to-profoundly hearing impaired with high, medium or low speech...
Given the increasing attention paid to speech emotion classification in recent years, this work presents a novel speech emotion classification approach based on the multiple kernel Gaussian process. Two major aspects of a classification problem that play an important role in classification accuracy are addressed, i.e. feature extraction and classification. Prosodic features and other features widely...
Speech emotion recognition is a challenging and significant task. On the one hand, the emotion features need to be robust enough to capture the emotion information, while on the other, machine learning algorithms need to be insensitive when modeling the utterance. In this paper, we present a novel framework of speech emotion recognition to address the two above-mentioned challenges. Relative Entropy...