Advanced search

From:

To:

Items from 1 to 20 out of 63 results

chapter

A novel feature selection based on Tibetan grammar for Tibetan text classification

Tao Jiang, Hongzhi Yu

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 445 - 448

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Feature selection is a strategy that aims at making text classifiers more efficient and accurate. In this paper, we proposed a novel feature selection method based on Tibetan grammar for Tibetan classification. Tibetan language express grammatical meaning through the function words and word order, and the function word has large proportions. By analyzing the Tibetan grammar and distribution of part...

chapter

Developmental pattern analysis and age prediction by extracting speech features and applying various classification techniques

Sumanlata Gautam, Latika Singh

International Conference on Computing, Communication & Automation > 83 - 87

2015 International Conference on Computing, Communication & Automation (ICCCA)

In speech development research, it's important to know how speech acoustic features vary as a function of age and the age when the variability and magnitude of acoustic features start to exhibit adult-like patterns. During the first few years of life, a child's speech changes from the cries and babbles of an infant to adult-like words and phrases of a young child. A number of acoustic studies observed...

chapter

Classification of emotions from speech using implicit features

Mohit Srivastava, Anupam Agarwal

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 6

2014 9th International Conference on Industrial and Information Systems (ICIIS)

Human computer interaction with the time has extended its branches to many different other fields like engineering, cognition, medical etc. Speech analysis has also become an important area of concern. People involved are using this mode for the interaction with the machines to bridge the gap between physical and digital world. Speech emotion recognition has become an integral subfield in the domain...

chapter

A new direct access framework for speaker identification system

Hery Heryanto, Saiful Akbar, Benhard Sitohang

2014 International Conference on Data and Software Engineering (ICODSE) > 1 - 5

2014 International Conference on Data and Software Engineering (ICODSE)

We present in this paper a new Direct Access Framework (DAF) for speaker identification system, to identify a speaker based on original characteristics of the human voice. Direct access method is a process to identify an object based on parts of the object itself, the parts called original characteristics. The proposed framework consists of two parts, the enrolment process and the identification process...

chapter

A Novel pattern recognition model for real-time voice data input

Yogesh Kumar Sen, R. K. Chaurasiya, Shrish Verma

2014 5th International Conference - Confluence The Next Generation Information Technology Summit (Confluence) > 715 - 718

2014 5th International Conference- Confluence The Next Generation Information Technology Summit

The classical front end analysis in speech recognition is a spectral analysis which parameterizes the speech signal into feature vectors. This paper proposes a voice recognition model that is able to automatically classify and recognize a voice signal with background noise. The model uses the concept of spectrogram, pitch period, short time energy, zero crossing rate, mel frequency scale and cepestral...

chapter

Butterfly-like D-tree fusion strategy for real-time speech and music classification

Min Lu, Weibei Dou

2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) > 1 - 4

2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

Aimed at the problem of real-time speech and music discrimination, this paper proposes a frame-level classification method by using a novel “butterfly-like” fusion strategy based on decision tree (D-Tree).In our method, some homotypes of long-term features but in different time lengths are extracted to train each sub-classifier and make the fusion resultful. A testing experiment indicates our approach...

article

Mixed Stereo Audio Classification Using a Stereo-Input Mixed-to-Panned Level Feature

Austin Chen, Mark A. Hasegawa-Johnson

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 12 > 2025 - 2033

Many past studies have been conducted on speech/music discrimination due to the potential applications for broadcast and other media; however, it remains possible to expand the experimental scope to include samples of speech with varying amounts of background music. This paper focuses on the development and evaluation of two measures of the ratio between speech energy and music energy: a reference...

chapter

Using Adaboost Algorithm along with Artificial neural networks for efficient human emotion recognition from speech

Jasdeep Singh Bhalla, Anmol Aggarwal

2013 International Conference on Control, Automation, Robotics and Embedded Systems (CARE) > 1 - 6

2013 International Conference on Control, Automation, Robotics and Embedded Systems (CARE)

Emotion Recognition from speech has evolved itself as the most significant research area in the field of affective computing. In this paper, two emotional speech datasets, have been analyzed, based on gender distinction (male and female speech). This paper introduces a new approach of speech-emotion recognition based on the use of AdaBoost classification Algorithm. Artificial neural network has been...

chapter

Speech emotion recognition for SROL database using weighted KNN algorithm

Monica Feraru, Marius Zbancioc

Proceedings of the International Conference on ELECTRONICS, COMPUTERS and ARTIFICIAL INTELLIGENCE - ECAI-2013 > 1 - 4

2013 International Conference on Electronics, Computers and Artificial Intelligence (ECAI)

In this study, we utilized an improved version of the classical KNN algorithm which associates to each parameter from the features vectors weights according to their performance in the classification process. We obtained the recognition percents of emotions around 65–67%, for the Romanian language, on the SROL database, which are comparable with the results for other languages, with non-professional...

chapter

Parkinson's disease feature subset selection based on voice samples

Zahari Abu Bakar, Nur Farahiah Ibrahim, Rohilah Sahak, Nooritawati Md Tahir

2012 International Symposium on Computer Applications and Industrial Electronics (ISCAIE) > 163 - 166

2012 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)

In this study, semi automation prediction of PD is investigated based on twenty two features of voice samples extracted from 147 subjects. Firstly, the original features of voice are used for recognition of PD or otherwise with MLP as classifier and Levenberg Marquardt and Scaled Conjugate Gradient as training algorithm. Next, to identify the number of significant features amongst the original attributes,...

chapter

Classification of Cross-Correlation Functions for Speaker Localization

Xinwang Wan, Juan Liang

2012 International Conference on Computer Science and Service System > 494 - 497

2012 International Conference on Computer Science and Service System (CSSS)

Sound source localization plays a crucial role in many microphone arrays application, ranging from speech enhancement to human-computer interface in a reverberant noisy environment. The steered response power (SRP) using the phase transform (SRP-PHAT) method is one of the most popular modern localization algorithms. The SRP-based source localizers have been proved robust, however, the methods may...

chapter

A novel approach for emotion classification based on fusion of text and speech

Ali Houjeij, Layla Hamieh, Nader Mehdi, Hazem Hajj

2012 19th International Conference on Telecommunications (ICT) > 1 - 6

2012 19th International Conference on Telecommunications (ICT)

In this paper we design a system that adopts a novel approach for emotional classification from human dialogue based on text and speech context. Our main objective is to boost the accuracy of speech emotional classification by accounting for the features extracted from the spoken text. The proposed system concatenates text and speech features and feeds them as one input to the classifier. The work...

chapter

Sentence recognition from articulatory movements for silent speech interfaces

Jun Wang, Ashok Samal, Jordan R. Green, Frank Rudzicz

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4985 - 4988

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Recent research has demonstrated the potential of using an articulation-based silent speech interface for command-and-control systems. Such an interface converts articulation to words that can then drive a text-to-speech synthesizer. In this paper, we have proposed a novel near-time algorithm to recognize whole-sentences from continuous tongue and lip movements. Our goal is to assist persons who are...

chapter

On objective feature selection for affective sounds discrimination

Michal Chmulik, Roman Jarina, Michal Kuba

Proceedings ELMAR-2012 > 199 - 202

2012 54th International Symposium ELMAR

We present an objective acoustic feature selection for automatic affective sounds detection based on stochastic evolutionary optimization algorithms. Particle Swarm Optimization (PSO) as well as Genetic Algorithms (GA) are exploit to select the most appropriate audio features from a large set of available features. We performed experiments on a dataset containing about two hours of affective sounds...

chapter

Classifier combination for telegraphese restoration

Leo Willyanto Santoso

2011 International Conference on Uncertainty Reasoning and Knowledge Engineering > 1 > 79 - 82

2011 International Conference on Uncertainty Reasoning and Knowledge Engineering (URKE)

This paper presents a classifier combination to solve telegraphese restoration problem. By implementing more than one classifier, it can support other classifier, and finally it can improve the performance. Using supplied development data, training data and testing data, the best model had an accuracy F = 79 %.

chapter

Building robust emotion recognition system on heterogeneous speech databases

Won-Jung Yoon, Kyu-Sik Park

2011 IEEE International Conference on Consumer Electronics (ICCE) > 825 - 826

2011 IEEE International Conference on Consumer Electronics (ICCE)

This paper proposes a method to build a robust speech emotion recognition system for consumer electronic applications. Traditional method of two-class (neutral/anger) emotion recognition is extended into two-step hierarchical structure by using emotional characteristics and gender difference. Experimental results confirm the very stable and successful emotion classification performance over the traditional...

chapter

Vowel recognition from continuous articulatory movements for speaker-dependent applications

Jun Wang, J R Green, A Samal, T D Carrell

2010 4th International Conference on Signal Processing and Communication Systems > 1 - 7

2010 4th International Conference on Signal Processing and Communication Systems (ICSPCS 2010)

A novel approach was developed to recognize vowels from continuous tongue and lip movements. Vowels were classified based on movement patterns (rather than on derived articulatory features, e.g., lip opening) using a machine learning approach. Recognition accuracy on a single-speaker dataset was 94.02% with a very short latency. Recognition accuracy was better for high vowels than for low vowels....

chapter

Performance improvement in automatic gender identification using hierarchical clustering

M A Keyvanrad, M M Homayounpour

2010 5th International Symposium on Telecommunications > 900 - 903

2010 5th International Symposium on Telecommunications (IST)

In this paper a hierarchical structure is proposed for automatic gender identification (AGI). In this structure two clustering techniques are used. The first technique is divisive clustering for dividing speakers from each gender to some classes of speakers. The second clustering technique is agglomerative clustering for creating a hierarchical structure. Feature reduction is done by SOAP feature...

chapter

Speaker gender recognition using score level fusion by AdaBoost

M Ichino, N Komatsu, Wang Jian-Gang, Yau Wei Yun

2010 11th International Conference on Control Automation Robotics&Vision > 648 - 653

2010 11th International Conference on Control Automation Robotics & Vision (ICARCV 2010)

We propose speaker gender recognition achieved by using score level fusion by AdaBoost. Soft biometrics has been focused on because recognition by fusing biometric systems and soft biometric traits may improve the accuracy of recognition and decrease the time for this. Gender recognition is important for speaker recognition and can provide important information to speaker recognition systems. Mel-frequency...

chapter

Investigating analysis of speech content through text classification

S Ezzat, N E Gayar, M M Ghanem

2010 International Conference of Soft Computing and Pattern Recognition > 105 - 110

2010 International Conference of Soft Computing and Pattern Recognition (SoCPaR 2010)

The field of Text Mining has evolved over the past years to analyze textual resources. However, it can be used in several other applications. In this research, we are particularly interested in performing text mining techniques on audio materials after translating them into texts in order to detect the speakers' emotions. We describe our overall methodology and present our experimental results. In...

Keywords:
ACCURACY
SPEECH
CLASSIFICATION ALGORITHMS

Publication date

Set your own date range

Publication type

book (57)
article (6)

Keywords

FEATURE EXTRACTION (33)
SPEECH RECOGNITION (24)
TRAINING (24)
SPEECH PROCESSING (16)
SUPPORT VECTOR MACHINES (14)
MEL FREQUENCY CEPSTRAL COEFFICIENT (12)
ARTIFICIAL NEURAL NETWORKS (11)
PATTERN CLASSIFICATION (11)
ACOUSTICS (10)
DATA MINING (10)
DATABASES (10)
SIGNAL PROCESSING (10)
SPEAKER RECOGNITION (9)
LEARNING (ARTIFICIAL INTELLIGENCE) (8)
MACHINE LEARNING (8)
SUPPORT VECTOR MACHINE CLASSIFICATION (8)
HIDDEN MARKOV MODELS (7)
NATURAL LANGUAGE PROCESSING (7)
ROBUSTNESS (7)
TRAINING DATA (7)
COMPUTERS (6)
EMOTION RECOGNITION (6)
ESTIMATION (6)
NOISE (6)
PATTERN RECOGNITION (6)
PROBABILITY (6)
SIGNAL CLASSIFICATION (6)
SIGNAL PROCESSING ALGORITHMS (6)
ALGORITHM DESIGN AND ANALYSIS (5)
AUDIO SIGNAL PROCESSING (5)
COMPUTATIONAL MODELING (5)
MATHEMATICAL MODEL (5)
REAL TIME SYSTEMS (5)
TESTING (5)
TEXT ANALYSIS (5)
TRANSFORMS (5)
ADAPTATION MODEL (4)
CLUSTERING ALGORITHMS (4)
CORRELATION (4)
DATA MODELS (4)
DETECTION ALGORITHMS (4)
DETECTORS (4)
EDUCATIONAL INSTITUTIONS (4)
ELECTRONIC MAIL (4)
EQUATIONS (4)
FACE RECOGNITION (4)
IMAGE CLASSIFICATION (4)
IMAGE RECOGNITION (4)
INTERNET (4)
LABORATORIES (4)
MFCC (4)
PERIODIC STRUCTURES (4)
PREDICTION ALGORITHMS (4)
PROBABILITY DENSITY FUNCTION (4)
TAGGING (4)
ANALYTICAL MODELS (3)
AUDIO CLASSIFICATION (3)
BAYESIAN METHODS (3)
CEPSTRAL ANALYSIS (3)
CLASSIFICATION (3)
CLUSTERING (3)
COMPANIES (3)
COMPLEXITY THEORY (3)
CONFERENCES (3)
COVARIANCE MATRIX (3)
ELECTRODES (3)
ENCODING (3)
GAUSSIAN PROCESSES (3)
GMM (3)
HARMONIC ANALYSIS (3)
IMAGE COLOR ANALYSIS (3)
IMAGE PROCESSING (3)
IMAGE RESOLUTION (3)
IMAGE SEGMENTATION (3)
INSTRUMENTS (3)
MAXIMUM LIKELIHOOD ESTIMATION (3)
MEDICAL SIGNAL PROCESSING (3)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (3)
MUSIC (3)
OBJECT RECOGNITION (3)
OPTIMIZATION (3)
PRINCIPAL COMPONENT ANALYSIS (3)
PSYCHOLOGY (3)
SENSITIVITY (3)
SENSORS (3)
SPEECH CODING (3)
SUPPORT VECTOR MACHINE (3)
SVM (3)
TEXT MINING (3)
VECTOR QUANTIZATION (3)
VITERBI ALGORITHM (3)
WAVELET ANALYSIS (3)
WHITE NOISE (3)
WRITING (3)
ACOUSTIC MEASUREMENTS (2)
AIRCRAFT (2)
AUDIO SIGNAL (2)
more

INFONA - science communication portal

Advanced search

Advanced search

A novel feature selection based on Tibetan grammar for Tibetan text classification

Developmental pattern analysis and age prediction by extracting speech features and applying various classification techniques

Classification of emotions from speech using implicit features

A new direct access framework for speaker identification system

A Novel pattern recognition model for real-time voice data input

Butterfly-like D-tree fusion strategy for real-time speech and music classification

Mixed Stereo Audio Classification Using a Stereo-Input Mixed-to-Panned Level Feature

Using Adaboost Algorithm along with Artificial neural networks for efficient human emotion recognition from speech

Speech emotion recognition for SROL database using weighted KNN algorithm

Parkinson's disease feature subset selection based on voice samples

Classification of Cross-Correlation Functions for Speaker Localization

A novel approach for emotion classification based on fusion of text and speech

Sentence recognition from articulatory movements for silent speech interfaces

On objective feature selection for affective sounds discrimination

Classifier combination for telegraphese restoration

Building robust emotion recognition system on heterogeneous speech databases

Vowel recognition from continuous articulatory movements for speaker-dependent applications

Performance improvement in automatic gender identification using hierarchical clustering

Speaker gender recognition using score level fusion by AdaBoost

Investigating analysis of speech content through text classification

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options