Advanced search

From:

To:

Items from 1 to 20 out of 23 results

chapter

Noise impact assessment on the accuracy of the determination of speaker’s gender by using method of the cumulant coefficients

Kostiantyn Pylypenko, Arkadiy Prodeus

2015 XI International Conference on Perspective Technologies and Methods in MEMS Design (MEMSTECH) > 102 - 106

2015 XI International Conference on Perspective Technologies and Methods in MEMS Design (MEMSTECH

A new method of classification of a speaker’s gender based on cumulant coefficients is proposed. The effect of an additive noise and measurement error of classification signs on accuracy of classification is analyzed. The expediency of construction of an adaptive system of classification operating with considering of masking of a speech signal by noise is shown. Comparison of the proposed method of...

chapter

A learning-based approach for Romanian syllabification and stress assignment

Diana Balc, Anamaria Beleiu, Rodica Potolea, Camelia Lemnaru

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 37 - 42

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

This paper tackles the Romanian syllabification and stress assignment problems, and proposes an efficient machine learning based solution. We show that by designing the appropriate feature sets for each specific problem, learning algorithms achieve satisfactory accuracy rates for both problems (∼92% for syllabification, ∼85% for stress assignment), even for relatively small training set sizes. We...

chapter

Unsupervised feature learning for urban sound classification

Justin Salamon, Juan Pablo Bello

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 171 - 175

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recent studies have demonstrated the potential of unsupervised feature learning for sound classification. In this paper we further explore the application of the spherical k-means algorithm for feature learning from audio signals, here in the domain of urban sound classification. Spherical k-means is a relatively simple technique that has recently been shown to be competitive with other more complex...

chapter

A syllable-based Turkish speech recognition system by using time delay neural networks (TDNNs)

Burcu Can, Harun Artuner

2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR) > 219 - 224

2013 International Conference of Soft Computing and Pattern Recognition (SoCPaR)

In this paper, we present a model for Turkish speech recognition. The model is syllable-based, where the recognition is performed through syllables as speech recognition units. The main goal of the model is to recognize as much as possible of a given continuous speech by identifying only a small set of syllables in the language. For that purpose, only the syllable types with a higher frequency are...

chapter

Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-armed Bandits

Alexey Tarasov, Sarah Jane Delany, Brian Mac Namee

2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing > 979 - 980

2012 International Conference on Privacy, Security, Risk and Trust (PASSAT)

Many application areas that use supervised machine learning make use of multiple raters to collect target ratings for training data. Usage of multiple raters, however, inevitably introduces the risk that a proportion of them will be unreliable. The presence of unreliable raters can prolong the rating process, make it more expensive and lead to inaccurate ratings. The dominant, "static" approach...

chapter

Classifier combination for telegraphese restoration

Leo Willyanto Santoso

2011 International Conference on Uncertainty Reasoning and Knowledge Engineering > 1 > 79 - 82

2011 International Conference on Uncertainty Reasoning and Knowledge Engineering (URKE)

This paper presents a classifier combination to solve telegraphese restoration problem. By implementing more than one classifier, it can support other classifier, and finally it can improve the performance. Using supplied development data, training data and testing data, the best model had an accuracy F = 79 %.

chapter

Robust representations of cortical speech and language information

Janet M. Baker, Alexander M. Chan, Ksenija Marinkovic, Eric Halgren, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 785 - 788

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Cortical recordings with high temporal resolution enable the tracking of neuronal excitation in response to stimuli. Here intra and extracranial recordings are analyzed from experiments presenting varied speech and language stimuli to human subjects. These studies demonstrate that information about speech and language is widely distributed across the brain, both spatially and temporally. Analyses...

chapter

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

Iosif Mporas, Todor Ganchev, Otilia Kocsis, Nikos Fakotakis

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5176 - 5179

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We present a speech pre-processing scheme (SPPS) for robust speech recognition in the moving motorcycle environment. The SPPS is dynamically adapted during the run-time operation of the speech front-end, depending on short-time characteristics of the acoustic environment. In detail, the fast varying acoustic environment is modeled by GMM clusters based on which a selection function determines the...

chapter

Acted vs. natural frustration and delight: Many people smile in natural frustration

M Hoque, R W Picard

Face and Gesture 2011 > 354 - 359

2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011)

This work is part of research to build a system to combine facial and prosodic information to recognize commonly occurring user states such as delight and frustration. We create two experimental situations to elicit two emotional states: the first involves recalling situations while expressing either delight or frustration; the second experiment tries to elicit these states directly through a frustrating...

chapter

On Development and Evaluation of a Chunker for Bangla

S De, A Dhar, S Biswas, U Garain

2011 Second International Conference on Emerging Applications of Information Technology > 321 - 324

Second International Conference on Emerging Applications of Information Technology (EAIT 2011)

A rule based Local Word Grouper (LWG) or Chunker has been attempted and applied to Bangla. The Chunker has been evaluated with PARSEVAL method. The paper describes implementation of the Chunker as well as the evaluation method in detail. The evaluation shows a precision value of 95.05%, recall value of 94.33% and f-score value of 94.62 which are up to the standard. The results also show a substantial...

chapter

Vowel recognition from continuous articulatory movements for speaker-dependent applications

Jun Wang, J R Green, A Samal, T D Carrell

2010 4th International Conference on Signal Processing and Communication Systems > 1 - 7

2010 4th International Conference on Signal Processing and Communication Systems (ICSPCS 2010)

A novel approach was developed to recognize vowels from continuous tongue and lip movements. Vowels were classified based on movement patterns (rather than on derived articulatory features, e.g., lip opening) using a machine learning approach. Recognition accuracy on a single-speaker dataset was 94.02% with a very short latency. Recognition accuracy was better for high vowels than for low vowels....

chapter

Emotions analysis of speech for call classification

Esraa Ali Hassan, Neamat El Gayar, M Ghanem Moustafa

2010 10th International Conference on Intelligent Systems Design and Applications > 242 - 247

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

Most existing research in the area of emotions recognition has focused on short segments or utterances of speech. In this paper we propose a machine learning system for classifying the overall sentiment of long conversations as being Positive or Negative. Our system has three main phases, first it divides a call into short segments, second it applies machine learning to recognize the emotion for each...

chapter

An Exploration of Native Speakers' Eye Fixations in Reading Chinese Text

Chao-Lin Liu, Juei-Yu Weng, Yi-Hsuan Chuang, Jie-Li Tsai

2010 International Conference on Technologies and Applications of Artificial Intelligence > 66 - 71

2010 International Conference on Technologies and Applications of Artificial Intelligence (TAAI 2010)

We collected the locations of eye fixations of Chinese native speakers when they read four Chinese articles, and attempted to analyze how the contextual linguistic and personal information influence the landing positions within the landing sites. In addition, we employed machine learning techniques to build models for the prediction of the landing positions. The models performed well for the closed...

chapter

Event-event relation identification: A CRF based approach

A K Kolya, A Ekbal, S Bandyopadhyay

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 8

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

Temporal information extraction is a popular and interesting research field in the area of Natural Language Processing (NLP). The main tasks involve the identification of event-time, event-document creation time and event-event relations in a text. In this paper, we take up Task C that involves identification of relations between the events in adjacent sentences under the TimeML framework. We use...

chapter

Using Chinese part-of-speech patterns for sentiment phrase identification and opinion extraction in user generated reviews

Ting-Chun Peng, Chia-Chun Shih

2010 Fifth International Conference on Digital Information Management (ICDIM) > 120 - 127

2010 Fifth International Conference on Digital Information Management (ICDIM 2010)

Accelerated growth of the Internet has enabled users worldwide to share their feelings and experiences. User-generated content (UGC) websites are the most abundant sources of user reviews. Accurately identifying sentiment phrases is essential to understand the expressed opinions in user reviews. To achieve this, part-of-speech (POS) patterns of phrases are useful. However, previous studies for Chinese...

chapter

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Minho Kim, Youngim Jung, Hyuk-Chul Kwon

2009 21st IEEE International Conference on Tools with Artificial Intelligence > 323 - 327

2009 21st IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2009)

Prediction of the prosodic phrase boundary is a potent influence on the performance of speech recognition and voice synthesis systems. We propose a statistical approach using efficient learning features for the natural prediction of the Korean prosodic phrase boundary. These new features reflect factors that affect the generation of the prosodic phrase boundary better than existing learning features...

chapter

Towards evidence based diagnosis of voice disorders using phonovibrograms

J. Lohscheller

2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies > 1 - 4

2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2009)

Clinical diagnosis of voice disorders is based on examination of the oscillating vocal folds during phonation with state-of-the-art endoscopic high-speed cameras. Commonly, the offline analysis is performed in a subjective and time-consuming manner via slow-motion playback. In this study an objective method for overcoming this drawback is presented being based on phonovibrogram (PVG) images. For a...

chapter

Classifiers combination to arabic morphosyntactic disambiguation

M. Albared, N. Omar, M.J. Ab Aziz

2009 International Conference on Electrical Engineering and Informatics > 1 > 163 - 171

2009 International Conference on Electrical Engineering and Informatics (ICEEI)

Parts of speech tagging forms the important pre-processing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine...

chapter

Recognizing Textual Entailment Based on WordNet

Jin Feng, Yiming Zhou, T. Martin

2008 Second International Symposium on Intelligent Information Technology Application > 2 > 27 - 31

2008 Second International Symposium on Intelligent Information Technology Application

Textual entailment recognition (RTE) is one of the fundamental problems in many natural language processing applications. This paper proposes a new method for lexical entailment measure which is based on exploiting the information in the WordNet glosses. Further we perform textual entailment recognition based on this method and cast the RTE problem to be a classification problem. The experimental...

chapter

Discriminating Mood Taxonomy of Chinese Traditional Music and Western Classical Music with Content Feature Sets

Wen Wu, Lingyun Xie

2008 Congress on Image and Signal Processing > 5 > 148 - 152

International Congress on Image and Signal Processing (CISP 2008)

According to numbers of music cognitive experiments, moods or emotions in music could be categorical. Since mood classifications are commonly used to structure the large collections of music available on the Web, automatic discrimination between mood taxonomy of Chinese traditional music and Western classical music would be a valuable addition to music information retrieval (MIR) systems. In this...

Keywords:
ACCURACY
SPEECH
MACHINE LEARNING

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (10)
TRAINING (9)
CLASSIFICATION ALGORITHMS (8)
SPEECH RECOGNITION (8)
LEARNING (ARTIFICIAL INTELLIGENCE) (6)
NATURAL LANGUAGE PROCESSING (6)
TRAINING DATA (6)
ARTIFICIAL NEURAL NETWORKS (5)
ACOUSTICS (4)
CONFERENCES (4)
DATA MINING (4)
NOISE (4)
SUPPORT VECTOR MACHINES (4)
TAGGING (4)
TRANSFORMS (4)
ALGORITHM DESIGN AND ANALYSIS (3)
EIGENVALUES AND EIGENFUNCTIONS (3)
ESTIMATION (3)
MACHINE LEARNING ALGORITHMS (3)
PATTERN CLASSIFICATION (3)
SIGNAL PROCESSING (3)
SPEECH PROCESSING (3)
TESTING (3)
WAVELET TRANSFORMS (3)
ANALYTICAL MODELS (2)
BRIGHTNESS (2)
CLUSTERING ALGORITHMS (2)
CLUSTERING METHODS (2)
COMPLEXITY THEORY (2)
COMPUTATIONAL MODELING (2)
COMPUTERS (2)
CONTEXT (2)
CORRELATION (2)
COVARIANCE MATRIX (2)
DATABASES (2)
DETECTION ALGORITHMS (2)
DETECTORS (2)
EMOTION RECOGNITION (2)
EQUATIONS (2)
FACE RECOGNITION (2)
FILTER BANK (2)
FOURIER TRANSFORMS (2)
GAIN (2)
HIDDEN MARKOV MODELS (2)
IMAGE PROCESSING (2)
IMAGE RECOGNITION (2)
INFORMATION RETRIEVAL (2)
MATHEMATICAL MODEL (2)
MEL FREQUENCY CEPSTRAL COEFFICIENT (2)
OPTIMIZATION (2)
PATTERN RECOGNITION (2)
PERIODIC STRUCTURES (2)
PRESSES (2)
PRINCIPAL COMPONENT ANALYSIS (2)
ROBUSTNESS (2)
SIGNAL PROCESSING ALGORITHMS (2)
SIGNAL TO NOISE RATIO (2)
SPEECH ANALYSIS (2)
STRESS (2)
SUPPORT VECTOR MACHINE (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
TEXT ANALYSIS (2)
TUTORIALS (2)
ACOUSTIC NOISE (1)
ACTED FRUSTRATION (1)
ACTIVE CONTOURS (1)
ADDITIVES (1)
APPROXIMATION ALGORITHMS (1)
APPROXIMATION METHODS (1)
ARABIC MORPHOSYNTACTIC DISAMBIGUATION (1)
ARTICULATION (1)
ARTICULATION-TO-ACOUSTICS SYNTHESIZER (1)
ARTIFICIAL INTELLIGENCE (1)
AUDIO SIGNAL (1)
AUDIO SIGNAL PROCESSING (1)
AVATARS (1)
BANGLA (1)
BAYESIAN METHODS (1)
BINARY CLASSIFIER (1)
BIOMECHANICS (1)
BRAIN (1)
BRIDGE CRACK DETECTION (1)
BRIDGES (1)
BRIDGES (STRUCTURES) (1)
CALL CLASSIFICATION (1)
CAMERAS (1)
CATEGORIZATION (1)
CHINESE NATIVE SPEAKERS (1)
CHINESE PART-OF-SPEECH PATTERNS (1)
CHINESE SEGMENTATION (1)
CHINESE TEXT (1)
CHUNK PARSING (1)
CHUNKER (1)
CHUNKER IN BANGLA (1)
CIVIL ENGINEERING (1)
CLASSIFICATION OF CALLS (1)
CLASSIFICATION PROBLEM (1)
more

INFONA - science communication portal

Advanced search

Advanced search

Noise impact assessment on the accuracy of the determination of speaker’s gender by using method of the cumulant coefficients

A learning-based approach for Romanian syllabification and stress assignment

Unsupervised feature learning for urban sound classification

A syllable-based Turkish speech recognition system by using time delay neural networks (TDNNs)

Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-armed Bandits

Classifier combination for telegraphese restoration

Robust representations of cortical speech and language information

Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment

Acted vs. natural frustration and delight: Many people smile in natural frustration

On Development and Evaluation of a Chunker for Bangla

Vowel recognition from continuous articulatory movements for speaker-dependent applications

Emotions analysis of speech for call classification

An Exploration of Native Speakers' Eye Fixations in Reading Chinese Text

Event-event relation identification: A CRF based approach

Using Chinese part-of-speech patterns for sentiment phrase identification and opinion extraction in user generated reviews

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Towards evidence based diagnosis of voice disorders using phonovibrograms

Classifiers combination to arabic morphosyntactic disambiguation

Recognizing Textual Entailment Based on WordNet

Discriminating Mood Taxonomy of Chinese Traditional Music and Western Classical Music with Content Feature Sets

Filter options

Publication date

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options