The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Most of the facial expression recognition methods consider that both training and testing data are equally distributed. As facial image sequences may contain information for heterogeneous sources, facial data may be asymmetrically distributed between training and testing, as it may be difficult to maintain the same quality and quantity of information. In this work, we present a novel classification...
In storytelling style, a storyteller generally uses prosodic variations with subtle speech nuances for the better apprehension of the listeners. It is achieved by emphasizing prominent words, using various emotions, mimicking voices and providing appropriate pauses. This work is a part of building the Story Text-to-Speech (TTS) [1] synthesis systems in Indian Languages, which aims at synthesizing...
A combination of several activities is required to solve a development task, but in the end, developer reports only part of it. It is difficult to understand whether all committed files were changed because of the reason in a given description. Software developers work on multiple tasks at once and often fail to distinguish them with separate commits because of their unknowingness, as well as of limitations...
In this paper, we analyse the emotion of children's stories in sentence level by considering the context information. We demonstrate that the emotion of a sentence is not only dependent on its content, but also affected by its neighbours in a story. A Hidden Markov Model (HMM) based method is proposed to model the emotion sequence and to detect whether a sentence is neutral or not. We show the important...
This paper presents a new symbol segmentation method based on AdaBoost with confidence weighted predictions for online handwritten mathematical expressions. The handwritten mathematical expression is preprocessed and rendered to an image. Then for each stroke, we compute three kinds of shape context features (stroke pair, local neighborhood and global shape contexts) with different scales, 21 stroke...
In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed...
Smart environments are slowly but surely entering our everyday life. Their design provides many challenges. Not only heterogeneous devices acting and interacting in a dynamic environment but also intentions and activities of humans have to be taken into account. Diverse processes are responsible for achieving unobtrusive and pro-active user assistance. Those can be structured into a pipeline of perception,...
This paper addresses the ongoing issue of tone error detection for Mandarin Computer Assisted Language Learning (CALL) systems. A novel approach based on clustering is proposed. The selection of different contextual tonal factors including Uni-tone, LBi-tone and RBi-tone are explored. Experimental results show that our proposed approach is feasible, obtaining an Equal Error Rate (EER) of 18.75% by...
Phonetic dictionaries are essential components of large-vocabulary natural language speaker-independent speech recognition systems. This paper presents a rule-based technique to generate Arabic phonetic dictionaries for a large vocabulary speech recognition system. The system used classic Arabic pronunciation rules, common pronunciation rules of Modern Standard Arabic, as well as morphologically driven...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.