2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Items from 1 to 20 out of 27 results

book

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

IEEE

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

chapter

Semantics driven intelligent front-end

Tamas Gergely, Edit Halmay, Miklos Szots, George Suciu, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents the work done in the context of the Speech2Process project for Speech Dialogue System applied in call-centers, specifically in the banking domain. In our proposed solution, the client communicates with the system by natural language sentences, which will be automatically recognized and semantically analysed. The paper describes innovative features of the selected approach, which...

chapter

The SWARA speech corpus: A large parallel Romanian read speech dataset

Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper introduces one of the largest Romanian speech datasets freely available for both academic and commercial use. The dataset comprises speech data recorded over the last year from 12 speakers, along with 5 other speakers previously recorded in a separate environment. The data was manually segmented at utterance-level and semi-automatically labelled at phone-level. The resulting corpus amounts...

chapter

Word associations in media posts related to disasters — A statistical analysis

Mironela Pirnau

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The paper aims to analyze the frequency of the posts in case of earthquakes and of the word associations included in such Social Media (SM) posts. Since important posts are shared by users in SM, the purpose was to identify the variation of a number of posts having unique content that occurred over a period of time in Social Media for a particular topic. The present study uses messages generated by...

chapter

Voice-related symptom and knowledge-bases using internet mining

Horia-Nicolai L. Teodorescu, Dan Gogalniceanu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

We report the first development of a set of symptoms for a medical condition where the set of symptoms is based exclusively on information collected on the Internet. Also, we lay down a general method for doing so. Third, we introduce the first systematic set of symptoms for temporo-mandibular disorder (TMD) exclusively related to speech and suggest a set of known quantitative parameters for the analysis...

chapter

[Front cover]

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Presents the front cover or splash screen of the proceedings record.

chapter

An analysis of tweets related to earthquakes, for the Romanian language

Speranta Cecilia Bolea

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper provides an analysis of tweets and of their vocabulary in a specific emergency situation — earthquakes, moreover, the correlations between several words from messages and on the linear regressions between word usages and the intensity of the earthquakes. We analyzed the vocabulary used on tweets about Romanian earthquakes with the vocabulary of tweets used for other European earthquakes.

chapter

Several classifiers for intruder detection applications

Elena Roxana Buhus, Lacrimioara Grama, Corneliu Rusu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The goal of this work is to present some possible intruder detection systems and the influence of impulse-like signals upon the overall classification accuracy. Two different scenarios are used: in the first scenario five sound classes are considered (last class belong to impulsive sounds — gunshots), while in the second scenario we dropped out the impulsive sound class. More classifiers are used...

chapter

Audio signal classification using Linear Predictive Coding and Random Forests

Lacrimioara Grama, Corneliu Rusu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 9

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The goal of this work is to present an audio signal classification system based on Linear Predictive Coding and Random Forests. We consider the problem of multiclass classification with imbalanced datasets. The signals under classification belong to the class of sounds from wildlife intruder detection applications: birds, gunshots, chainsaws, human voice and tractors. The proposed system achieves...

chapter

A rule-based approach to generating large phonetic databases for Romanian results of the AFLR project

Stefan - Stelian Diaconescu, Monica - Mihaela Rizea, Mihaela Ionescu, Andrei Minca, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents a rule-based approach for generating a large phonetic database for Romanian. The knowledge base is developed by means of the GRAALAN (Grammar Abstract Language) system. By inspecting dictionaries and corpora, we generate a phonetic database over 100,000 lemmas. Our database has a high degree of accuracy ensured by our rule-based method applied for generating phonetic transcriptions.

chapter

Fast method for ENF database build and search

Gheorghe Pop, Dragos Draghicescu, Dragos Burileanu, Horia Cucu, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The field of digital audio forensics has been driving a sustained research effort in the last decade. Current digital audio authentication frameworks include Electric Network Frequency (ENF) criterion as a must. The ENF-based techniques benefit greatly from the availability of reference databases, which are built using extraction mechanisms that continuously analyze the power line signal. To find...

chapter

SpeeD's DNN approach to Romanian speech recognition

Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents the main improvements brought recently to the large-vocabulary, continuous speech recognition (LVCSR) system for Romanian language developed by the Speech and Dialogue (SpeeD) research laboratory. While the most important improvement consists in the use of DNN-based acoustic models, instead of the classic HMM-GMM approach, several other aspects are discussed in the paper: a significant...

chapter

Speech recognition results for voice-controlled assistive applications

Alexandru Caranica, Horia Cucu, Corneliu Burileanu, Francois Portet, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Until recently, controlling a “smart home” consisted in setting up a series of applications and automation tools: scheduling when the air conditioning system could cool the room, turn on the lighting system at sunset, or just use ones phone to control several TV appliances or the garage door. Recent advances in speech recognition technology have made voice-controlled smart homes attainable, and many...

chapter

Prosodic phrases and contrast units in intonation

Doina Jitca

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The paper explains the relation between prosodic phrases and Information Structure (IS) by decomposing phrases into hierarchies of embedded contrast/communicative units (CUs). At any level of hierarchies, CUs contains IS partitions supported by two contrasted functional constituents. The functional categories are defined by using a two level IS model. Topic-Focus and CU_predicate-CU_argument are the...

chapter

Speech recognition in education: Voice geometry painter application

Lucian-Petru Tuca, Adrian Iftene

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Nowadays, we find ourselves in an era when the education is reforming and on the other side the technology is getting better, greater and more accessible than ever [1]. The Internet of Things is already altering health care, security, utilities, transportation, and household management. The devices themselves might be small, but they bring about major changes in how we live, work, and educate our...

chapter

Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language

Nursuriati Jamil, Farihah Apandi, Raseeda Hamzah

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources...

chapter

Old geographical corpora: A methodology for interpretative transcription

Mihaela Onofrei, Daniela Gifu, Cecilia Bolea

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper describes a study of the evolution of Romanian language, belonging to 18h and 19h centuries, from geographical domain, in order to develop an automatic recognition and interpretative transcription of Romanian historical heritage writings from Cyrillic into Latin, in printed forms. It is well known that the operation of interpretative transcription of texts written in Cyrillic is extremely...

chapter

A “small-data”-driven approach to dialogue systems for natural language human computer interaction

Tiberiu Boros, Stefan Daniel Dumitrescu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper describes a data-driven approach to handling natural language interaction between humans and devices. This approach enables example-based definition and tuning of interaction scenarios. Actions and parameters can be easily configured, requiring no prior knowledge of natural language processing and no previous experience with this type of systems. The platform requires a small amount of...

chapter

Building a representative audio base of syllables for Romanian language

Stefan - Stelian Diaconescu, Monica - Mihaela Rizea, Mihaela Ionescu, Andrei Minca, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 10

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

The aim of this work is to provide some insights regarding the effort of building a representative and wide coverage audio base of syllables for Romanian. The audio base comprises audio recordings of syllables extracted from the following types of syllable embedding: isolated-syllable, isolated-word and continuous speech. The list of syllables has been computed over the syllabified form of single-word...

chapter

Automatic speaker analysis 2.0: Hearing the bigger picture

Bjorn W. Schuller

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Automatic Speaker Analysis has largely focused on single aspects of a speaker such as her ID, gender, emotion, personality, or health state. This broadly ignores the interdependency of all the different states and traits impacting on the one single voice production mechanism available to a human speaker. In other words, sometimes we may sound depressed, but we simply have a flu, and hardly find the...

Publication date

Set your own date range

Content availability

Available (26)
None (1)

Keywords

SPEECH (10)
SPEECH RECOGNITION (7)
TRAINING (5)
ACOUSTICS (4)
HIDDEN MARKOV MODELS (4)
PRAGMATICS (4)
STRESS (4)
DATABASES (3)
DICTIONARIES (3)
STANDARDS (3)
AGRICULTURAL MACHINERY (2)
AUDIO SIGNAL CLASSIFICATION (2)
AUTOMATIC SPEECH RECOGNITION (2)
BIRDS (2)
CLASSIFICATION ALGORITHMS (2)
EARTHQUAKE (2)
EMOTION RECOGNITION (2)
FEATURE EXTRACTION (2)
GRAMMAR (2)
HUMAN VOICE (2)
INTRUDER DETECTION (2)
LINEAR PREDICTIVE CODING (2)
NATURAL LANGUAGE PROCESSING (2)
NATURAL LANGUAGES (2)
SPECTROGRAM (2)
SPEECH CORPUS (2)
SYNTACTICS (2)
WILDLIFE (2)
ADAPTATION MODELS (1)
ADAPTIVE FILTERS (1)
AFFINE PROJECTION ALGORITHM (APA) (1)
ARTIFICIAL INTELIGENCE (1)
ASPECT ORIENTED PROGRAMMING PRINCIPLES (1)
AUDIO AUTHENTICATION (1)
AUDIO BASE OF SYLLABLES (1)
AUDITORY SYSTEM (1)
BANKING (1)
BIOLOGICAL NEURAL NETWORKS (1)
BUILDINGS (1)
COMPILING (1)
COMPOUNDS (1)
COMPUTATIONAL MODELING (1)
COMPUTER LANGUAGES (1)
CONTINUOUS SPEECH (1)
CONTRAST UNITS (1)
CONTROL SYSTEMS (1)
CONVERGENCE (1)
COPPER (1)
CORRELATION (1)
DATA MINING (1)
DATA STRUCTURES (1)
DECISION TREES (1)
DIACHRONIC POS-TAGGER (1)
DIALOGUE SYSTEM (1)
DISTANT SPEECH RECOGNITION (1)
EARTHQUAKES (1)
EDUCATION (1)
ENCYCLOPEDIAS (1)
ENF CRITERION (1)
ENGINES (1)
ERROR ANALYSIS (1)
EUROPE (1)
FAST DATABASE SEARCH (1)
FILTERING ALGORITHMS (1)
FINITE IMPULSE RESPONSE FILTERS (1)
FRAME SEMANTICS (1)
FREQUENCY (1)
GEOMETRY (1)
GOOGLE (1)
GRAALAN (1)
GRAPHEME (1)
HEADPHONES (1)
HISTORICAL CORPUS (1)
HOME APPLIANCES (1)
HOME AUTOMATION (1)
HUMAN COMPUTER INTERACTION (1)
INFORMATION STRUCTURE (1)
INSTRUMENTS (1)
INTELLIGENT CALL-CENTRE (1)
INTERNET (1)
INTERNET OF THINGS (1)
INTERPRETATIVE TRASNCRIPTION (1)
INTERVIEWS (1)
IOT (1)
IP NETWORKS (1)
IS PARTITION HIEARCHY (1)
ISOLATED-SYLLABLE PRONUNCIATION (1)
ISOLATED-WORD PRONUNCIATION (1)
KALDI (1)
KNOWLEDGE BASE (1)
KNOWLEDGE BASED SYSTEMS (1)
KNOWLEDGE REPRESENTATION (1)
LABELING (1)
LANGUAGE RESOURCES (1)
LATTICE RESCORING (1)
LEMMA (1)
LINEAR REGRESSION (1)
LINGUISTICS (1)
LOGISTICS (1)
MALAY LANGUAGE (1)
more

INFONA - science communication portal

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)