2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

Items from 1 to 20 out of 26 results

chapter

[Front cover]

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1

Presents the front cover or splash screen of the proceedings record.

chapter

[Copyright notice]

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1

Presents the copyright notice for this conference proceedings record.

chapter

The SWARA speech corpus: A large parallel Romanian read speech dataset

Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper introduces one of the largest Romanian speech datasets freely available for both academic and commercial use. The dataset comprises speech data recorded over the last year from 12 speakers, along with 5 other speakers previously recorded in a separate environment. The data was manually segmented at utterance-level and semi-automatically labelled at phone-level. The resulting corpus amounts...

chapter

Word associations in media posts related to disasters — A statistical analysis

Mironela Pirnau

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The paper aims to analyze the frequency of posts in the case of earthquakes and the word associations found in such Social Media (SM) posts. Since important posts are shared by users in SM, the purpose was to identify the variation in the number of posts with unique content that occurred over a period of time in Social Media for a particular topic. The present study uses messages generated by...

chapter

An analysis of tweets related to earthquakes, for the Romanian language

Speranta Cecilia Bolea

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper analyzes tweets and their vocabulary in a specific emergency situation, earthquakes. It also examines the correlations between several words in the messages and the linear regressions between word usage and earthquake intensity. We compared the vocabulary used in tweets about Romanian earthquakes with the vocabulary of tweets about other European earthquakes.

chapter

Several classifiers for intruder detection applications

Elena Roxana Buhus, Lacrimioara Grama, Corneliu Rusu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The goal of this work is to present some possible intruder detection systems and the influence of impulse-like signals on the overall classification accuracy. Two different scenarios are used: in the first scenario, five sound classes are considered (the last class consists of impulsive sounds, namely gunshots), while in the second scenario the impulsive sound class is dropped. More classifiers are used...

chapter

A rule-based approach to generating large phonetic databases for Romanian results of the AFLR project

Stefan - Stelian Diaconescu, Monica - Mihaela Rizea, Mihaela Ionescu, Andrei Minca, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper presents a rule-based approach for generating a large phonetic database for Romanian. The knowledge base is developed by means of the GRAALAN (Grammar Abstract Language) system. By inspecting dictionaries and corpora, we generate a phonetic database of over 100,000 lemmas. The database has a high degree of accuracy, ensured by the rule-based method used to generate the phonetic transcriptions.

chapter

Fast method for ENF database build and search

Gheorghe Pop, Dragos Draghicescu, Dragos Burileanu, Horia Cucu, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The field of digital audio forensics has been driving a sustained research effort in the last decade. Current digital audio authentication frameworks include the Electric Network Frequency (ENF) criterion as a must. ENF-based techniques benefit greatly from the availability of reference databases, which are built using extraction mechanisms that continuously analyze the power line signal. To find...

chapter

Prosodic phrases and contrast units in intonation

Doina Jitca

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The paper explains the relation between prosodic phrases and Information Structure (IS) by decomposing phrases into hierarchies of embedded contrast/communicative units (CUs). At any level of the hierarchies, CUs contain IS partitions supported by two contrasted functional constituents. The functional categories are defined by using a two-level IS model. Topic-Focus and CU_predicate-CU_argument are the...

chapter

Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language

Nursuriati Jamil, Farihah Apandi, Raseeda Hamzah

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

Recognizing emotions using natural or spontaneous speech is extremely difficult compared to doing the same for acted or elicited speech. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources...

chapter

Old geographical corpora: A methodology for interpretative transcription

Mihaela Onofrei, Daniela Gifu, Cecilia Bolea

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper describes a study of the evolution of the Romanian language in the 18th and 19th centuries, within the geographical domain, in order to develop automatic recognition and interpretative transcription of printed Romanian historical heritage writings from Cyrillic into Latin script. It is well known that the operation of interpretative transcription of texts written in Cyrillic is extremely...

chapter

A “small-data”-driven approach to dialogue systems for natural language human computer interaction

Tiberiu Boros, Stefan Daniel Dumitrescu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper describes a data-driven approach to handling natural language interaction between humans and devices. This approach enables example-based definition and tuning of interaction scenarios. Actions and parameters can be easily configured, requiring no prior knowledge of natural language processing and no previous experience with this type of systems. The platform requires a small amount of...

chapter

Automatic speaker analysis 2.0: Hearing the bigger picture

Bjorn W. Schuller

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

Automatic Speaker Analysis has largely focused on single aspects of a speaker such as her ID, gender, emotion, personality, or health state. This broadly ignores the interdependency of all the different states and traits impacting on the one single voice production mechanism available to a human speaker. In other words, sometimes we may sound depressed, but we simply have a flu, and hardly find the...

chapter

Multi-resolution spectral input for convolutional neural network-based speech recognition

Laszlo Toth

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The convolutional deep neural network component applied frequently in current speech recognizers is trained on a context of consecutive spectral feature vectors. Here, we investigate whether we can extend the time span of this input and reduce the number of spectral features at the same time by using a multi-resolution spectrum as input. In the proposed multi-resolution scheme, the network processes...

chapter

Natural language processing model compiling natural language into byte code

Alexandru Trifan, Marilena Anghelus, Rodica Constantinescu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

The need for progress implies the need for time. Daily tasks have been automated to solve time issues, but they still need the input of a user. The need for interaction with different applications may endanger the user's life. The simplest way for these automations to be “life-saving” is to fully support speech recognition. Although, right now, this is done in an acceptable manner, the main problem...

chapter

Towards a continuous speech corpus for banking domain automatic speech recognition

George Suciu, Stefan-Adrian Toma, Romulus Cheveresan

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper presents the work done towards developing a Romanian speech corpus for automatic speech recognition in the banking domain. This work is done in the context of the Speech2Process project, which aims to create a system that makes interaction between customers and agents in the contact center much easier. The application that uses the banking corpus will provide automatic response to...

chapter

MaRePhoR — An open access machine-readable phonetic dictionary for Romanian

Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper introduces a novel open access resource, the machine-readable phonetic dictionary for Romanian, MaRePhoR. It contains over 70,000 word entries, each with a manually produced phonetic transcription. The paper describes the dictionary format and statistics, as well as an initial use of the phonetic transcription entries in building a grapheme-to-phoneme converter based on decision trees...

chapter

Investigation on the performances of APA in forensic noise reduction

Robert Alexandru Dobre, Constantin Paleologu, Silviu Ciochina, Cristian Negrescu, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

Multimedia files, either video or audio, can greatly influence the final verdict of a trial when accepted as evidence. The abundance of free editing software available nowadays makes forgery very easy. Audio messages, even if authentic, can in some cases be heavily masked by other signals and declared unusable. This paper presents the investigations on the performance of the affine...

chapter

Cassandra smart-home system description

Stefan Daniel Dumitrescu

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

This paper presents the architecture and technologies used to develop a voice-controlled system for home automation named Cassandra. We start with the goals of the project and a system description, then focus on the main components and the way they interact with each other. We illustrate this with a scenario where we ask the house to turn the lights off, going step-by-step over the communication sequence...

chapter

Semantics driven intelligent front-end

Tamas Gergely, Edit Halmay, Miklos Szots, George Suciu, more

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

This paper presents the work done in the context of the Speech2Process project on a Speech Dialogue System applied in call centers, specifically in the banking domain. In our proposed solution, the client communicates with the system in natural language sentences, which are automatically recognized and semantically analysed. The paper describes innovative features of the selected approach, which...
