2012 Oriental COCOSDA 2012 - International Conference on Speech Database and Assessments

chapter

Annotating conversational speech for corpus-based dialogue speech synthesizer — A first step

Hiroki Mori, Takatsugu Hitomi

2012 International Conference on Speech Database and Assessments > 135 - 140

This paper describes an HMM-based speech synthesizer that accepts a dimensional description of emotion as input. A spontaneous dialogue speech corpus designed for studying paralinguistic phenomena in expressive social interactions was used to train the models, with its emotional state descriptions serving as additional contextual factors. In the perceptual experiment, a very high correlation was...

chapter

Automatic scoring method considering quality and content of speech for SCAT Japanese speaking test

Naoko Okubo, Yuto Yamahata, Takeshi Yamada, Shingo Imai, et al.

2012 International Conference on Speech Database and Assessments > 72 - 77

We are now developing a Japanese speaking test called SCAT, which is part of J-CAT (Japanese Computerized Adaptive Test), a free online proficiency test for Japanese language learners. In this paper, we focus on the sentence-reading-aloud task and the sentence generation task in SCAT, and propose an automatic scoring method for estimating the overall score of answer speech, which is holistically determined...

chapter

The development of a Chinese learner corpus

Maolin Wang, Qi Gong, Jie Kuang, Ziyu Xiong

2012 International Conference on Speech Database and Assessments > 1 - 6

A learner corpus is a very important resource for research on second language acquisition. In this paper, the design and development of a Chinese learner corpus are described. The corpus is based on compositions by learners from 59 countries. The development procedure is reported, including copying, proofing, etc. Information such as type of writing, student number, time of writing,...

chapter

Phonetic manifestation and influential factors of pronominal anaphoric word “TA” in Chinese reading texts

Luying Hou, Yuan Jia

2012 International Conference on Speech Database and Assessments > 112 - 117

The present paper conducts a pioneering exploration of the phonetic manifestation of pronominal anaphora, and the factors that influence it, in Chinese reading texts, taking the third-person pronoun “ta” as an example. The F₀ and duration of different types of “ta” are compared; the stress degrees of “ta” and its surrounding syllables are also examined. The results demonstrate that: i) syntactic position plays...

chapter

Acoustic analysis of the vowel system of Yongding Hakka Chinese

Wai-Sum Lee

2012 International Conference on Speech Database and Assessments > 107 - 111

The paper is an acoustic analysis of the formant frequencies and durations of the two sets of Yongding Hakka vowels, [i a i] in CV syllables and [iʔ aʔ uʔ ɛʔ ɔʔ] in CVS syllables, from 10 male and 10 female speakers. The formant data show that the Yongding Hakka [ɨ] has the F-pattern of a mid central, rather than a high central, vowel, and the vowel [iʔ] has the F-pattern of [ɪʔ]. The durations of...

chapter

[Front cover]

2012 International Conference on Speech Database and Assessments > 1

Presents the front cover of this proceedings volume.

chapter

Review committee

2012 International Conference on Speech Database and Assessments > 1

Provides a listing of current committee members.

chapter

Speech corpus script design for TTS system applied on railway passenger service information broadcasting

Yu Zhenli, Wu Hong, Wu Mengchu, Chen Guilin

2012 International Conference on Speech Database and Assessments > 97 - 100

An approach to speech corpus script design for a customized TTS engine applied to railway passenger service information broadcasting is presented in this paper. Raw text material is collected according to the classification of railway service information. A modified greedy algorithm is proposed to generate an optimal corpus script based on statistics of the prosodic nature of the raw corpus. A...

chapter

Collection and annotation of Malay conversational speech corpus

Tze Yuang Chong, Xiong Xiao, Tien-Ping Tan, Eng Siong Chng, et al.

2012 International Conference on Speech Database and Assessments > 30 - 35

We report the development of a Malay conversational speech corpus as part of our research on LVCSR for spontaneous conversational speech. This corpus development effort is a collaboration between NTU and USM. The goal is to collect, transcribe, and annotate 50 hours of conversational Malay speech. The conversations are recorded over both close-talk and telephone channels, and both speakers' utterances...

chapter

Thai ASR development for network-based speech translation

Chai Wutiwiwatchai, Kwanchiva Thangthai, Phuttapong Sertsi

2012 International Conference on Speech Database and Assessments > 92 - 96

A network-based multilingual speech translation service under the Universal Speech Translation Advanced Research (U-STAR) consortium requires a well-tuned Thai automatic speech recognition (ASR) service. This paper summarizes the development of the service by utilizing both Thai read-speech and telephone speech (LOTUS-CELL 2.0) corpora. Tuning is performed regarding different sets of acoustic unit...

chapter

An activity based spoken language corpus of Nepali

Jens Allwood, Bhim Narayan Regmi, Sagun Dhakhwa, Ram Kisun Uranw

2012 International Conference on Speech Database and Assessments > 24 - 29

Language is used for communication and communication facilitates social activities. If we want to capture this, linguistic investigation has to be carried out within a wider context. Examination of linguistic communication in a wider context shows that it is multimodal. In order to study naturalistic multimodal communication using a corpus, the corpus should contain a combination of recordings, documentation,...

chapter

Fluency and L1 phonology interference on L2 English analysis of Japanese AESOP corpus

Mariko Kondo, Hajime Tsubaki

2012 International Conference on Speech Database and Assessments > 123 - 128

Read speech data of “The North Wind and the Sun” from fifty Japanese speakers in the Japanese AESOP corpus were analyzed by automatic alignment using the HTK toolkit with a modified TIMIT dictionary. The results showed typical phonetic and phonological problems in English pronunciation by Japanese speakers that have often been discussed in EFL. The Japanese subjects' English fluency was evaluated by 8...

chapter

Hub page

2012 International Conference on Speech Database and Assessments > 1

Presents the proceedings page that links various sections of the overall electronic record.

chapter

Frequently asked questions

2012 International Conference on Speech Database and Assessments > 1 - 4

Provides instructions on viewing the proceedings articles in PDF format and support information for CD users.

chapter

Letter-to-sound conversion using coupled Hidden Markov Models for lexicon compression

Hao Che, Jianhua Tao, Shifeng Pan

2012 International Conference on Speech Database and Assessments > 141 - 144

Letter-to-Sound (LTS) conversion, which is used to compress the lexicon for embedded applications, has become an important part of Text-to-Speech (TTS) systems. In this paper, coupled Hidden Markov Models (CHMM) for LTS conversion are proposed. In the preprocessing phase, many-to-many alignment is adopted for lexicon alignment instead of the one-to-one alignment commonly used in previous...

chapter

Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages

Trung-Nghia Phung, Mai Chi Luong, Masato Akagi

2012 International Conference on Speech Database and Assessments > 129 - 134

Concatenative speech synthesis (CSS) provides the greatest naturalness. However, it requires a huge stored database, resulting in a huge footprint. Reducing the size of the stored database while preserving the quality of CSS, or improving the quality-to-size ratio (QSr), is still a challenge. In this paper, we propose a method of transforming fundamental frequency (F0) contours of lexical tones, developed...

chapter

CENSREC-2-AV: An evaluation framework for bimodal speech recognition in real environments

Naoya Ukai, Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu, et al.

2012 International Conference on Speech Database and Assessments > 88 - 91

In this paper, we introduce a bimodal speech recognition corpus recorded in real environments. In recent years, speech recognition technology has been used in noisy conditions. It has therefore become necessary to achieve higher recognition accuracy in real environments. As one solution, bimodal speech recognition using audio and non-audio information is being studied. However, there are few databases...

chapter

Grapheme-to-phoneme conversion methods for minority language conditions

Mengxue Cao, Steve Renals, Peter Bell, Aijun Li, et al.

2012 International Conference on Speech Database and Assessments > 151 - 156

This study investigates grapheme-to-phoneme conversion approaches for minority language conditions. Instead of the isolated-word data used for major languages, sentence-form data is defined to be a proper form of training data for minority languages. The joint-multigram model and the hidden Markov model were examined in this study. The “treat-sentence-as-word” training method and the forced-alignment...

chapter

Analysis and synthesis of F₀ contours of declarative, interrogative, and imperative utterances of Bangla

Anal Haque Warsi, Tulika Basu, Keikichi Hirose, Hiroya Fujisaki

2012 International Conference on Speech Database and Assessments > 56 - 61

This study first examines the differences in the gross features of the fundamental frequency contour (the F₀ contour) responsible for discriminating utterances of three sentence types, namely declarative, imperative and interrogative, in Bangla. In order to realize these differences in speech synthesis, these differences are then interpreted in terms of differences in the parameters of the command-response...

chapter

Towards language preservation: Preliminary collection and vowel analysis of Indonesian ethnic speech data

Auliya Sani, Sakriani Sakti, Graham Neubig, Tomoki Toda, et al.

2012 International Conference on Speech Database and Assessments > 118 - 122

Multilingualism in Indonesia gradually faces a state of catastrophe. Although several projects have been initiated for cultural preservation, the available technology that could support communication between elders and younger people within indigenous communities, as well as with people outside the community, is still very rare in Indonesia. This paper presents the first step of long-term development...
