-
Language Recognition Using Latent Dynamic Conditional Random Field Model with Phonological Features
- Back
Metadata
Document Title
Language Recognition Using Latent Dynamic Conditional Random Field Model with Phonological Features
Author
Boonsuk S, Suchato A, Punyabukkana P, Wutiwiwatchai C, Thatphithakkul N
Name from Authors Collection
Affiliations
Chulalongkorn University; National Science & Technology Development Agency - Thailand; National Electronics & Computer Technology Center (NECTEC)
Type
Article
Source Title
MATHEMATICAL PROBLEMS IN ENGINEERING
ISSN
1024-123X
Year
2014
Volume
2014
Issue
9
Open Access
Green Submitted, gold
Publisher
HINDAWI LTD
DOI
10.1155/2014/250160
Format
Abstract
Spoken language recognition (SLR) has been of increasing interest in multilingual speech recognition for identifying the languages of speech utterances. Most existing SLR approaches apply statistical modeling techniques with acoustic and phonotactic features. Among the popular approaches, the acoustic approach has become of greater interest than others because it does not require any prior language-specific knowledge. Previous research on the acoustic approach has shown less interest in applying linguistic knowledge; it was only used as supplementary features, while the current state-of-the-art system assumes independency among features. This paper proposes an SLR system based on the latent-dynamic conditional random field (LDCRF) model using phonological features (PFs). We use PFs to represent acoustic characteristics and linguistic knowledge. The LDCRF model was employed to capture the dynamics of the PFs sequences for language classification. Baseline systems were conducted to evaluate the features and methods including Gaussian mixture model (GMM) based systems using PFs, GMM using cepstral features, and the CRF model using PFs. Evaluated on the NIST LRE 2007 corpus, the proposed method showed an improvement over the baseline systems. Additionally, it showed comparable result with the acoustic system based on i-vector. This research demonstrates that utilizing PFs can enhance the performance.
Industrial Classification
Knowledge Taxonomy Level 1
Knowledge Taxonomy Level 2
Knowledge Taxonomy Level 3
Funding Sponsor
Thailand Graduate Institute of Science and Technology (TGIST); NSTDA; CU. Graduate School Thesis Grant
License
CC BY
Rights
Authors
Publication Source
WOS