In conjunction with ACL-IJCNLP 2009

The 7th Workshop on Asian Language Resources

Suntec City, Singapore
August 6-7, 2009

The workshop is officially endorsed by FLaReNet.

Last modified: Wed Jun 24 09:07:36 JST 2009


Language resources play an important role as corpus-based, stochastic, and learning approaches are introduced to natural language processing research. Many research units put great effort into developing corpora for their particular purposes, and some even focus on compiling various kinds of language resources. Asia, a land of great language variation, suffers from a shortage of shared resources and of shared experience in cross-language problem solving. Several reports describe success in constructing and using corpora in many dimensions. However, there have been few efforts to establish common formats or frameworks for handling these languages. Reorganizing existing resources and finding guidelines for corpus development have become significant issues in current research. The workshop is organised under the Asian Language Resources Committee (ALRC) of AFNLP, aiming at the following goals. To achieve these goals, we call for technical (and non-technical) papers concerning, but not limited to, the following issues.

Important Dates

Paper submission due 8 May, 2009
Demo session requests due 8 May, 2009
Notification of acceptance 1 June, 2009
Camera-ready papers due 7 June, 2009
ACL-IJCNLP 2009 Workshops 6-7 August, 2009

Program Committee


Virach Sornlertlamvanich (virach[at-mark]




Thursday, August 6, 2009

9:10–9:35 Enhancing the Japanese WordNet
Francis Bond, Hitoshi Isahara, Sanae Fujita, Kiyotaka Uchimoto, Takayuki Kuribayashi and Kyoko Kanzaki
9:35–10:00 An Empirical Study of Vietnamese Noun Phrase Chunking with Discriminative Sequence Models
Le Minh Nguyen, Huong Thao Nguyen, Phuong Thai Nguyen, Tu Bao Ho and Akira Shimazu
10:30–10:55 Corpus-based Sinhala Lexicon
Ruvan Weerasinghe, Dulip Herath and Viraj Welgama
10:55–11:20 Analysis and Development of Urdu POS Tagged Corpus
Ahmed Muaz, Aasim Ali and Sarmad Hussain
11:20–11:45 Annotating Dialogue Acts to Construct Dialogue Systems for Consulting
Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka and Satoshi Nakamura
11:45–12:10 Assas-band, an Affix-Exception-List Based Urdu Stemmer
Qurat-ul-Ain Akram, Asma Naseer and Sarmad Hussain
12:10–13:50 Lunch break
13:50–14:15 Automated Mining Of Names Using Parallel Hindi-English Corpus
R. Mahesh K. Sinha
14:15–14:40 Basic Language Resources for Diverse Asian Languages: A Streamlined Approach for Resource Creation
Heather Simpson, Kazuaki Maeda and Christopher Cieri
14:40–15:05 Finite-State Description of Vietnamese Reduplication
Le Hong Phuong, Nguyen Thi Minh Huyen and Roussanaly Azim
15:05–15:30 Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions
Xinhui Hu, Ryosuke Isotani and Satoshi Nakamura
16:00–16:15 Bengali Verb Subcategorization Frame Acquisition - A Baseline Model
Somnath Banerjee, Dipankar Das and Sivaji Bandyopadhyay
16:15–16:30 Phonological and Logographic Influences on Errors in Written Chinese Words
Chao-Lin Liu, Kan-Wen Tien, Min-Hua Lai, Yi-Hsuan Chuang and Shih-Hung Wu
16:30–16:45 Resource Report: Building Parallel Text Corpora for Multi-Domain Translation System
Budiono, Hammam Riza and Chairil Hakim
16:45–17:00 A Syntactic Resource for Thai: CG Treebank
Taneth Ruangrajitpakorn, Kanokorn Trakultaweekoon and Thepchai Supnithi
17:00–17:15 Part of Speech Tagging for Mongolian Corpus
Purev Jaimai and Odbayar Chimeddorj

Friday, August 7, 2009

8:45–9:10 Interaction Grammar for the Persian Language: Noun and Adjectival Phrases
Masood Ghayoomi and Bruno Guillaume
9:10–9:35 KTimeML: Specification of Temporal and Event Expressions in Korean Text
Seohyun Im, Hyunjo You, Hayun Jang, Seungho Nam and Hyopil Shin
9:35–10:00 CWN-LMF: Chinese WordNet in the Lexical Markup Framework
Lung-Hao Lee, Shu-Kai Hsieh and Chu-Ren Huang
10:30–10:55 Philippine Language Resources: Trends and Directions
Rachel Edita Roxas, Charibeth Cheng and Nathalie Rose Lim
10:55–11:20 Thai WordNet Construction
Sareewan Thoongsup, Thatsanee Charoenporn, Kergrit Robkop, Tan Sinthurahat, Chumpol Mokarat, Virach Sornlertlamvanich and Hitoshi Isahara
11:20–11:45 Query Expansion using LMF-Compliant Lexical Resources
Takenobu Tokunaga, Dain Kaplan, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Virach Sornlertlamvanich, Thatsanee Charoenporn, Yingju Xia, Chu-Ren Huang, Shu-Kai Hsieh and Kiyoaki Shirai
11:45–12:10 Thai National Corpus: A Progress Report
Wirote Aroonmanakun, Kachen Tansiri and Pairit Nittayanuparp
12:10–13:50 Lunch break
13:50–14:15 The FLaReNet Thematic Network: A Global Forum for Cooperation
Nicoletta Calzolari and Claudia Soria
14:15–14:40 Towards Building Advanced Natural Language Applications - An Overview of the Existing Primary Resources and Applications in Nepali
Bal Krishna Bal
14:40–15:05 Using Search Engine to Construct a Scalable Corpus for Vietnamese Lexical Development for Word Segmentation
Doan Nguyen
15:05–15:30 Word Segmentation Standard in Chinese, Japanese and Korean
Key-Sun Choi, Hitoshi Isahara, Kyoko Kanzaki, Hansaem Kim, Seok Mun Pak and Maosong Sun
16:00–17:50 Panel discussion "ALR and FLaReNet"