About Me
I’m currently an AI Engineer at AI Singapore, NUS. My responsibility is to develop SEA LLMs and benchmarks. Before that, I was a Ph.D. student (5-year program) in the Natural Language and Representation Lab (NRL), Information Science and Technology (IST) at VISTEC, Thailand. My advisor and co-advisor were Assoc. Prof. Dr. Sarana Nutanong and Dr. Ekapol Chuangsuwanich, respectively. During my Ph.D., I was an intern at Amazon, Cambridge, GBR (2022-2023). Working with Weiwei Cheng (mentor), Christos Christodoulopoulos, Amir Saffari, Jens Lehmann, and Daniele Masato (PM). I also was a part of the WangchanX project as a Subject Matter Expert (SME) from 2019-2024. Our team developed two Thai large language models: WangchanGLM (based on XGLM) and WangchanLion (based on SeaLION). Moreover, we released a WangchanX toolkit for fine-tuning Thai LLMs. Our team also released the largest Thai instruction dataset in 2024.
My research topics
- Word segmentation (SEFR_CUT, OSKut)
- Multilingual and cross-lingual retrieval systems (RFR, CL-ReLKT, McCrolin, Distil CoT)
- Representation learning (ConGen, DST, SCT)
- Evaluation and benchmarks (SEACrowd, MT CS dataset, TH-EN Benchmark, CHIE)
- Information Extraction: Entity Linking (mReFinED, CFT) and Nested NER (Thai NNER).
News
- 7 papers accepted at EMNLP’24 (3 main, 2 finding, 2 workshops)!!!
- 1 paper at the ALVR workshop and 1 paper at the ACL-SRW (as the co-responding author) had been accepted!
- My latest two papers have been accepted at ACL 2024 (finding), such as a new sentence embedding and a debiasing technique in NLU.
- My intern project at Amazon, mReFinED, has been published at EMNLP 2023 (finding)!
- My latest sentence embedding paper, SCT, was accepted at TACL 2023.
- ConGen accepted at Finding of EMNLP 2022.
- In Fall 2022, I will intern at Amazon, Cambridge, GBR.
- CL-ReLKT accepted at Finding of NAACL 2022
- Thai Nested NER accepted at Finding of ACL 2022