About Me

I’m currently a Research Fellow at AI Singapore, NTU. I research and develop Southeast Asian (SEA) NLP resources, such as SEA-LION, SEA-HELM, SEA-Guard, and SEA-LION Embedding. These projects are industry-centric and reflect the needs of SEA, covering LLMs, safety, and embeddings for RAG. I am also the main contributor to SEACrowd, a project that collects SEA data and produces SEA research, such as SEACrowd (EMNLP’24) and SEA-VL (ACL’25). In addition to AI Singapore, I am an external researcher at Chulalongkorn University, Thailand, and a member of the advisory board of SIGSEA.

Before that, I was a Ph.D. student (5-year program) in the Natural Language and Representation Lab (NRL), Information Science and Technology (IST) at VISTEC, Thailand. My advisor and co-advisor were Assoc. Prof. Dr. Sarana Nutanong and Dr. Ekapol Chuangsuwanich, respectively. My research topics were representation learning and multilingual learning.

During my Ph.D., I was an intern at Amazon, Cambridge, GBR (2022-2023), where I worked with Weiwei Cheng (mentor), Christos Christodoulopoulos, Amir Saffari, Jens Lehmann, and Daniele Masato (PM). I was also part of the WangchanX project as a Subject Matter Expert (SME) from 2019 to 2024. Our team developed two Thai large language models: WangchanGLM (based on XGLM) and WangchanLion (based on SeaLION). We also released the WangchanX toolkit for fine-tuning Thai LLMs and the largest Thai instruction dataset in 2024-25.

My research topics

Contact and Collaboration

  • I mostly work on SEA languages, models, and benchmarks, including safety in AI, encoder and decoder models, and generalization benchmarks (i.e., out-of-domain or low-resource languages). If you’re looking for a collaborator on these topics, feel free to contact me at peerat(at)aisingapore.org
  • AI Singapore is also looking for interns, engineers, and researchers who are passionate about large language models, especially SEA LLMs. Feel free to contact me and attach your CV to the email.

News (2026)

  1. I just released a SOTA embedding model for SEA languages called SEA-LION Embedding. It outperforms E5-large and Qwen-Embedding on SEA languages.
  2. I will organize WiNLP at EMNLP’26 in Budapest, Hungary.
  3. BURMESE-SAN got accepted at LREC 2026.

News (20XX-2025)

  1. SEA-LION got accepted to the AACL Main Conference, and it won the Best Resource Award! That makes 2 *CL awards for me in a single year!!
  2. 1 paper got accepted to the EMNLP Main Conference (co-first author): WangchanThaiInstruction.
  3. 5 papers got accepted at ACL: 2 main, 2 Findings, and 1 at LLMSEC@ACL.
  4. I just got promoted to Research Fellow at AI Singapore. I will mainly focus on research in LLMs and NLP.
  5. WorldCuisines received the Best Theme Paper Award at NAACL’25!
  6. I just got promoted to be an invited researcher at Chulalongkorn, where I got funded to do research and publish at top-tier conferences and journals.
  7. 7 papers accepted at EMNLP’24 (3 main, 2 Findings, 2 workshops)!!!
  8. 4 papers accepted at ACL’24 (2 Findings, 1 SRW, 1 workshop).
  9. My intern project at Amazon, mReFinED, has been published at EMNLP 2023 (Findings)!
  10. My latest sentence embedding paper, SCT, was accepted at TACL 2023.
  11. ConGen accepted at Findings of EMNLP 2022.
  12. In Fall 2022, I will intern at Amazon, Cambridge, GBR.
  13. CL-ReLKT accepted at Findings of NAACL 2022.
  14. Thai Nested NER accepted at Findings of ACL 2022.