Publications
^ denotes equal contribution
2024
- Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai (ACL-SRW’24) Parinthapat Pengpun, Can Udomcharoenchaikit, Weerayut Buaphet, Peerat Limkonchotiwat. Github: LINK
- Space Decomposition for Sentence Embedding (ACL’24 - Finding) Wuttikorn Ponwitayarat^, Peerat Limkonchotiwat^, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
- Identifying and Mitigating Annotation Bias in Natural Language Understanding using Causal Mediation Analysis (ACL’24 - Finding) Can Udomcharoenchaikit, Sitiporn Sae Lim, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
2023
- Typo-Robust Sentence Representation Learning for Dense Retrieval (ACL’23) Panuthep Tasawong, Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
- An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL 2023) Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
- mReFinED: An Efficient End-to-End Multilingual Entity Linking System (EMNLP’23 - Finding) Peerat Limkonchotiwat, Weiwei Cheng, Christos Christodoulopoulos, Amir Saffari, Jens Lehmann.
2022
- Thai Nested Named Entity Recognition Corpus (ACL’22 - Findings) Weerayut Buaphet, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Attapol Rutherford, Sarana Nutanong. Github: LINK
- CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering (NAACL’22 - Findings) Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
- ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022) Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
2021
- Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL’21 - Findings) Peerat Limkonchotiwat, Raheem Sawar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
- Robust fragment-based framework for cross-lingual sentence retrieval (EMNLP’21 - Findings) Nattapol Trijakwanich, Peerat Limkonchotiwat, Raheem Sawar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK
2020
- Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP’20) Peerat Limkonchotiwat, Raheem Sawar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong. Github: LINK