IPN at MWE-2026 PARSEME 2.0 Subtask 1: MWE identification via related languages and harnessing thinking mode

Conference contribution (Article) › Research › Peer reviewed

Publication data

By	Anna Hülsing, Noah-Manuel Michael, Daniel Ignacio Mora Melanchthon, Andrea Horbach
Original language	English
Published in	Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026)
Pages	177-186
Editor (Publisher)	Association for Computational Linguistics
ISBN	979-8-89176-363-0
DOI/Link	https://doi.org/10.18653/v1/2026.mwe-1.24
Publication status	Published – 03.2026

We present IPN, our system for Subtask 1 of the PARSEME 2.0 Shared Task, which targets the identification of MWEs in 17 languages. Overall, IPN outperformed a much larger-parameter baseline model, yet a performance gap to the top-performing systems remains. To better understand these results, we investigate Qwen3-32B’s suitability for mono-, cross- and multilingual MWE identification. We also explore whether this model benefits from prepending automatically generated thinking data to the gold label during instruction-tuning. We find that target language data is vital for instruction-tuning. Prepending generated thinking data to a subset of the training data slightly improves performance for two out of three languages, but more detailed evaluation is required.

Announcements

About Us

The IPN's Departments

Research Lines

Projects

Publications

Collaboration and Networks

Open Science and Good Research Practice

Topics

Extracurricular Offers

Podcasts - Listening to Research

Teaching and Training Materials

Publication data

DOI

IPN - Leibniz Institute for Science and Mathematics Education