University of Cambridge > Talks.cam > NLIP Seminar Series > MultiBLiMP: A Multilingual Benchmark of Linguistic Minimal Pairs

MultiBLiMP: A Multilingual Benchmark of Linguistic Minimal Pairs

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Suchir Salhan.

We introduce MultiBLiMP, a massively multilingual benchmark of linguistic minimal pairs, covering 101 languages, 6 linguistic phenomena and containing more than 120.000 minimal pairs. Our minimal pairs are created using a fully automated pipeline, leveraging the large-scale linguistic resources of Universal Dependencies and UniMorph. MultiBLiMP evaluates linguistic abilities of LLMs at an unprecedented multilingual scale, and highlights the shortcomings of the current state-of-the-art in modelling low-resource languages.

This talk is part of the NLIP Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2025 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity