IXA group


IXAmBERT: Good news for languages with few resources!

Good news for languages with few resources!
Pre-trained Basque monolingual and multilingual language models have proven to be very useful in NLP tasks for Basque!
Even they have been created with a 500 times smaller corpus than the English one and with a 80 times smaller wikipedia.


An example of Conversational Question Answering, and its  transcription to […]

PhD Thesis: Unsupervised Machine Translation (Mikel Artetxe, 2020/07/29)

Title:  Unsupervised Machine Translation
           / Itzulpen automatiko gainbegiratu gabea

Non: Teleconference: https://eu.bbcollab.com/guest/b22b606d9ae74bc5b3e067821c897617
Faculty of informatics (UPV/EHU) Ada Lovelace room
Date: July 29, 2020, Wednesday,  11:00
Author: Mikel Artetxe Zurutuza 
Supervisors: Eneko Agirre & Gorka Labaka
Languages:  Basque (motivation, state of the art)  and English (second half, papers, conclusions, ~11:30…)




The advent of neural sequence-to-sequence models has led to impressive […]

The Ixa research group has been awarded in the artificial intelligence competition promoted by the US government related to COVID-19 disease

The competition CORD-19 (COVID-19 Open Research Dataset Challenge)  has been organized by several organizations such as Allen Institute for AI, Chan Zuckerberg Initiative, Georgetown University, Microsoft Research, National Institutes of Health and The White House Office of Science and Technology Policy. The organization has made available to the global research community more than 50,000 scientific […]

Five papers accepted at 58th annual meeting of the Association for Computational Linguistics

The members of the Ixa group and their collaborators will present five papers at 58th annual meeting of the Association for Computational Linguistics (ACL). ACL is one of the most important conferences on Natural Language Processing. It was to be held in July in Seattle, but this year it will be online.

Following, we present the […]

Eneko Agirre won for the third consecutive year the Google prize

Eneko Agirre  won again a Google prize last March. He is one of the few researchers who has obtained the Google Faculty Research Award on three occasions. The $62,000 prize will fund the project ‘Conversational Question Answering agents that learn after deployment’ to develop user dialogue systems, chatbots and artificial intelligence.

Eneko Agirre, member of […]

“Itzulbide” project: a tool for normalizing the use of Basque in clinical histories

The use of machine translation tools between languages in today’s society is common and widespread. Our Ixa group of the University of the Basque Country (UPV/EHU) has extensive experience in the Natural Language Processing for Basque. In this context, UPV-EHU and Osakidetza (The official Organization for Health in the Basque Country) in 2019 saw the […]

Meeting of LINGUATEC project in Donostia (2019-02-21)

LINGUATEC project:  Development of cross-border cooperation and knowledge transfer in language technologies.

LINGUATEC is an European project funded by FEDER via POCTEFA (Programa INTERREG V-A España-Francia-Andorra). The partners are the followings:

Elhuyar Fundazioa
Lo Congrès Permanent de la Lenga Occitana
Universidad Del País Vasco / Euskal Herriko Unibertsitatea (Ixa Taldea)
CNRS (CENTRE National de la Recherche Scientifique) – Delegation Regionale […]

Best Thesis Award in PLN (Aitor Gonzalez, 2018-09-13)

Last September Aitor Gonzalez Agirre was awarded with the best MSc thesis Award 2018 by the SEPLN association. Congratulations to Aitor and to his supervisors Eneko Agirre  and German Rigau.

Aitor is now working at the Barcelona Supercomputing Center.

The abstract of his thesis entitled “Computational Models for Semantic Textual Similarity” is the following:

Measuring semantic similarity between […]

Talk: Karelian dialects, how to study variation between closely related languages? (I. Moshnikov, 2018-06-19)

Speaker: Ilia Moshnikov
…………Karelian Institute (Joensuu)
Date: Tuesday,June 19, 2018
Time: 15:00-16:00
Place: UPV/EHUko Informatika Fakultatea, Manuel de Lardizabal 1, 20018 Donostia (map)
Title:  Variants of the active past participle in the Border Karelian dialects:
how to study variation between closely related languages?

Karelian languages (Wikipedia)

During my visit I would like to present my research interests. I will speak about my […]

Be a friend of the Minority SafePack!

We call upon the EU to adopt a set of legal acts to improve the protection of persons belonging to national and linguistic minorities and strengthen cultural and linguistic diversity in the Union. It shall include policy actions in the areas of regional and minority languages, education and culture, regional policy, participation, equality, audiovisual […]