XSL Content

NLP Applications (II): Building Information Extraction, Question Answering and Conversational Systems

General details of the subject

Face-to-face degree course

Description and contextualization of the subject

The objective of the subject is to obtain the ability to implement applications based on language technologies/natural language processing. During the course we will learn the basic applications of Natural Language Processing that are currently used in the industry of language technology industry..

The content will focus on the following tasks:

i) Information extraction: We present advanced techniques of lexical disambiguation of multiple various linguistic levels. Disambiguation techniques include word sense disambiguation algorithms, entity linking, and recognition and classification of named entities (NERC). We will learn and implement structured information extraction algorithms, as well as semantic relation and event extraction. For this, the student will be able to use advanced techniques of Deep Learning (embeddings, transfer learning, LSTM, CNN, etc.), sequence labeling (inference, beam search, viterbi, etc.) and distant supervision.

ii) Question Answering: We present unsupervised learning techniques based on semantic textual similarity (embeddings, graph theory), and techniques based on supervised algorithms that include end-to-end methods, information retrieval, and knowledge acquisition . Language generation techniques will also be studied (e.g. language models, seq2seq). The latest advances in multimodal tasks will be studied (e.g. visual question answering)

iii) Conversational systems: We will learn the modules that define a conversational systems, and the algorithms that control the interaction between human and machine. Special emphasis will be given to the natural language comprehension module (NLU) as well as the language generation.

Teaching staff

NameInstitutionCategoryDoctorTeaching profileAreaE-mail
ALDEZABAL ROTETA, IZASKUNUniversity of the Basque CountryProfesorado AgregadoDoctorBilingualBasque
BARRENA MADINABEITIA, ANDERUniversity of the Basque CountryProfesorado Adjunto (Ayudante Doctor/A)DoctorBilingualComputer Languages and
GONZALEZ DIOS, ITZIARUniversity of the Basque CountryProfesorado AgregadoDoctorBilingualBasque
LARRAÑAGA OLAGARAY, MIGUELUniversity of the Basque CountryProfesorado AgregadoDoctorBilingualComputer Languages and
LOPEZ DE LACALLE LECUONA, OIERUniversity of the Basque CountryProfesorado Adjunto (Ayudante Doctor/A)DoctorBilingualComputer Languages and


Knowledge about existing tools for processing multiple languages (morphological, syntactic, semantic analyzers).12.0 %
Knowledge on the use of linguistic engineering techniques and resources for the implementation of applications of information extraction, question-answer systems, and conversational systems.12.0 %
Ability to understand machine learning strategies for natural language processing.12.0 %
Ability to handle tools and strategies based on knowledge for natural language processing.12.0 %
Ability to manage, adapt and improve the most relevant empirical methods for research in language technologies.12.0 %
Ability to manage and adapt existing tools for processing different languages (morphological, syntactic, semantic analyzers, etc.).12.0 %
Ability to design and implement linguistic applications for information extraction, question-answers, and parts of conversational systems.28.0 %

Study types

TypeFace-to-face hoursNon face-to-face hoursTotal hours
Applied laboratory-based groups304575

Training activities

NameHoursPercentage of classroom teaching
Computer work practice, laboratory, site visits, field trips, external visits75.040 %
Lectures37.540 %

Assessment systems

NameMinimum weightingMaximum weighting
OTROS20.0 % 20.0 %
Practical tasks40.0 % 40.0 %
Presentations20.0 % 20.0 %
Written examination20.0 % 20.0 %

Learning outcomes of the subject

Ability to implement applications based on language technologies.

Knowledge about basic tools of natural language processing.

Ability to use and code applications that use advanced methods such as Deep Learning and Machine Learning.

Knowledge to evaluate NLP applications.

Ordinary call: orientations and renunciation

1) 25% of the grading is based on class assignments: these will be typically performed in class hours, and will be handed in using e-gela. The student who hands-in approximately correct solutions to all assignments will be assigned full credit.

2) 75% of the grading is on three projects, each related to main tasks introduced in the course: each student will select or propose a subject for the project to one of the lecturers, depending on his/her interests. If a student does not propose any subject, the lecturers will assign a final project subject to him/her. The final project will be graded based on an oral presentation and written report., with the following percentages:

- write-up 30%, including features like clarity, structure,background, references, discussion

- technical 40%, including features like correctness and depth ofthe work

- presentation 30%, including clarity, structure, discussion

Extraordinary call: orientations and renunciation

In case the class assignments are missing or clearly deficient, the student will be evaluated depending on the three projects.


1. Introduction to NLP applications.

2. Information Extraction and Disambiguation techniques.

3. Question Answering systems.

4. Conversational systems.


Basic bibliography

Chris Manning and Hinrich Schütze. Foundations of Statistical Natural Language Processing, MIT Press. Cambridge, MA: May 1999.

Daniel Jurafsky, James H. Martin. Speech and Language Processing (2nd Edition), Pearson. May, 2008.

Eneko Agirre, Phillip Edmonds (Eds.). Word Sense Disambiguation: Algorithms and applications, Springer. 2007

Yoav Goldberg, Graeme Hirst. Neural Network Methods in Natural Language Processing (Synthesis Lectures on Human Language Technologies). 2017. Primer.