Subject

XSL Content

NLP Applications (II): Building Information Extraction, Question Answering and Conversational Systems

General details of the subject

Mode: Face-to-face degree course
Language: English

Description and contextualization of the subject

The objective of the subject is to obtain the ability to implement applications based on language technologies/natural language processing. During the course we will learn the basic applications of Natural Language Processing that are currently used in the industry of language technology industry..

The content will focus on the following tasks:

i) Information extraction: We present advanced techniques of lexical disambiguation of multiple various linguistic levels. Disambiguation techniques include word sense disambiguation algorithms, entity linking, and recognition and classification of named entities (NERC). We will learn and implement structured information extraction algorithms, as well as semantic relation and event extraction. For this, the student will be able to use advanced techniques of Deep Learning (embeddings, transfer learning, LSTM, CNN, etc.), sequence labeling (inference, beam search, viterbi, etc.) and distant supervision.

ii) Question Answering: We present unsupervised learning techniques based on semantic textual similarity (embeddings, graph theory), and techniques based on supervised algorithms that include end-to-end methods, information retrieval, and knowledge acquisition . Language generation techniques will also be studied (e.g. language models, seq2seq). The latest advances in multimodal tasks will be studied (e.g. visual question answering)

iii) Conversational systems: We will learn the modules that define a conversational systems, and the algorithms that control the interaction between human and machine. Special emphasis will be given to the natural language comprehension module (NLU) as well as the language generation.

Teaching staff

Name	Institution	Category	Doctor	Teaching profile	Area	E-mail
ALDEZABAL ROTETA, IZASKUN	University of the Basque Country	Profesorado Agregado	Doctor	Bilingual	Basque Philology	izaskun.aldezabal@ehu.eus
BARRENA MADINABEITIA, ANDER	University of the Basque Country	Profesorado Adjunto (Ayudante Doctor/A)	Doctor	Bilingual	Computer Languages and Systems	ander.barrena@ehu.eus
GONZALEZ DIOS, ITZIAR	University of the Basque Country	Profesorado Agregado	Doctor	Bilingual	Basque Philology	itziar.gonzalezd@ehu.eus
LARRAÑAGA OLAGARAY, MIGUEL	University of the Basque Country	Profesorado Agregado	Doctor	Bilingual	Computer Languages and Systems	mikel.larranaga@ehu.eus
LOPEZ DE LACALLE LECUONA, OIER	University of the Basque Country	Profesorado Adjunto (Ayudante Doctor/A)	Doctor	Bilingual	Computer Languages and Systems	oier.lopezdelacalle@ehu.eus

Competencies

Name	Weight
Knowledge about existing tools for processing multiple languages (morphological, syntactic, semantic analyzers).	12.0 %
Knowledge on the use of linguistic engineering techniques and resources for the implementation of applications of information extraction, question-answer systems, and conversational systems.	12.0 %
Ability to understand machine learning strategies for natural language processing.	12.0 %
Ability to handle tools and strategies based on knowledge for natural language processing.	12.0 %
Ability to manage, adapt and improve the most relevant empirical methods for research in language technologies.	12.0 %
Ability to manage and adapt existing tools for processing different languages (morphological, syntactic, semantic analyzers, etc.).	12.0 %
Ability to design and implement linguistic applications for information extraction, question-answers, and parts of conversational systems.	28.0 %

Study types

Type	Face-to-face hours	Non face-to-face hours	Total hours
Lecture-based	15	22.5	37.5
Applied laboratory-based groups	30	45	75

Training activities

Name	Hours	Percentage of classroom teaching
Computer work practice, laboratory, site visits, field trips, external visits	75.0	40 %
Lectures	37.5	40 %

Assessment systems

Name	Minimum weighting	Maximum weighting
OTROS	20.0 %	20.0 %
Practical tasks	40.0 %	40.0 %
Presentations	20.0 %	20.0 %
Written examination	20.0 %	20.0 %

Learning outcomes of the subject

Ability to implement applications based on language technologies.

Knowledge about basic tools of natural language processing.

Ability to use and code applications that use advanced methods such as Deep Learning and Machine Learning.

Knowledge to evaluate NLP applications.

Ordinary call: orientations and renunciation

1) 25% of the grading is based on class assignments: these will be typically performed in class hours, and will be handed in using e-gela. The student who hands-in approximately correct solutions to all assignments will be assigned full credit.

2) 75% of the grading is on three projects, each related to main tasks introduced in the course: each student will select or propose a subject for the project to one of the lecturers, depending on his/her interests. If a student does not propose any subject, the lecturers will assign a final project subject to him/her. The final project will be graded based on an oral presentation and written report., with the following percentages:

- write-up 30%, including features like clarity, structure,background, references, discussion

- technical 40%, including features like correctness and depth ofthe work

- presentation 30%, including clarity, structure, discussion

Extraordinary call: orientations and renunciation

In case the class assignments are missing or clearly deficient, the student will be evaluated depending on the three projects.

Temary

1. Introduction to NLP applications.

2. Information Extraction and Disambiguation techniques.

3. Question Answering systems.

4. Conversational systems.

Bibliography

Basic bibliography

Chris Manning and Hinrich Schütze. Foundations of Statistical Natural Language Processing, MIT Press. Cambridge, MA: May 1999.

Daniel Jurafsky, James H. Martin. Speech and Language Processing (2nd Edition), Pearson. May, 2008.

Eneko Agirre, Phillip Edmonds (Eds.). Word Sense Disambiguation: Algorithms and applications, Springer. 2007

Yoav Goldberg, Graeme Hirst. Neural Network Methods in Natural Language Processing (Synthesis Lectures on Human Language Technologies). 2017. Primer. http://u.cs.biu.ac.il/~yogo/nnlp.pdf

ACL: https://aclweb.org/anthology/

Search Bar

Erasmus Mundus Master in Language and Communication Technologies (LCT)

Subject

XSL Content

NLP Applications (II): Building Information Extraction, Question Answering and Conversational Systems

General details of the subject

Description and contextualization of the subject

Teaching staff

Competencies

Study types

Training activities

Assessment systems

Learning outcomes of the subject

Ordinary call: orientations and renunciation

Extraordinary call: orientations and renunciation

Temary

Bibliography

Basic bibliography

Search Bar

Breadcrumb

Subject

XSL Content

NLP Applications (II): Building Information Extraction, Question Answering and Conversational Systems

General details of the subject

Description and contextualization of the subject

Teaching staff

Competencies

Study types

Training activities

Assessment systems

Learning outcomes of the subject

Ordinary call: orientations and renunciation

Extraordinary call: orientations and renunciation

Temary

Bibliography

Basic bibliography