XSL Content

Advanced Techniques in Natural Language Processing28285

Centre
Faculty of Informatics
Degree
Grado en Inteligencia Artficial
Academic course
2023/24
Academic year
4
No. of credits
6
Languages
English
Code
28285

TeachingToggle Navigation

Distribution of hours by type of teaching
Study typeHours of face-to-face teachingHours of non classroom-based work by the student
Lecture-based2030
Applied laboratory-based groups4060

Teaching guideToggle Navigation

Description and Contextualization of the SubjectToggle Navigation

Language is the most powerful tool created by human intelligence. We use language, our natural means of communication, to encode, store, transmit, share and manipulate information. In fact, most of the digital information available is unstructured information in the form of documents written or spoken in multiple languages, which represents a challenge for any organization that wants to exploit and process this information. Natural Language Processing (NLP)is thus at the center of our efforts to develop Artificial Intelligence (AI), and vice versa.



This subject is a follow-up of two subjects: Language Processing and Textual Data Mining. It focuses on the latest developments in NLP like, for instance, generative Large Language Models (LLM) like ChatGPT that are currently disrupting several areas of our societies. The student will be able to understand LLMs and use and evaluate them in specific applications. The course follows an active methodology, with group readings, presentations, discussions and a capstone project designed by the student.



The course is taught in English.



Skills/Learning outcomes of the subjectToggle Navigation

1. Understand the main concepts of large language models, and have a global vision of the area.

2. Understand and use large language models.

3. Understand the structure of applications that use large language models.

4. Analyze a specific application and identify the options and advantages of using large language models to deal with it.

5. Design and implement a specific application that uses large language models.

6. Design and carry out the necessary experiments to evaluate the performance of the specific application using empirical methods.



M011CE39 Ability and knowledge to design, build, use and evaluate robust systems for the NLP.

M011CE40 Master the different stages of development of an advanced artificial intelligence project: analysis, design, implementation, testing, presentation and evaluation of an advanced artificial intelligence project applied to a real problem.

Theoretical and practical contentToggle Navigation



This course introduces advanced PLN concepts and algorithms, focusing on large language models.



Topic 1: Language-centric AI. Introduction to empirical methods in NLP.



The first topic quickly reviews several topics, as follows: Scientific method applied to PLN. Knowledge-based systems, statistical systems, machine learning-based systems, and large language models. Supervised and unsupervised learning in PLN. Evaluation of PLN systems. Tasks and datasets. Benchmarks.



Topic 2: Advanced NLP techniques.



A review of Large language models. Creation, adaptation, prompts, alignment to user values and evaluation.



Topic 3: Advanced applications.



A review of the applications of large language models will be covered, including, among other, conditional text generation, question-answering systems, chatbots, summary generation, machine translation, etc.



In parallel to the theoretical topics, a practical capstone project is started, which will be developed throughout the entire course.

MethodologyToggle Navigation

Through readings, presentations, assignments, and a capstone project, students will learn the skills necessary to design, implement, evaluate and understand a state-of-the-art NLP system.



The objective of the project is to be able to apply what has been learned during the degree to several applications, and to work the creativity of the students.



Active methodologies will be used throughout the course. That is, the students will participate actively both in class and in the laboratory.

Assessment systemsToggle Navigation

  • Continuous Assessment System
  • Final Assessment System
  • Tools and qualification percentages:
    • Realization of Practical Work (exercises, cases or problems) (%): 20
    • Individual works (%): 30
    • Team projects (problem solving, project design)) (%): 50

Ordinary Call: Orientations and DisclaimerToggle Navigation

The UPV/EHU Protocol on academic ethics and prevention of dishonest or fraudulent practices in assessment tests and academic work will be applied.



The subject has two modes of evaluation: continuous evaluation and final evaluation.



The pre-registration in the continuous evaluation mode will be done at the beginning of the course. The pre-registration will become final after the confirmation of the application by the student on the dates established (between 60% and 80% of the course) and after verification of the partial performance by the teaching staff. If, on the aforementioned dates, the student does not confirm their final enrollment in continuous assessment, it will be understood that they renounce it.





CONTINUOUS ASSESSMENT



- 20% Realization of practices, cases or problems

- 30% Presentation and active participation in class: readings, bibliographic review, presentations and discussion.

- 50% Capstone project



All tests and activities mentioned for continuous assessment are mandatory.



The conditions to pass the continuous assessment are:

- Presentation and active participation in the seminar.

- Get 40% of the max. score in the practices

- Get 40% of the max. score in presentation and active participation in class.

- Get 40% of the max. score in the capstone project

- Get, after a weighted average, 50% or more of the max. score.



If the student does not express their waiver of continuous assessment within the given dates and does not meet any of the conditions to pass the continuous assessment, he/she will fail the subject in the ordinary call and may opt for the extraordinary call. If the student did not submit the final project, he/she would be assessed as a no-submit.







FINAL EVALUATION



20% Carrying out practices, cases or problems

30% written test

50% the capstone project



In order to take the written test, you must have previously handed in the individual practice. All the mentioned tests and activities are compulsory.



The conditions to pass the final evaluation are:

- Get at least 40% of the max. score in the written test

- Get 40% of the max. score in the practices

- Get 50% of the max. score in the capstone project

- Get, after a weighted average, 50% or more of the max. score.



If the student does not take the written test, it is understood that she waives the evaluation. If she/he did not submit the practices and project, he/she would be assessed as a no-submit.

Extraordinary Call: Orientations and DisclaimerToggle Navigation

FINAL EVALUATION



20% Carrying out practices, cases or problems

30% written test

50% the capstone project



In order to take the written test, you must have previously handed in the individual practice. All the mentioned tests and activities are compulsory.



The conditions to pass the final evaluation are:

- Get at least 40% of the max. score in the written test

- Get 40% of the max. score in the practices

- Get 50% of the max. score in the capstone project

- Get, after a weighted average, 50% or more of the max. score.



If the student does not take the written test, it is understood that she waives the evaluation. If she did not submit the practices and project, she would be assessed as a no-submit.



Compulsory materialsToggle Navigation

The material will be available on the virtual platform (eGela).

BibliographyToggle Navigation

Basic bibliography

Jurafsky D., Martin J.H. Speech and Language Processing (3rd edition draft). An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, 2018. https://web.stanford.edu/~jurafsky/slp3/



Lewis Tunstall, Leandro von Werra, Thomas Wolf. Natural Language Processing with Transformers, Revised Edition. O'Reilly Media, Inc. ISBN: 9781098136796

Web addresses

https://github.com/Hannibal046/Awesome-LLM

GroupsToggle Navigation

61 Teórico (English - Mañana)Show/hide subpages

Calendar
WeeksMondayTuesdayWednesdayThursdayFriday
1-15

09:00-10:30 (1)

10:30-12:00 (2)

Teaching staff

61 Applied laboratory-based groups-1 (English - Mañana)Show/hide subpages

Calendar
WeeksMondayTuesdayWednesdayThursdayFriday
1-15

12:00-13:30 (1)

Teaching staff