Subject

XSL Content

Introduction to Automatic Learning

General details of the subject

Mode
Face-to-face degree course
Language
English

Description and contextualization of the subject

This is an elementary course for students without any background on data mining. First we will address elementary aspects in the areas of descriptive statistics. We will also introduce machine learning techniques, including basic data processing and the main learning algorithms. The course provides a basic application-case on computational linguistics (e.g. sentiment analysis, spam detection, etc.) to learn elementary vectorial representations for textual information and understand their limitations.

Teaching staff

NameInstitutionCategoryDoctorTeaching profileAreaE-mail
ARBELAIZ GALLEGO, OLATZUniversity of the Basque CountryProfesorado AgregadoDoctorBilingualComputer Architecture and Technologyolatz.arbelaitz@ehu.eus
PEREZ RAMIREZ, ALICIAUniversity of the Basque CountryProfesorado AgregadoDoctorBilingualComputer Languages and Systemsalicia.perez@ehu.eus

Competencies

NameWeight
Capacidad de comprender y aplicar las medidas estadísticas básicas para la descripción de características en un conjunto de datos.35.0 %
Capacidad para comprender estrategias de aprendizaje automático en el procesamiento del lenguaje humano.25.0 %
Capacidad de aplicar algoritmos clásicos para la resolución de problemas de PLN.40.0 %

Study types

TypeFace-to-face hoursNon face-to-face hoursTotal hours
Lecture-based101525
Applied computer-based groups203050

Learning outcomes of the subject

* To know the appropriate preprocessing of the input data in order to set and be able to adequately solve the classification problem.



* Learn to use specific software for classification in natural language processing tasks.



* Extract the most important features of statistical variables, such as measures of central tendency, dispersion and correlation, both for quantitative and qualitative variables.

Ordinary call: orientations and renunciation

Sistema de Evaluación Continua

Herramientas y porcentajes de calificación:

* Prueba escrita a desarrollar (%): 30

* Trabajos prácticos (%): 60

* Asistencia y participación (%): 10



Sistema de Evaluación Final

Herramientas y porcentajes de calificación:

* Prueba escrita a desarrollar (%): 100

Extraordinary call: orientations and renunciation

Sistema de Evaluación Final

Herramientas y porcentajes de calificación:

* Prueba escrita a desarrollar (%): 100

Temary

1. Introduction to Machine Learning in NLP

2. Basic descriptive statistics

3. Basic Machine Learning algorithms

4. Evaluation in supervised learning

Bibliography

Basic bibliography

R.H. Baayen (2008) Analyzing Linguistic Data. A Practical Introduction to Statistics using R. Cambridge University Press



Data Mining. Mark Hall, Ian Witten and Eibe Frank (4th Edition). TheMorgan Kaufmann, 2017.



Machine Learning for Text. Charu C. Aggarwal. Springer, 2018



Fundamentals of Predictive Text Mining (2nd Edition). Weiss, SholomM., Indurkhya, Nitin, Zhang, Tong. Springer-VerlagLondon, 2015

XSL Content

It was not possible to load the content, please try again later. In case the problem persists contact CAU (Phone: 916014400 / E-mail: cau@ehu.eus / Website: https://lagun.ehu.eus).