Subject
Speech Processing
General details of the subject
- Mode
- Face-to-face degree course
- Language
- English
Description and contextualization of the subject
The objective is for students to become familiar with the voice signal as well as the basic techniques used for its analysis and processing. For this, the fundamentals of the production of human speech and its perception by the human ear are described, to continue with the definition of the basic techniques of voice signal processing. The course includes numerous exercises and practices for students to work with these concepts.Teaching staff
Name | Institution | Category | Doctor | Teaching profile | Area | |
---|---|---|---|---|---|---|
HERNAEZ RIOJA, INMACULADA CONCEPCION | University of the Basque Country | Profesorado Catedratico De Universidad | Doctor | Not bilingual | Theory of Signals and Communications | inma.hernaez@ehu.eus |
SARATXAGA COUCEIRO, IBON | University of the Basque Country | Profesorado Agregado | Doctor | Bilingual | Telematics Engineering | ibon.saratxaga@ehu.eus |
Competencies
Name | Weight |
---|---|
Know and interpret the processes of generation and perception of the human speech. | 25.0 % |
Analyze and explain the temporal and spectral analysis of speech signals. | 25.0 % |
Manage basic computer tools for speech signal processing. | 25.0 % |
Understand and interpret the main parameters used in the representation of the speech signal. | 25.0 % |
Study types
Type | Face-to-face hours | Non face-to-face hours | Total hours |
---|---|---|---|
Lecture-based | 15 | 22.5 | 37.5 |
Applied computer-based groups | 30 | 45 | 75 |
Training activities
Name | Hours | Percentage of classroom teaching |
---|---|---|
Computer work practice, laboratory, site visits, field trips, external visits | 75.0 | 40 % |
Lectures | 37.5 | 40 % |
Assessment systems
Name | Minimum weighting | Maximum weighting |
---|---|---|
Internship Report/Summary | 30.0 % | 60.0 % |
Multiple-choice examination | 40.0 % | 70.0 % |
Learning outcomes of the subject
RA1 Describe the processes of generation and perception of the speech signalRA2 Show knowledge about the temporal and frequency representation of the speech signal and about the techniques used for its analysis
RA3 Measure and interpret the main parameters that characterize the speech signal
RA4 Manage tools for the analysis and processing of the speech signal
Ordinary call: orientations and renunciation
The evaluation is divided into two independent parts: master lessons and lab practices.- The knowledge about the master lessons will be proven by means of an individual written test based on short and multiple-choice questions (50%).
- The lab practices will be evaluated by the reports and the work developed at the lab (50%).
In the regular evaluation, the exam must be passed with almost a 4 over 10 and to succeed in the subject a minimum final grade of 5 over 10 is required, once the grades of the two parts are added up. The two parts are independent and once one part is passed, the corresponding grade is kept for future calls.
The students unable to follow the combined evaluation must justify their reasons with proper documentation sent to the subject lecturers, according to the procedure established by the current regulation in the first two weeks of the course. These students will be able to prove the achievement of the learning results by means of a final evaluation consisting of: a written exam (50%) and a laboratory exam (50%).
By not showing to the final written exam, the student refuses to take part in the call.
Extraordinary call: orientations and renunciation
In the extraordinary call, two different tests will be made: an exam for the master classes and another one for the lab practices. Each test will represent the 50% of the final note and both of them must be passed with at least a 5 over 10.The students that have not presented the written justification to elude in the continuous evaluation must prove they have correctly completed the lab practices.
Temary
1. Speech production2. Speech perception
3. Basic concepts about speech signals
4. Speech signal representation
Bibliography
Compulsory materials
Documentation provided via eGela, both for master classes and for laboratory practicesBasic bibliography
X. Huang, A. Acero, H. Hon. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall, 2001. (ISBN: 978-0130226167)L. R. Rabiner, R. W. Schafer. Digital processing of speech signal. Prentice-Hall, 1978. (ISBN: 978-0132136037)
J. G. Proakis, D. G. Manolakis. Digital signal processing. Principles, algorithms and applications (4th edition). Pearson Prentice Hall, 2007. (ISBN: 978-0131873742)
D. Jurafsky, J. H. Martin. Speech and Language Processing (2nd edition). Prentice Hall, 2008. (ISBN: 978-0131873216)
Journals
Computer Speech and LanguageSpeech Communication
IEEE Transactions on Audio Speech and Language Processing
IEEE Transactions on Multimedia
Links
Página de voz en la CMU http://www.speech.cs.cmu.edu/European Languages Resources Association (ELRA) http://www.elra.info/en/catalogues/
Linguistic Data Consortium (LDC) https://www.ldc.upenn.edu/language-resources
Smithsonian Speech Synthesis History Project (SSSHP) http://www.mindspring.com/~ssshp/ssshp_cd/ss_home.htm
Síntesis de habla emocional http://emosamples.syntheticspeech.de/