Voice, Audio and Transcription

Artificial intelligence tools focused on oral language processing, automatic transcription, voice synthesis and audio editing, which facilitate accessibility, the documentation of academic activities, and support for teaching, research and university management. Institutional note: These tools make audio transcription, synthesis and editing easier and contribute to academic accessibility and documentation. Their use must be carried out respecting data protection, informed consent, and the principles of responsible AI use.

OpenAI Whisper

Website / Automatic speech recognition system developed by OpenAI for the transcription of audio in multiple languages.

Primary area of use:

Cross‑cutting use
Suitable for the transcription of lectures, interviews, academic meetings and audiovisual materials.

Recommended purposes:

Automatic transcription of lectures, seminars and training sessions.
Transcription of interviews for qualitative research and later analysis.
Conversion of academic meetings and events into text for minutes or follow‑up.
Generation of subtitles for educational videos and audiovisual materials.
Support for accessibility through multilingual transcriptions and subtitles.
Transcription of audio in multiple languages for academic analysis or documentation.
Production of transcription drafts for later human editing and correction.
Indexing and search of content in recordings based on transcribed text.

Access model:

Free / Paid
Available as an open‑source model for local use and also via paid services/APIs depending on consumption.

Conditions of use:

Permitted
May be used for the transcription of content, respecting data protection regulations and participant consent.
Available features and specific terms of use may vary according to the version, mode of use (local or service) and environment configuration.
It is recommended to anonymise or protect sensitive information and review the final transcription before dissemination or archiving.

Data use:

Low
In local use, audio processing is carried out without sending data to external servers.
Data handling may vary depending on the mode of use (local or service/API) and environment configuration, in line with the provider’s conditions.

Back Up

Otter.ai

Website / Automatic transcription and note‑generation tool for meetings, lectures and conversations.

Primary area of use:

Institutional management and communication
Especially useful for preparing minutes, meeting tracking and documentation of academic sessions.

Recommended purposes:

Automatic transcription of meetings, classes and conversations in real time or from recordings.
Generation of structured notes and summaries for minutes and agreement tracking.
Identification of discussed topics and key points for quick session documentation.
Search and location of specific segments within extensive transcriptions.
Support for accessibility through transcription and subtitling of spoken content.
Organisation of transcriptions by meetings, projects or themes for document management.
Sharing notes and transcriptions for collaborative work and later review.
Creation of records and documentation of academic sessions, seminars or interviews.

Access model:

Freemium
Offers a limited free version and paid plans with advanced features.

Conditions of use:

Permitted with limitations
Must be used with the prior consent of recorded participants and in compliance with data protection regulations.
Available features and specific terms of use may vary according to the access plan or licence and account configuration.
It is recommended to avoid recording sensitive information and to review transcriptions before dissemination or archiving.

Data use:

Medium
Audio is processed on external servers in accordance with provider policies.
Data handling (storage, retention and use for service improvement) may vary depending on the plan or licence and account configuration.

Back Up

Speechify

Website / AI‑based text‑to‑speech tool that converts written documents into audio.

Primary area of use:

Learning and teaching
Especially useful for accessibility support and different learning styles.

Recommended purposes:

Conversion of academic texts into audio to facilitate study and content review.
Support for accessibility for individuals with reading difficulties or specific needs.
Listening to articles, notes and documents during complementary activities (commuting, revision).
Improvement of reading comprehension through assisted listening and speed control.
Support for reviewing one’s own texts by hearing coherence or fluency errors.
Conversion of teaching materials into alternative formats for flexible learning.
Creation of audio resources for educational guides or support materials.
Adaptation of learning to different styles with options for voice, pace and format.

Access model:

Freemium
Offers limited free access and paid plans with advanced voices and features.

Conditions of use:

Permitted
Can be used to support accessibility and individual study.
Available features and specific terms may vary according to the plan or licence and account configuration.
It is recommended to respect copyright of converted texts and avoid processing sensitive information.

Data use:

Medium
Text is processed on external servers for audio generation, in accordance with provider terms.
Data handling (storage, retention and use for service improvement) may vary depending on plan or licence and account configuration.

Back Up

Descript

Website / AI‑assisted audio and video editing platform, with transcription, text‑based editing and voice synthesis capabilities.

Primary area of use:

Cross‑cutting use
Suitable for editing educational, outreach and institutional audiovisual materials.

Recommended purposes:

Automatic transcription of audio and video for editing and accessibility.
Audio and video editing through direct modification of the transcribed text.
Automatic removal of filler words, pauses and noise to improve audio quality.
Generation of subtitles and captions for educational and outreach content.
Creation of clips and highlights for social media, micro‑content or communication.
Voice synthesis and adjustment for narration, scripts or teaching resources (as available).
Production and enhancement of podcasts, interviews and explainer videos with streamlined workflows.
Preparation of audiovisual materials in final format ready for publication or distribution.

Access model:

Freemium
Offers limited free access and paid plans with advanced functions.

Conditions of use:

Permitted with limitations
Can be used for audio and video editing, with all automatically generated content reviewed.
Available features and specific terms of use may vary according to the plan or licence and account configuration.
It is recommended to obtain consent when editing recordings of identifiable individuals and verify rights of use for materials and generated voices.

Data use:

Medium
Content is processed on external servers in accordance with provider policies.
Data handling (storage, retention and use for service improvement) may vary depending on the plan or licence and account configuration.

Back Up

Breadcrumb

Voz, audio y transcripción

OpenAI Whisper

Primary area of use:

Recommended purposes:

Access model:

Conditions of use:

Data use:

Otter.ai

Primary area of use:

Recommended purposes:

Access model:

Conditions of use:

Data use:

Speechify

Primary area of use:

Recommended purposes:

Access model:

Conditions of use:

Data use:

Descript

Primary area of use:

Recommended purposes:

Access model:

Conditions of use:

Data use: