Release of the open source transLectures-UPV Platform (TLP) version 3.2

The MLLP has released version 3.2 of the transLectures-UPV Platform (TLP) for the management of automated transcription and translation technologies in MOOC and media repositories. This new version refines and consolidates TLP’s features for the management of multilingual media subtitling, text-to-speech synthesis, text document translation, and multilingual website translation.

The transLectures-UPV Platform (TLP), developed by the Universitat Politècnica de València’s (UPV) Machine Learning and Language Processing (MLLP) research group, is an open source piece of software designed to integrate automated transcription and translation technologies into media repositories. Its main components are the TLP Database, Web Service, Ingest Service, Text Translation Editor, Media Player, and Website Translation Plugin+Editor. The MLLP is releasing TLP version 3.2, now available for download from the TLP page.

After adding support for multilingual website translation in its previous version 3.1, TLP release 3.2 refines and consolidates TLP’s features for the management of multilingual media subtitling, text-to-speech synthesis, text document translation, and multilingual website translation. The TLP Media Player now allows for the download of subtitles not only as time-synched SRT files, but as a plain text transcription TXT file; a new Web Translation Editor has been added (an embeddable tool, like the Media Player and the Translation Editor, to edit all the translations of a website); additionally, there are new related extensions to the API, improvements to the additional libraries and client tools, improvements to the documentation, and general bugfixes (for more details, see the full changelog).

A complete description of the functionalities of TLP 3.2 can be found in the TLP documentation.

The fastest way to try the MLLP’s technology is to access the MLLP transcription and translation platform, which integrates our own software TLP and TLK (the transLectures-UPV Toolkit for speech recognition), our machine translation technology, and our text-to-speech synthesis technology. Just open a trial account and start uploading media and text files to try our automatic transcription and translation technology, and even integrate it with your own media repository via our API.

The MLLP research group, made up of researchers based at the Universitat Politècnica de València (UPV), have been developing for years their expertise and tools for machine learning and language processing in the fields of educational technologies and big data within the ideal framework of the EU projects transLectures and EMMA. The MLLP has been an integral part of both projects, and within the ongoing project EMMA, TLP is being intensively used for the full transcription and translation of MOOC contents in several languages from leading European universities.

The MLLP’s TLP and TLK are also being used as the basis for automatic subtitle generation in the educational media repositories of our own Universitat Politècnica de València (Polimedia), and of the Carlos III University of Madrid (you can learn more about our projects here). So far, our technology has been used to generate multilingual subtitles for over 18 000 media files (more than 3 700 hours) and translations for over 8 700 text documents (more than 59 000 sentences).

The MLLP will be signing out from next week for the summer holidays, so happy holidays and see you again from 1 September!

In the meantime, you can download TLP 3.2, or try it in the MLLP transcription and translation platform. We’ll keep you updated on new developments and releases of our software from 1 September; just keep checking our website and follow us on Twitter, @mllpresearch!