2017
Villar Lafuente, Carlos; Garcés Díaz-Munío, Gonçal: "Several approaches for tweet topic classification in COSET – IberEval 2017". Inproceedings, Proc. of the 2nd Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017), pp. 36–42, Murcia (Spain), 2017.
Links: http://hdl.handle.net/10251/166361 ; http://ceur-ws.org/Vol-1881/COSET_paper_4.pdf
Tags: COSET2017, language models, linear models, neural networks, sentence embeddings, text classification
Abstract: These working notes summarize the different approaches we explored to classify a corpus of tweets related to the 2015 Spanish General Election (COSET 2017 task from IberEval 2017). Two approaches were tested during the COSET 2017 evaluations: Neural Networks with Sentence Embeddings (based on TensorFlow) and N-gram Language Models (based on SRILM). Our results with these approaches were modest: both ranked above the "Most frequent" baseline, but below the "Bag-of-words + SVM" baseline. A third approach was tried after the COSET 2017 evaluation phase was over: Advanced Linear Models (based on fastText). Results measured on the COSET 2017 Dev and Test sets show that this approach is well above the "TF-IDF + RF" baseline.
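A minimal sketch (not the authors' code) of the kind of fastText supervised classifier the abstract describes as the third approach; the training-file name, label names and hyperparameters below are illustrative assumptions.

import fasttext

# Training data: one tweet per line, prefixed with its topic label,
# e.g. "__label__topic1 El debate de esta noche ..."
# ("coset_train.txt" and the label names are hypothetical).
model = fasttext.train_supervised(
    input="coset_train.txt",
    epoch=25,          # illustrative hyperparameters, not the paper's
    lr=0.5,
    wordNgrams=2,      # word bigrams often help with short tweets
    dim=100,
)

labels, probs = model.predict("El debate de esta noche decidirá mi voto")
print(labels[0], float(probs[0]))   # predicted topic and its probability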
Piqueras, Santiago; Pérez, Alejandro; Turró, Carlos; Jiménez, Manuel; Sanchis, Albert; Civera, Jorge; Juan, Alfons: "Hacia la traducción integral de vídeo charlas educativas" [Towards the integral translation of educational video lectures]. Inproceedings, Proc. of III Congreso Nacional de Innovación Educativa y Docencia en Red (IN-RED 2017), pp. 117–124, València (Spain), 2017.
Links: http://ocs.editorial.upv.es/index.php/INRED/INRED2017/paper/view/6812
Tags: MOOCs, multilingual, translation
Abstract: More and more universities and educational institutions are investing in the production of technological resources for different uses in higher education. The MLLP research group has been working closely with the ASIC at UPV to enrich educational multimedia resources through the use of machine learning technologies, such as automatic speech recognition, machine translation and text-to-speech synthesis. In this work, developed under the framework of the UPV's Plan de Docencia en Red 2016-17, we present the application of innovative technologies to achieve the integral translation of educational videos.
2016
Silvestre-Cerdà, Joan Albert; Juan, Alfons; Civera, Jorge: "Different Contributions to Cost-Effective Transcription and Translation of Video Lectures". Inproceedings, Proc. of IX Jornadas en Tecnología del Habla and V Iberian SLTech Workshop (IberSpeech 2016), pp. 313–319, Lisbon (Portugal), 2016. ISBN: 978-3-319-49168-4.
Links: http://www.mllp.upv.es/wp-content/uploads/2016/11/poster.pdf ; http://www.mllp.upv.es/wp-content/uploads/2016/11/paper.pdf ; http://hdl.handle.net/10251/62194
Tags: Automatic Speech Recognition, Automatic transcription and translation, Machine Translation, Video Lectures
Abstract: In recent years, on-line multimedia repositories have experienced a strong growth that has consolidated them as essential knowledge assets, especially in the area of education, where large repositories of video lectures have been built in order to complement or even replace traditional teaching methods. However, most of these video lectures are neither transcribed nor translated due to a lack of cost-effective solutions that give accurate enough results. Solutions of this kind are clearly necessary to make these lectures accessible to speakers of different languages and to people with hearing disabilities, among many other benefits and applications. For this reason, the main aim of this thesis is to develop a cost-effective solution capable of transcribing and translating video lectures to a reasonable degree of accuracy. More specifically, we address the integration of state-of-the-art techniques in Automatic Speech Recognition and Machine Translation into large video lecture repositories to generate high-quality multilingual video subtitles without human intervention and at a reduced computational cost. We also explore the potential benefits of exploiting the information that we know a priori about these repositories, that is, lecture-specific knowledge such as speaker, topic or slides, to create specialised, in-domain transcription and translation systems by means of massive adaptation techniques. The proposed solutions have been tested in real-life scenarios by carrying out several objective and subjective evaluations, obtaining very positive results. The main outcome derived from this multidisciplinary thesis, the transLectures-UPV Platform, has been publicly released as open-source software and, at the time of writing, is serving automatic transcriptions and translations for several thousand video lectures in many Spanish and European universities and institutions.
del-Agua, Miguel Ángel; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons: "ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks". Inproceedings, Proc. of the 17th Annual Conf. of the ISCA (Interspeech 2016), pp. 3464–3468, San Francisco (USA), 2016.
Links: DOI: 10.21437/Interspeech.2016-1142
Tags: BLSTM, Confidence measures, Recurrent Neural Networks, Speaker adaptation, Speech Recognition
Abstract: Confidence estimation for automatic speech recognition has recently been improved by using Recurrent Neural Networks (RNNs), and also by speaker adaptation (on the basis of Conditional Random Fields). In this work, we explore how to obtain further improvements by combining RNNs and speaker adaptation. In particular, we explore different speaker-dependent and -independent data representations for Bidirectional Long Short-Term Memory RNNs of various topologies. Empirical tests are reported on the LibriSpeech dataset, showing that the best results are achieved by the proposed combination of RNNs and speaker adaptation.
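As a rough illustration of the model family used in this paper, the sketch below builds a small bidirectional LSTM (BLSTM) that maps a sequence of per-word feature vectors to a per-word confidence score in [0, 1]. It is a generic Keras sketch with toy dimensions and random placeholder data; the paper's actual architecture, features and speaker-adaptation scheme are not reproduced here.

import numpy as np
from tensorflow.keras import layers, models

T, F = 20, 32   # toy sizes: words per utterance, features per word

model = models.Sequential([
    layers.Input(shape=(T, F)),
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
    layers.TimeDistributed(layers.Dense(1, activation="sigmoid")),
])
model.compile(optimizer="adam", loss="binary_crossentropy")

# x: (batch, T, F) per-word features; y: (batch, T, 1) with 1 = word
# correctly recognised, 0 = recognition error (random placeholders here).
x = np.random.rand(8, T, F).astype("float32")
y = np.random.randint(0, 2, size=(8, T, 1)).astype("float32")
model.fit(x, y, epochs=1, verbose=0)
confidences = model.predict(x)   # per-word confidence scores in [0, 1]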
Silvestre-Cerdà, Joan Albert: "Different Contributions to Cost-Effective Transcription and Translation of Video Lectures". PhD Thesis, Universitat Politècnica de València, 2016. (Advisors: Alfons Juan Ciscar and Jorge Civera Saiz.)
Links: http://hdl.handle.net/10251/62194 ; http://www.mllp.upv.es/wp-content/uploads/2016/01/slides.pdf ; http://www.mllp.upv.es/wp-content/uploads/2016/01/thesis.pdf ; http://www.mllp.upv.es/phd-thesis-different-contributions-to-cost-effective-transcription-and-translation-of-video-lectures-by-joan-albert-silvestre-cerda-abstract/
Tags: Automatic Speech Recognition, Education, Language Technologies, Machine Translation, Massive Adaptation, Multilingualism, video lecture repositories, Video Lectures
Abstract: In recent years, online multimedia repositories have experienced a strong growth that has consolidated them as essential knowledge assets, especially in the area of education, where large repositories of video lectures have been built in order to complement or even replace traditional teaching methods. However, most of these video lectures are neither transcribed nor translated due to a lack of cost-effective solutions that provide accurate enough results. Solutions of this kind are clearly necessary to make these lectures accessible to speakers of different languages and to people with hearing disabilities. They would also facilitate lecture searchability and analysis functions, such as classification, recommendation or plagiarism detection, as well as the development of advanced educational functionalities like content summarisation to assist student note-taking. For this reason, the main aim of this thesis is to develop a cost-effective solution capable of transcribing and translating video lectures to a reasonable degree of accuracy. More specifically, we address the integration of state-of-the-art techniques in Automatic Speech Recognition and Machine Translation into large video lecture repositories to generate high-quality multilingual video subtitles without human intervention and at a reduced computational cost. We also explore the potential benefits of exploiting the information that we know a priori about these repositories, that is, lecture-specific knowledge such as speaker, topic or slides, to create specialised, in-domain transcription and translation systems by means of massive adaptation techniques. The proposed solutions have been tested in real-life scenarios by carrying out several objective and subjective evaluations, obtaining very positive results. The main technological outcome derived from this thesis, the transLectures-UPV Platform (TLP), has been publicly released as open-source software and, at the time of writing, is serving automatic transcriptions and translations for several thousand video lectures in Spanish and European universities and institutions.
Valor Miró, Juan Daniel; Turró, C.; Civera, J.; Juan, A.: "Generación eficiente de transcripciones y traducciones automáticas en poliMedia" [Efficient generation of automatic transcriptions and translations in poliMedia]. Inproceedings, Proc. of II Congreso Nacional de Innovación Educativa y Docencia en Red (IN-RED 2016), pp. 21–29, València (Spain), 2016.
Links: http://dx.doi.org/10.4995/INRED2016.2016.4276
Tags: Docencia en Red, e-learning, transcription, translation, video
Abstract: The use of educational videos in higher education has increased quickly for several educational applications, which has led to platforms and services such as poliMèdia at the Universitat Politècnica de València (UPV), which enables the creation, publication and dissemination of this educational multimedia content. Through various research projects, and specifically the EU project transLectures, the UPV has implemented a system that automatically generates subtitles in various languages for all poliMèdia videos. These subtitles are created by an automatic speech recognition and machine translation system that provides high accuracy in both recognition and translation of the main European languages. Transcriptions and translations are not only used to improve accessibility, but also enable the search and retrieval of video contents within the video portal. Thus, a user can locate the video, and the time within it, where a certain word is said, for later viewing. In this article we also extend previous work on the assessment of the review process, including transcription of French and translation from Spanish into Catalan.
Sanchez-Cortina, Isaias; Andrés-Ferrer, Jesús; Sanchis, Alberto; Juan, Alfons: "Speaker-adapted confidence measures for speech recognition of video lectures". Journal Article, Computer Speech & Language, 37, pp. 11–23, 2016. ISSN: 0885-2308.
Links: http://www.sciencedirect.com/science/article/pii/S0885230815000960 ; http://authors.elsevier.com/a/1SAsB39HpSHRc0
Tags: Confidence measures, Log-linear models, Online video lectures, Speaker adaptation, Speech Recognition
Abstract: Automatic Speech Recognition applications can benefit from a confidence measure (CM) to predict the reliability of the output. Previous works showed that a word-dependent naïve Bayes (NB) classifier outperforms the conventional word posterior probability as a CM. However, a discriminative formulation usually renders improved performance due to the available training techniques. Taking this into account, we propose a logistic regression (LR) classifier defined with simple input functions to approximate the NB behaviour. Additionally, as a main contribution, we propose to adapt the CM to the speaker in cases in which it is possible to identify the speakers, such as online lecture repositories. The experiments have shown that speaker-adapted models outperform their non-adapted counterparts on two difficult tasks from English (videoLectures.net) and Spanish (poliMedia) educational lectures. They have also shown that the NB model is clearly superseded by the proposed LR classifier.
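The discriminative idea in this article can be illustrated in a few lines of scikit-learn: a logistic regression classifier over simple input functions of the word posterior, trained to predict whether each recognised word is correct. The synthetic data and the two features below are illustrative assumptions, not the paper's exact input functions; speaker adaptation would roughly correspond to estimating or interpolating such a classifier per speaker.

import numpy as np
from sklearn.linear_model import LogisticRegression

# One row per recognised word: features are the word posterior and its
# logarithm; the label is 1 if the word was correctly recognised.
rng = np.random.default_rng(0)
posterior = rng.uniform(0.01, 1.0, size=1000)
X = np.column_stack([posterior, np.log(posterior)])
y = (posterior + 0.1 * rng.normal(size=1000) > 0.5).astype(int)  # toy labels

clf = LogisticRegression().fit(X, y)
confidence = clf.predict_proba(X)[:, 1]   # per-word confidence in [0, 1]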
Sánchez-Cortina, Isaías: "Confidence Measures for Automatic and Interactive Speech Recognition". PhD Thesis, Universitat Politècnica de València, 2016. (Advisors: Alfons Juan Ciscar and Alberto Sanchis Navarro.)
Links: http://hdl.handle.net/10251/61473 ; http://www.mllp.upv.es/phd-thesis-confidence-measures-for-automatic-and-interactive-speech-recognition-by-isaias-sanchez-cortina-abstract/
del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons: "The MLLP system for the 4th CHiME Challenge". Inproceedings, Proc. of the 4th Intl. Workshop on Speech Processing in Everyday Environments (CHiME 2016), pp. 57–59, San Francisco (USA), 2016.
Links: http://www.mllp.upv.es/wp-content/uploads/2017/11/DelAgua2016-The_MLLP_system_for_the_4th_CHiME_Challenge.pdf ; http://hdl.handle.net/10251/177497 ; http://spandh.dcs.shef.ac.uk/chime_workshop/chime2016/chime2016proceedings.pdf
Abstract: The MLLP's CHiME-4 system is presented in this paper. It has been built using the transLectures-UPV toolkit (TLK), developed by the MLLP research group, which makes use of state-of-the-art speech techniques. Our best system built for the CHiME-4 challenge consists of a combination of different sub-systems in order to deal with the variety of acoustic conditions. Each sub-system, in turn, follows a hybrid approach with different acoustic models, such as Deep Neural Networks or BLSTM networks.
2015
del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons: "The MLLP ASR Systems for IWSLT 2015". Inproceedings, Proc. of the 12th Intl. Workshop on Spoken Language Translation (IWSLT 2015), pp. 39–44, Da Nang (Vietnam), 2015.
Links: https://aclanthology.org/2015.iwslt-evaluation.5/
Abstract: This paper describes the Machine Learning and Language Processing (MLLP) ASR systems for the 2015 IWSLT evaluation campaign. The English system is based on the combination of five different subsystems which consist of two types of Neural Network architectures (deep feed-forward and convolutional), two types of activation functions (sigmoid and rectified linear) and two types of input features (fMLLR and FBANK). All subsystems perform a speaker adaptation step based on confidence measures, the output of which is then combined with ROVER. This system achieves a Word Error Rate (WER) of 13.3% on the official IWSLT 2015 English test set.
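Since the result above is reported as Word Error Rate, a worked definition may be useful: WER = (substitutions + deletions + insertions) / number of reference words, computed from a word-level Levenshtein alignment. A self-contained sketch:

def wer(ref, hyp):
    """Word error rate via a standard edit-distance DP over words."""
    r, h = ref.split(), hyp.split()
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i
    for j in range(len(h) + 1):
        d[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            cost = 0 if r[i - 1] == h[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution or match
    return d[len(r)][len(h)] / len(r)

print(wer("the cat sat on the mat", "the cat sat on mat"))  # 1 deletion / 6 words = 0.1667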
Valor Miró, Juan Daniel; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Turró, Carlos; Juan, Alfons: "Efficient Generation of High-Quality Multilingual Subtitles for Video Lecture Repositories". Inproceedings, Proc. of 10th European Conf. on Technology Enhanced Learning (EC-TEL 2015), pp. 485–490, Toledo (Spain), 2015. ISBN: 978-3-319-24258-3.
Links: http://link.springer.com/chapter/10.1007/978-3-319-24258-3_44 ; http://www.mllp.upv.es/wp-content/uploads/2016/03/paper.pdf
Tags: Automatic Speech Recognition, Docencia en Red, Efficient video subtitling, Polimedia, Statistical machine translation, video lecture repositories
Abstract: Video lectures are a valuable educational tool in higher education to support or replace face-to-face lectures in active learning strategies. In 2007 the Universitat Politècnica de València (UPV) implemented its video lecture capture system, resulting in a high-quality educational video repository, called poliMedia, with more than 10,000 mini lectures created by 1,373 lecturers. Also, in the framework of the European project transLectures, UPV has automatically generated transcriptions and translations in Spanish, Catalan and English for all videos included in the poliMedia video repository. transLectures' objective responds to the widely recognised need for subtitles to be provided with video lectures, as an essential service for non-native speakers and hearing-impaired persons, and to allow advanced repository functionalities. Although high-quality automatic transcriptions and translations were generated in transLectures, they were not error-free. For this reason, lecturers need to manually review video subtitles to guarantee the absence of errors. The aim of this study is to evaluate the efficiency of the manual review process from automatic subtitles in comparison with the conventional generation of video subtitles from scratch. The reported results clearly indicate the convenience of providing automatic subtitles as a first step in the generation of video subtitles, with significant time savings of up to almost 75% when reviewing subtitles.
Pérez González de Martos, Alejandro; Silvestre-Cerdà, Joan Albert; Valor Miró, Juan Daniel; Civera, Jorge; Juan, Alfons: "MLLP Transcription and Translation Platform". Miscellaneous, 2015. (Short paper for demo presentation accepted at the 10th European Conf. on Technology Enhanced Learning (EC-TEL 2015), Toledo (Spain), 2015.)
Links: http://hdl.handle.net/10251/65747 ; http://www.mllp.upv.es/wp-content/uploads/2015/09/ttp_platform_demo_ectel2015.pdf ; http://ectel2015.httc.de/index.php?id=722
Tags: Automatic Speech Recognition, Docencia en Red, Document translation, Efficient video subtitling, Machine Translation, MLLP, Post-editing, Video Lectures
Abstract: This paper briefly presents the main features of the MLLP's Transcription and Translation Platform, which uses state-of-the-art automatic speech recognition and machine translation systems to generate multilingual subtitles of educational audiovisual and textual content. It has proven to reduce user effort to as little as one third of the time needed to generate transcriptions and translations from scratch.
Valor Miró, Juan Daniel; Turró, C.; Civera, J.; Juan, A.: "Evaluación de la revisión de transcripciones y traducciones automáticas de vídeos poliMedia" [Evaluation of the review of automatic transcriptions and translations of poliMedia videos]. Inproceedings, Proc. of I Congreso Nacional de Innovación Educativa y Docencia en Red (IN-RED 2015), pp. 464–468, València (Spain), 2015.
Links: http://hdl.handle.net/10251/52755 ; http://www.mllp.upv.es/wp-content/uploads/2015/06/1574-3087-1-PB.pdf
Tags: Docencia en Red, user evaluations, Polimedia, translations, transcriptions
Khoury, Ihab; Giménez, Adrià; Juan, Alfons; Andrés-Ferrer, Jesús: "Window Repositioning for Printed Arabic Recognition". Journal Article, Pattern Recognition Letters, 51, pp. 86–93, 2015. ISSN: 0167-8655.
Links: http://dx.doi.org/10.1016/j.patrec.2014.08.009
Tags: Bernoulli HMMs, Printed Arabic Recognition, Repositioning, Sliding window
Abstract: Bernoulli HMMs are conventional HMMs in which the emission probabilities are modeled with Bernoulli mixtures. They have recently been applied, with good results, to off-line text recognition in many languages, in particular Arabic. A key idea that has proven very effective in this application of Bernoulli HMMs is the use of a sliding window of adequate width for feature extraction. This idea has allowed us to obtain very competitive results in the recognition of both Arabic handwriting and printed text. Indeed, a system based on it ranked first at the ICDAR 2011 Arabic recognition competition on the Arabic Printed Text Image (APTI) database. More recently, this idea has been refined by using repositioning techniques for extracted windows, leading to further improvements in Arabic handwriting recognition. In the case of printed text, this refinement led to an improved system which ranked second at the ICDAR 2013 second competition on APTI, only at a marginal distance from the best system. In this work, we describe the development of this improved system. Following evaluation protocols similar to those of the competitions on APTI, exhaustive experiments are detailed, from which state-of-the-art results are obtained.
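To illustrate the sliding-window feature extraction that this abstract credits as the key idea, the sketch below turns a binarised text-line image into a sequence of binary observation vectors, one per column position, of the kind a Bernoulli (mixture) HMM can emit. The window width and zero-padding policy are illustrative assumptions; repositioning would additionally re-centre each window vertically (for instance on its centre of mass) before flattening.

import numpy as np

def sliding_windows(binary_image, width=9):
    """One binary feature vector per column: the flattened contents of a
    width-column window centred on that column (zero-padded at the edges)."""
    H, W = binary_image.shape
    pad = width // 2
    padded = np.pad(binary_image, ((0, 0), (pad, pad)))
    return np.stack([padded[:, j:j + width].flatten() for j in range(W)])

line = (np.random.rand(30, 100) > 0.5).astype(np.uint8)  # toy binarised line
obs = sliding_windows(line)
print(obs.shape)   # (100, 270): a sequence of 100 binary vectors of size 270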
Khoury, Ihab: "Arabic Text Recognition and Machine Translation". PhD Thesis, Universitat Politècnica de València, 2015. (Advisors: Alfons Juan Ciscar and Jesús Andrés Ferrer.)
Links: http://hdl.handle.net/10251/53029 ; http://www.mllp.upv.es/phd-thesis-arabic-text-recognition-and-machine-translation-by-ihab-khoury-abstract/
Brouns, Francis; Serrano Martínez-Santos, Nicolás; Civera, Jorge; Kalz, Marco; Juan, Alfons: "Supporting language diversity of European MOOCs with the EMMA platform". Inproceedings, Proc. of the European MOOC Stakeholder Summit (EMOOCs 2015), pp. 157–165, Mons (Belgium), 2015.
Links: http://www.emoocs2015.eu/node/55
Tags: Automatic Speech Recognition, EMMA, Statistical machine translation
Abstract: This paper introduces the cross-language support of the EMMA MOOC platform. Based on a discussion of language diversity in Europe, we introduce the development and evaluation of automated translation of texts and subtitling of videos from Dutch into English. The development of an Automatic Speech Recognition (ASR) system and a Statistical Machine Translation (SMT) system is described. The resources employed and the evaluation approach are introduced. Initial evaluation results are presented. Finally, we provide an outlook into future research and development.
Valor Miró, Juan Daniel; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Turró, Carlos; Juan, Alfons: "Efficiency and usability study of innovative computer-aided transcription strategies for video lecture repositories". Journal Article, Speech Communication, 74, pp. 65–75, 2015. ISSN: 0167-6393.
Links: http://www.sciencedirect.com/science/article/pii/S0167639315001016 ; http://www.mllp.upv.es/wp-content/uploads/2016/03/paper1.pdf
Tags: Automatic Speech Recognition, Computer-assisted transcription, Interface design strategies, Usability study, video lecture repositories
Abstract: Video lectures are widely used in education to support and complement face-to-face lectures. However, the utility of these audiovisual assets could be further improved by adding subtitles that can be exploited to incorporate added-value functionalities such as searchability, accessibility, translatability, note-taking, and discovery of content-related videos, among others. Today, automatic subtitles are prone to error and need to be reviewed and post-edited in order to ensure that what students see on-screen is of an acceptable quality. This work investigates different user interface design strategies for this post-editing task to discover the best way to incorporate automatic transcription technologies into large educational video repositories. Our three-phase study involved lecturers from the Universitat Politècnica de València (UPV) with videos available on the poliMedia video lecture repository, which currently holds over 10,000 video objects. Simply by post-editing automatic transcriptions in the conventional way, users needed almost half the time that would be required to generate the transcription from scratch. As expected, this study revealed that the time spent by lecturers reviewing automatic transcriptions correlated directly with the accuracy of said transcriptions. However, it is also shown that the average time required to perform each individual editing operation could be precisely derived and applied in the definition of a user model. In addition, the second phase of this study presents a transcription review strategy based on confidence measures (CM) and compares it to the conventional post-editing strategy. Finally, a third strategy, resulting from the combination of the CM-based strategy with massive adaptation techniques for automatic speech recognition (ASR), managed to improve the transcription review efficiency in comparison with the two aforementioned strategies.
2014
Piqueras Gozalbes, Santiago Romualdo: "Applying Machine Learning technologies to the synthesis of video lectures". Masters Thesis, Universitat Politècnica de València, 2014.
Links: http://hdl.handle.net/10251/53367
Abstract: Machine learning technologies have been applied and compared for the problem of training voice synthesis systems for subtitles in Spanish and English. A voice synthesis system in both languages has been developed for the video lecture platform poliMedia.
Valor Miró, Juan Daniel; Spencer, R.N.; Pérez González de Martos, A.; Garcés Díaz-Munío, G.; Turró, C.; Civera, J.; Juan, A.: "Evaluación del proceso de revisión de transcripciones automáticas para vídeos Polimedia" [Evaluation of the automatic transcription review process for Polimedia videos]. Inproceedings, Proc. of I Jornadas de Innovación Educativa y Docencia en Red (IN-RED 2014), pp. 272–278, València (Spain), 2014.
Links: http://hdl.handle.net/10251/40404 ; http://dx.doi.org/10.4995/INRED.2014 ; http://www.mllp.upv.es/wp-content/uploads/2015/04/paper1.pdf ; https://www.mllp.upv.es/wp-content/uploads/2019/09/poster.pdf
Tags: ASR, Docencia en Red, evaluations, Polimedia, transcriptions
Abstract: Video lectures are a tool of proven value and wide acceptance in universities, which is giving rise to video lecture platforms such as poliMèdia (Universitat Politècnica de València). transLectures is an EU project that generates high-quality automatic transcriptions and translations for the poliMèdia platform, and improves them by using massive adaptation and intelligent interaction techniques. In this paper we present the evaluation with lecturers carried out under the Docència en Xarxa 2012-2013 action plan with the aim of studying the process of transcription post-editing, in contrast with transcribing from scratch.
Serrano Martínez-Santos, Nicolás: "Interactive Transcription of Old Text Documents". PhD Thesis, Universitat Politècnica de València, 2014. (Advisors: Alfons Juan Ciscar and Jorge Civera Saiz.)
Links: http://hdl.handle.net/10251/37979
Giménez Pastor, Adrià Bernoulli HMMs for Handwritten Text Recognition PhD Thesis Universitat Politècnica de València, 2014, (Advisors: Alfons Juan Ciscar and Jesús Andrés Ferrer).
Links: http://hdl.handle.net/10251/37978 |
Alabau Gonzalvo, Vicent Multimodal interactive structured prediction PhD Thesis Universitat Politècnica de València, 2014, (Advisors: Francisco Casacuberta Nolla and Alberto Sanchis Navarro).
Links: http://hdl.handle.net/10251/35135 |
Serrano, Nicolás; Civera, Jorge; Sanchis, Alberto; Juan, A. Effective balancing error and user effort in interactive handwriting recognition Journal Article Pattern Recognition Letters, 37, pp. 135–142, 2014.
Links: http://dx.doi.org/10.1016/j.patrec.2013.03.010 |
Alabau, Vicent; Sanchis, Alberto; Casacuberta, Francisco Improving on-line handwritten recognition in interactive machine translation Journal Article Pattern Recognition, 47(3), pp. 1217–1228, 2014.
Links: http://dx.doi.org/10.1016/j.patcog.2013.09.035 |
Serrano, Nicolás; Giménez, Adrià; Civera, Jorge; Sanchis, Alberto; Juan, Alfons Interactive Handwriting Recognition with Limited User Effort Journal Article Intl. Journal on Document Analysis and Recognition (IJDAR), 17, pp. 47–59, 2014.
Links: http://dx.doi.org/10.1007/s10032-013-0204-5 |
Giménez, Adrià; Andrés-Ferrer, Jesús; Juan, Alfons Discriminative Bernoulli HMMs for isolated handwritten word recognition Journal Article Pattern Recognition Letters, 35(0), pp. 157–168, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).
Links: http://dx.doi.org/10.1016/j.patrec.2013.05.016
Tags: RIMES |
Giménez, Adrià; Khoury, Ihab; Andrés-Ferrer, Jesús; Juan, Alfons Handwriting word recognition using windowed Bernoulli HMMs Journal Article Pattern Recognition Letters, 35(0), pp. 149–156, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).
Links: http://dx.doi.org/10.1016/j.patrec.2012.09.002 http://hdl.handle.net/10251/37326
Tags: Sliding window |
Martínez-Villaronga, A.; del-Agua, M. A.; Silvestre-Cerdà, J. A.; Andrés-Ferrer, J.; Juan, A. Language model adaptation for lecture transcription by document retrieval Inproceedings Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.
Links: http://www.mllp.upv.es/wp-content/uploads/2015/04/ibsp14-cameraReady.pdf |
Pérez-González-de-Martos, A.; Silvestre-Cerdà, J. A.; Rihtar, M.; Juan, A.; Civera, J. Using Automatic Speech Transcriptions in Lecture Recommendation Systems Inproceedings Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.
Links: http://www.mllp.upv.es/wp-content/uploads/2015/04/lavie_is2014_camready1.pdf |
Valor Miró, Juan Daniel; Spencer, R. N.; Pérez González de Martos, A.; Garcés Díaz-Munío, G.; Turró, C.; Civera, J.; Juan, A. Evaluating intelligent interfaces for post-editing automatic transcriptions of online video lectures Journal Article Open Learning: The Journal of Open, Distance and e-Learning, 29(1), pp. 72–85, 2014.
Links: http://hdl.handle.net/10251/55925 http://dx.doi.org/10.1080/02680513.2014.909722 http://www.mllp.upv.es/wp-content/uploads/2015/04/author_version.pdf
Abstract: [EN] Video lectures are fast becoming an everyday educational resource in higher education. They are being incorporated into existing university curricula around the world, while also emerging as a key component of the open education movement. In 2007 the Universitat Politècnica de València (UPV) implemented its poliMèdia lecture capture system for the creation and publication of quality educational video content, and it now has a collection of over 10,000 video objects. In 2011 it embarked on the EU-subsidised transLectures project to add automatic subtitles to these videos in Spanish and other languages. This gives non-native speakers and the deaf and hard-of-hearing access to the educational content, and also enables advanced repository management functions. In this paper, following a short introduction to poliMèdia, transLectures and Docència en Xarxa (the UPV's action plan to boost the use of digital resources at the university), we discuss the three-stage evaluation process carried out with the collaboration of UPV lecturers to find the best interaction protocol for the task of post-editing automatic subtitles. |