Publications

Show all

50 entries « 1 of 2 »

2022

Pérez González de Martos, Alejandro ; Giménez Pastor, Adrià ; Jorge Cano, Javier ; Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Garcés Díaz-Munío, Gonçal V; Baquero-Arnal, Pau; Sanchis Navarro, Alberto ; Civera Sáiz, Jorge ; Juan Ciscar, Alfons ; Turró Ribalta, Carlos

Doblaje automático de vídeo-charlas educativas en UPV[Media] Inproceedings

Proc. of VIII Congrés d'Innovació Educativa i Docència en Xarxa (IN-RED 2022), pp. 557–570, València (Spain), 2022.

Abstract | Links | BibTeX | Tags: automatic dubbing, Automatic Speech Recognition, Machine Translation, OER, text-to-speech

Iranzo-Sánchez, Javier; Jorge, Javier; Pérez-González-de-Martos, Alejandro; Giménez, Adrià; Garcés Díaz-Munío, Gonçal V; Baquero-Arnal, Pau; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks Inproceedings

Proc. of 19th Intl. Workshop on Spoken Language Translation (IWSLT 2022), pp. 255–264, Dublin (Ireland), 2022.

Abstract | Links | BibTeX | Tags: Simultaneous Speech Translation, speech-to-speech translation

Baquero-Arnal, Pau; Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Pérez-González-de-Martos, Alejandro; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge: Extension Journal Article

Applied Sciences, 12 (2), pp. 804, 2022.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Natural Language Processing, streaming

2021

Jorge, Javier ; Giménez, Adrià ; Silvestre-Cerdà, Joan Albert ; Civera, Jorge ; Sanchis, Albert ; Alfons, Juan

Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models Journal Article

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30 , pp. 148–161, 2021.

Abstract | Links | BibTeX | Tags: acoustic modelling, Automatic Speech Recognition, decoding, language modelling, neural networks, streaming

Pérez, Alejandro; Garcés Díaz-Munío, Gonçal ; Giménez, Adrià; Silvestre-Cerdà, Joan Albert ; Sanchis, Albert; Civera, Jorge; Jiménez, Manuel; Turró, Carlos; Juan, Alfons

Towards cross-lingual voice cloning in higher education Journal Article

Engineering Applications of Artificial Intelligence, 105 , pp. 104413, 2021.

Abstract | Links | BibTeX | Tags: cross-lingual voice conversion, educational resources, multilinguality, OER, text-to-speech

Jorge, Javier; Giménez, Adrià; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González-de-Martos, Alejandro; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge Inproceedings

Proc. of IberSPEECH 2021, pp. 118–122, Valladolid (Spain), 2021.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Natural Language Processing, streaming

Pérez-González-de-Martos, Alejandro; Iranzo-Sánchez, Javier; Giménez Pastor, Adrià ; Jorge, Javier; Silvestre-Cerdà, Joan-Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Towards simultaneous machine interpretation Inproceedings

Proc. Interspeech 2021, pp. 2277–2281, Brno (Czech Republic), 2021.

Abstract | Links | BibTeX | Tags: cross-lingual voice cloning, incremental text-to-speech, simultaneous machine interpretation, speech-to-speech translation

Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert ; Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Baquero-Arnal, Pau; Roselló, Nahuel; Pérez-González-de-Martos, Alejandro; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization Inproceedings

Proc. Interspeech 2021, pp. 3695–3699, Brno (Czech Republic), 2021.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, speech corpus, speech data filtering, speech data verbatimization

Iranzo-Sánchez, Javier; Jorge, Javier; Baquero-Arnal, Pau; Silvestre-Cerdà, Joan Albert ; Giménez, Adrià; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Streaming cascade-based speech translation leveraged by a direct segmentation model Journal Article

Neural Networks, 142 , pp. 303–315, 2021.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Cascade System, Deep Neural Networks, Hybrid System, Machine Translation, Segmentation Model, Speech Translation, streaming

2020

Iranzo-Sánchez, Javier; Giménez Pastor, Adrià ; Silvestre-Cerdà, Joan Albert; Baquero-Arnal, Pau; Saiz, Jorge Civera; Juan, Alfons

Direct Segmentation Models for Streaming Speech Translation Inproceedings

Proc. of 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), pp. 2599–2611, 2020.

Abstract | Links | BibTeX | Tags: Segmentation, Speech Translation, streaming

Baquero-Arnal, Pau ; Jorge, Javier ; Giménez, Adrià ; Silvestre-Cerdà, Joan Albert ; Iranzo-Sánchez, Javier ; Sanchis, Albert ; Civera, Jorge ; Juan, Alfons

Improved Hybrid Streaming ASR with Transformer Language Models Inproceedings

Proc. of 21st Annual Conf. of the Intl. Speech Communication Association (InterSpeech 2020), pp. 2127–2131, Shanghai (China), 2020.

Abstract | Links | BibTeX | Tags: hybrid ASR, language models, streaming, Transformer

Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Jorge, Javier; Roselló, Nahuel; Giménez, Adrià; Sanchis, Albert; Civera, Jorge; Juan, Alfons

Europarl-ST: A Multilingual Corpus for Speech Translation of Parliamentary Debates Inproceedings

Proc. of 45th Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 8229–8233, Barcelona (Spain), 2020.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Machine Translation, Multilingual Corpus, Speech Translation, Spoken Language Translation

Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

LSTM-Based One-Pass Decoder for Low-Latency Streaming Inproceedings

Proc. of 45th Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7814–7818, Barcelona (Spain), 2020.

Abstract | Links | BibTeX | Tags: acoustic modeling, Automatic Speech Recognition, decoding, Language Modeling, streaming

2019

Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Real-time One-pass Decoder for Speech Recognition Using LSTM Language Models Inproceedings

Proc. of the 20th Annual Conf. of the ISCA (Interspeech 2019), pp. 3820–3824, Graz (Austria), 2019.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, LSTM language models, one-pass decoding, real-time

2018

Del-Agua, Miguel Ángel ; Giménez, Adrià ; Sanchis, Alberto ; Civera, Jorge; Juan, Alfons

Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks Journal Article

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26 (7), pp. 1194–1202, 2018.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Confidence estimation, Confidence measures, Deep bidirectional recurrent neural networks, Long short-term memory, Speaker adaptation

Jorge, Javier ; Martínez-Villaronga, Adrià ; Golik, Pavel ; Giménez, Adrià ; Silvestre-Cerdà, Joan Albert ; Doetsch, Patrick ; Císcar, Vicent Andreu ; Ney, Hermann ; Juan, Alfons ; Sanchis, Albert

MLLP-UPV and RWTH Aachen Spanish ASR Systems for the IberSpeech-RTVE 2018 Speech-to-Text Transcription Challenge Inproceedings

Proc. of IberSPEECH 2018: 10th Jornadas en Tecnologías del Habla and 6th Iberian SLTech Workshop, pp. 257–261, Barcelona (Spain), 2018.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Iberspeech-RTVE-Challenge2018, IberSpeech2018, Speech-to-Text

2016

del-Agua, Miguel Ángel; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks Inproceedings

Proc. of the 17th Annual Conf. of the ISCA (Interspeech 2016), pp. 3464–3468, San Francisco (USA), 2016.

Abstract | Links | BibTeX | Tags: BLSTM, Confidence measures, Recurrent Neural Networks, Speaker adaptation, Speech Recognition

del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

The MLLP system for the 4th CHiME Challenge Inproceedings

Proc. of the 4th Intl. Workshop on Speech Processing in Everyday Environments (CHiME 2016), pp. 57–59, San Francisco (USA), 2016.

Abstract | Links | BibTeX | Tags:

2015

del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

The MLLP ASR Systems for IWSLT 2015 Inproceedings

Proc. of 12th Intl. Workshop on Spoken Language Translation (IWSLT 2015), pp. 39–44, Da Nang (Vietnam), 2015.

Abstract | Links | BibTeX | Tags:

Khoury, Ihab; Giménez, Adrià; Juan, Alfons; Andrés-Ferrer, Jesús

Window Repositioning for Printed Arabic Recognition Journal Article

Pattern Recognition Letters, 51 , pp. 86–93, 2015, ISSN: 0167-8655.

Abstract | Links | BibTeX | Tags: Bernoulli HMMs, Printed Arabic Recognition, Repositioning, Sliding window

2014

Giménez Pastor, Adrià

Bernoulli HMMs for Handwritten Text Recognition PhD Thesis

Universitat Politècnica de València , 2014, (Advisors: Alfons Juan Ciscar and Jesús Andrés Ferrer).

Links | BibTeX | Tags:

Serrano, Nicolás; Giménez, Adrià; Civera, Jorge; Sanchis, Alberto; Juan, Alfons

Interactive Handwriting Recognition with Limited User effort Journal Article

Intl. Journal on Document Analysis and Recognition (IJDAR), 17 , pp. 47–59, 2014.

Links | BibTeX | Tags:

Giménez, Adrià; Andrés-Ferrer, Jesús; Juan, Alfons

Discriminative Bernoulli HMMs for isolated handwritten word recognition Journal Article

Pattern Recognition Letters, 35 (0), pp. 157–168, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).

Links | BibTeX | Tags: RIMES

Giménez, Adrià; Khoury, Ihab; Andrés-Ferrer, Jesús; Juan, Alfons

Handwriting word recognition using windowed Bernoulli HMMs Journal Article

Pattern Recognition Letters, 35 (0), pp. 149–156, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).

Links | BibTeX | Tags: Sliding window

Piqueras, S; del-Agua, M A; Giménez, A; Civera, J; Juan, A

Statistical text-to-speech synthesis of Spanish subtitles Inproceedings

Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.

Links | BibTeX | Tags:

del-Agua, M A; Giménez, A; Serrano, N; Andrés-Ferrer, J; Civera, J; Sanchis, A; Juan, A

The transLectures-UPV toolkit Inproceedings

Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.

Links | BibTeX | Tags:

Wuebker, Joern; Ney, Hermann; Martínez-Villaronga, Adrià; Giménez, Adrià; Juan, Alfons; Servan, Christophe; Dymetman, Marc; Mirkin, Shachar

Comparison of Data Selection Techniques for the Translation of Video Lectures Inproceedings

Proc. of the Eleventh Biennial Conf. of the Association for Machine Translation in the Americas (AMTA-2014), pp. 193–207, Vancouver (Canada), 2014.

Links | BibTeX | Tags:

2013

Khoury, Ihab ; Giménez, Adrià ; Andrés-Ferrer, Jesús ; Juan, Alfons ; Sánchez, Joan Andreu

The UPV Handwriting Recognition and Translation System for OpenHaRT 2013 Inproceedings

Proc. of the NIST Open Handwriting Recognition and Translation Evaluation Workshop (OpenHaRT 2013), Washington DC (USA), 2013.

Links | BibTeX | Tags: Arabic HTR, Bernoulli HMM, NIST OpenHaRT, Repositioning, Sliding window

Alkhoury, Ihab; Giménez, Adrià; Juan, Alfons; Andrés-Ferrer, Jesús

Arabic Printed Word Recognition Using Windowed Bernoulli HMMs Inproceedings

Proc. of the 17th Intl. Conf. on Image, Analysis and Processings (ICIAP 2013), pp. 330 – 339, Naples (Italy), 2013.

Links | BibTeX | Tags:

2012

Silvestre-Cerdà, Joan Albert ; Del Agua, Miguel ; Garcés, Gonçal; Gascó, Guillem; Giménez-Pastor, Adrià; Martínez, Adrià; Pérez González de Martos, Alejandro ; Sánchez, Isaías; Serrano Martínez-Santos, Nicolás ; Spencer, Rachel; Valor Miró, Juan Daniel ; Andrés-Ferrer, Jesús; Civera, Jorge; Sanchís, Alberto; Juan, Alfons

transLectures Inproceedings

Proceedings (Online) of IberSPEECH 2012, pp. 345–351, Madrid (Spain), 2012.

Abstract | Links | BibTeX | Tags: Accessibility, Automatic Speech Recognition, Education, Intelligent Interaction, Language Technologies, Machine Translation, Massive Adaptation, Multilingualism, Opencast Matterhorn, Video Lectures

50 entries « 1 of 2 »