Publications


45 entries « 1 of 2 »

2021

Jorge, Javier; Giménez, Adrià; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González-de-Martos, Alejandro; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge Inproceedings

Proc. of IberSPEECH 2021, pp. 118–122, Valladolid (Spain), 2021.

Tags: Automatic Speech Recognition, Natural Language Processing, streaming

Pérez-González-de-Martos, Alejandro; Iranzo-Sánchez, Javier; Giménez Pastor, Adrià; Jorge, Javier; Silvestre-Cerdà, Joan-Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Towards simultaneous machine interpretation Inproceedings Forthcoming

Proc. Interspeech 2021, Brno (Czech Republic), Forthcoming.

Tags: cross-lingual voice cloning, incremental text-to-speech, simultaneous machine interpretation, speech-to-speech translation

Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Baquero-Arnal, Pau; Roselló, Nahuel; Pérez-González-de-Martos, Alejandro; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization Inproceedings Forthcoming

Proc. Interspeech 2021, Brno (Czech Republic), Forthcoming.

Tags: Automatic Speech Recognition, speech corpus, speech data filtering, speech data verbatimization

Iranzo-Sánchez, Javier; Jorge, Javier; Baquero-Arnal, Pau; Silvestre-Cerdà, Joan Albert; Giménez, Adrià; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Streaming cascade-based speech translation leveraged by a direct segmentation model Journal Article

Neural Networks, 142, pp. 303–315, 2021.

Tags: Automatic Speech Recognition, Cascade System, Deep Neural Networks, Hybrid System, Machine Translation, Segmentation Model, Speech Translation, streaming

2020

Iranzo-Sánchez, Javier; Giménez Pastor, Adrià; Silvestre-Cerdà, Joan Albert; Baquero-Arnal, Pau; Saiz, Jorge Civera; Juan, Alfons

Direct Segmentation Models for Streaming Speech Translation Inproceedings

2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), pp. 2599–2611, 2020.

Tags: Segmentation, Speech Translation, streaming

Baquero-Arnal, Pau; Jorge, Javier; Giménez, Adrià; Silvestre-Cerdà, Joan Albert; Iranzo-Sánchez, Javier; Sanchis, Albert; Civera, Jorge; Juan, Alfons

Improved Hybrid Streaming ASR with Transformer Language Models Inproceedings

Proc. of 21st Annual Conf. of the Intl. Speech Communication Association (InterSpeech 2020), pp. 2127–2131, Shanghai (China), 2020.

Tags: hybrid ASR, language models, streaming, Transformer

Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Jorge, Javier; Roselló, Nahuel; Giménez, Adrià; Sanchis, Albert; Civera, Jorge; Juan, Alfons

Europarl-ST: A Multilingual Corpus for Speech Translation of Parliamentary Debates Inproceedings

Proc. of 45th Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 8229–8233, Barcelona (Spain), 2020.

Tags: Automatic Speech Recognition, Machine Translation, Multilingual Corpus, Speech Translation, Spoken Language Translation

Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

LSTM-Based One-Pass Decoder for Low-Latency Streaming Inproceedings

Proc. of 45th Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7814–7818, Barcelona (Spain), 2020.

Tags: acoustic modeling, Automatic Speech Recognition, decoding, Language Modeling, streaming

2019

Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Real-time One-pass Decoder for Speech Recognition Using LSTM Language Models Inproceedings

Proc. of the 20th Annual Conf. of the ISCA (Interspeech 2019), pp. 3820–3824, Graz (Austria), 2019.

Tags: Automatic Speech Recognition, LSTM language models, one-pass decoding, real-time

2018

Jorge, Javier; Martínez-Villaronga, Adrià; Golik, Pavel; Giménez, Adrià; Silvestre-Cerdà, Joan Albert; Doetsch, Patrick; Císcar, Vicent Andreu; Ney, Hermann; Juan, Alfons; Sanchis, Albert

MLLP-UPV and RWTH Aachen Spanish ASR Systems for the IberSpeech-RTVE 2018 Speech-to-Text Transcription Challenge Inproceedings

Proc. of IberSPEECH 2018: 10th Jornadas en Tecnologías del Habla and 6th Iberian SLTech Workshop, pp. 257–261, Barcelona (Spain), 2018.

Tags: Automatic Speech Recognition, Iberspeech-RTVE-Challenge2018, IberSpeech2018, Speech-to-Text

Del-Agua, Miguel Ángel; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks Journal Article

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26 (7), pp. 1194–1202, 2018.

Tags: Automatic Speech Recognition, Confidence estimation, Confidence measures, Deep bidirectional recurrent neural networks, Long short-term memory, Speaker adaptation

2016

del-Agua, Miguel Ángel; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks Inproceedings

Proc. of the 17th Annual Conf. of the ISCA (Interspeech 2016), pp. 3464–3468, San Francisco (USA), 2016.

Tags: BLSTM, Confidence measures, Recurrent Neural Networks, Speaker adaptation, Speech Recognition

del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

The MLLP system for the 4th CHiME Challenge Inproceedings

Proc. of the 4th CHiME Speech Separation and Recognition Challenge (CHiME-4), pp. 57–59, San Francisco (USA), 2016.


2015

del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Piqueras, Santiago; Giménez, Adrià; Sanchis, Alberto; Civera, Jorge; Juan, Alfons

The MLLP ASR Systems for IWSLT 2015 Inproceedings

Proc. of 12th Intl. Workshop on Spoken Language Translation (IWSLT 2015), pp. 39–44, Da Nang (Vietnam), 2015.


Khoury, Ihab; Giménez, Adrià; Juan, Alfons; Andrés-Ferrer, Jesús

Window Repositioning for Printed Arabic Recognition Journal Article

Pattern Recognition Letters, 51, pp. 86–93, 2015, ISSN: 0167-8655.

Tags: Bernoulli HMMs, Printed Arabic Recognition, Repositioning, Sliding window

2014

Giménez Pastor, Adrià

Bernoulli HMMs for Handwritten Text Recognition PhD Thesis

Universitat Politècnica de València, 2014, (Advisors: Alfons Juan Ciscar and Jesús Andrés Ferrer).


Wuebker, Joern; Ney, Hermann; Martínez-Villaronga, Adrià; Giménez, Adrià; Juan, Alfons; Servan, Christophe; Dymetman, Marc; Mirkin, Shachar

Comparison of Data Selection Techniques for the Translation of Video Lectures Inproceedings

Proc. of the Eleventh Biennial Conf. of the Association for Machine Translation in the Americas (AMTA-2014), pp. 193–207, Vancouver (Canada), 2014.


del-Agua, M A; Giménez, A; Serrano, N; Andrés-Ferrer, J; Civera, J; Sanchis, A; Juan, A

The transLectures-UPV toolkit Inproceedings

Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.


Piqueras, S; del-Agua, M A; Giménez, A; Civera, J; Juan, A

Statistical text-to-speech synthesis of Spanish subtitles Inproceedings

Proc. of VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop (IberSpeech 2014), Las Palmas de Gran Canaria (Spain), 2014.


Serrano, Nicolás; Giménez, Adrià; Civera, Jorge; Sanchis, Alberto; Juan, Alfons

Interactive Handwriting Recognition with Limited User Effort Journal Article

Intl. Journal on Document Analysis and Recognition (IJDAR), 17, pp. 47–59, 2014.


Giménez, Adrià; Khoury, Ihab; Andrés-Ferrer, Jesús; Juan, Alfons

Handwriting word recognition using windowed Bernoulli HMMs Journal Article

Pattern Recognition Letters, 35 (0), pp. 149–156, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).

Tags: Sliding window

Giménez, Adrià; Andrés-Ferrer, Jesús; Juan, Alfons

Discriminative Bernoulli HMMs for isolated handwritten word recognition Journal Article

Pattern Recognition Letters, 35 (0), pp. 157–168, 2014, ISSN: 0167-8655, (Frontiers in Handwriting Processing).

Tags: RIMES

2013

Khoury, Ihab; Giménez, Adrià; Andrés-Ferrer, Jesús; Juan, Alfons; Sánchez, Joan Andreu

The UPV Handwriting Recognition and Translation System for OpenHaRT 2013 Inproceedings

Proc. of the NIST Open Handwriting Recognition and Translation Evaluation Workshop (OpenHaRT 2013), Washington DC (USA), 2013.

Tags: Arabic HTR, Bernoulli HMM, NIST OpenHaRT, Repositioning, Sliding window

Alkhoury, Ihab; Giménez, Adrià; Juan, Alfons; Andrés-Ferrer, Jesús

Arabic Printed Word Recognition Using Windowed Bernoulli HMMs Inproceedings

Proc. of the 17th Intl. Conf. on Image Analysis and Processing (ICIAP 2013), pp. 330–339, Naples (Italy), 2013.


2012

Silvestre-Cerdà, Joan Albert; Del Agua, Miguel; Garcés, Gonçal; Gascó, Guillem; Giménez-Pastor, Adrià; Martínez, Adrià; Pérez González de Martos, Alejandro; Sánchez, Isaías; Serrano Martínez-Santos, Nicolás; Spencer, Rachel; Valor Miró, Juan Daniel; Andrés-Ferrer, Jesús; Civera, Jorge; Sanchís, Alberto; Juan, Alfons

transLectures Inproceedings

Proceedings of IberSPEECH 2012, pp. 345–351, Madrid (Spain), 2012.

Tags: Accessibility, Automatic Speech Recognition, Education, Intelligent Interaction, Language Technologies, Machine Translation, Massive Adaptation, Multilingualism, Opencast Matterhorn, Video Lectures

Silvestre-Cerdà, Joan Albert; Giménez, Adrià; Andrés-Ferrer, Jesús; Civera, Jorge; Juan, Alfons

Albayzin Evaluation: The PRHLT-UPV Audio Segmentation System Inproceedings

Proceedings of IberSPEECH 2012, pp. 596–600, Madrid (Spain), 2012.


Khoury, Ihab; Giménez-Pastor, Adrià; Juan, Alfons

Guide to OCR for Arabic Scripts Book Chapter

Märgner, Volker; El Abed, Haikal (Ed.): Chapter Arabic Handwriting Recognition Using Ber, pp. 255–272, Springer, 2012.


Toselli, Alejandro H.; Serrano, Nicolás; Giménez-Pastor, Adrià; Khoury, Ihab; Juan, Alfons; Vidal, Enrique

Language Technology for Handwritten Text Recognition Incollection

Advances in Speech and Language Technologies for Iberian Languages (iberSpeech 2012), 328, pp. 178–186, Springer Berlin Heidelberg, 2012.

Tags: Hidden Markov Model Emission Probability, Mixture of Bernoulli Distributions, Mixture of Gaussian Densities, Off-Line Continuous Handwritten Text Recognition

Doetsch, Patrick; Hamdani, Mahdi; Giménez-Pastor, Adrià; Andrés-Ferrer, Jesús; Juan, Alfons; Ney, Hermann

Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition Inproceedings

Proc. of the 2012 Intl. Conf. on Frontiers in Handwriting Recognition (ICFHR 2012), pp. 3–7, 2012.


2011

Serrano, Nicolás; Giménez-Pastor, Adrià; Sanchis, Alberto; Juan, Alfons

Multimodal Interactive Pattern Recognition and Applications Book Chapter

Toselli, Alejandro H; Vidal, Enrique; Casacuberta, Francisco (Ed.): Chapter Active Interaction and Learning in Handw, Springer, 1st Edition, 2011, (http://www.springer.com/computer/hci/book/978-0-85729-478-4).

