Publications

Show all

2022

Jorge Cano, Javier

Streaming Automatic Speech Recognition with Hybrid Architectures and Deep Neural Network Models PhD Thesis

Universitat Politècnica de València, 2022, (Advisors: Alfons Juan Ciscar and Jorge Civera Saiz).

Links | BibTeX | Tags: Automatic Speech Recognition, Deep Neural Networks, hybrid ASR, streaming

Baquero-Arnal, Pau; Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Pérez-González-de-Martos, Alejandro; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge: Extension Journal Article

Applied Sciences, 12 (2), pp. 804, 2022.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Natural Language Processing, streaming

2021

Jorge, Javier ; Giménez, Adrià ; Silvestre-Cerdà, Joan Albert ; Civera, Jorge ; Sanchis, Albert ; Alfons, Juan

Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models Journal Article

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30 , pp. 148–161, 2021.

Abstract | Links | BibTeX | Tags: acoustic modelling, Automatic Speech Recognition, decoding, language modelling, neural networks, streaming

Jorge, Javier; Giménez, Adrià; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González-de-Martos, Alejandro; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge Inproceedings

Proc. of IberSPEECH 2021, pp. 118–122, Valladolid (Spain), 2021.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Natural Language Processing, streaming

Iranzo-Sánchez, Javier; Jorge, Javier; Baquero-Arnal, Pau; Silvestre-Cerdà, Joan Albert ; Giménez, Adrià; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Streaming cascade-based speech translation leveraged by a direct segmentation model Journal Article

Neural Networks, 142 , pp. 303–315, 2021.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Cascade System, Deep Neural Networks, Hybrid System, Machine Translation, Segmentation Model, Speech Translation, streaming

Javier Iranzo-Sánchez Jorge Civera, Alfons Juan

Stream-level Latency Evaluation for Simultaneous Machine Translation Inproceedings

Findings of the ACL: EMNLP 2021, pp. 664–670, Punta Cana (Dominican Republic), 2021.

Abstract | Links | BibTeX | Tags: latency, simultaneous machine translation, stream-level evaluation, streaming

2020

Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Silvestre-Cerdà, Joan Albert; Civera, Jorge; Sanchis, Albert; Juan, Alfons

LSTM-Based One-Pass Decoder for Low-Latency Streaming Inproceedings

Proc. of 45th Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7814–7818, Barcelona (Spain), 2020.

Abstract | Links | BibTeX | Tags: acoustic modeling, Automatic Speech Recognition, decoding, Language Modeling, streaming

Baquero-Arnal, Pau ; Jorge, Javier ; Giménez, Adrià ; Silvestre-Cerdà, Joan Albert ; Iranzo-Sánchez, Javier ; Sanchis, Albert ; Civera, Jorge ; Juan, Alfons

Improved Hybrid Streaming ASR with Transformer Language Models Inproceedings

Proc. of 21st Annual Conf. of the Intl. Speech Communication Association (InterSpeech 2020), pp. 2127–2131, Shanghai (China), 2020.

Abstract | Links | BibTeX | Tags: hybrid ASR, language models, streaming, Transformer

Iranzo-Sánchez, Javier; Giménez Pastor, Adrià ; Silvestre-Cerdà, Joan Albert; Baquero-Arnal, Pau; Saiz, Jorge Civera; Juan, Alfons

Direct Segmentation Models for Streaming Speech Translation Inproceedings

Proc. of 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), pp. 2599–2611, 2020.

Abstract | Links | BibTeX | Tags: Segmentation, Speech Translation, streaming