Publications

2025

Santamaría-Jordà, Jaume; Segovia-Martínez, Pablo; Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Giménez, Adrià; Gaspar Aparicio, Rubén; Fernández Sánchez, René; Civera, Jorge; Sanchis, Albert; Juan, Alfons

LHCP-ASR: An English Speech Corpus of High-Energy Particle Physics Talks for Narrow-Domain ASR Benchmarking Inproceedings

Interspeech 2025, pp. 4033–4037, Rotterdam (Netherlands), 2025.

Tags: Automatic Speech Recognition, domain adaptation, manual transcription, pseudo-labelling, speech corpus

2021

Garcés Díaz-Munío, Gonçal V; Silvestre-Cerdà, Joan Albert; Jorge, Javier; Giménez, Adrià; Iranzo-Sánchez, Javier; Baquero-Arnal, Pau; Roselló, Nahuel; Pérez-González-de-Martos, Alejandro; Civera, Jorge; Sanchis, Albert; Juan, Alfons

Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization Inproceedings

Proc. Interspeech 2021, pp. 3695–3699, Brno (Czech Republic), 2021.

Tags: Automatic Speech Recognition, speech corpus, speech data filtering, speech data verbatimization