Early software by MLLP researchers (2010-2015): AK, GIDOC, jaf_Tools, Bilingual Text Classification. https://www.mllp.upv.es

Gonçal V. Garcés Díaz-Munío 6cb8443ead Completed README with links to repositories 1 year ago
README.md 6cb8443ead Completed README with links to repositories 1 year ago

README.md

Early_Software_by_the_MLLP

Early software by MLLP researchers (2010-2015): AK, GIDOC, jaf_Tools, Bilingual Text Classification.

Website of the MLLP research group, VRAIN Institut Valencià d'Investigació en Intel·ligència Artificial, Universitat Politècnica de València: https://www.mllp.upv.es/

AK

Another Kit for the building and use of Bernoulli (and diagonal Gaussian) Hidden Markov Models (HMMs). The precursor for TLK, the MLLP's current hybrid DNN-HMM ASR toolkit.

Features:

  • Free HMM-based toolkit for (handwritten) text (or speech) recognition.
  • Supports Bernoulli mixture and diagonal Gaussian mixture HMMs.
  • Core implemented as a dynamic library.

Learn more and download: http://aktoolkit.sourceforge.net/

BibTeX for citations:

    @misc{AK,
    author={Giménez Pastor, Adrià and del Agua Teba, Miguel Ángel and Andrés Ferrer, Jesús and Juan Ciscar, Alfons},
    title={AK: Another Kit for the building and use of Hidden Markov Models},
    year={2012},
    howpublished={\url{http://sourceforge.net/projects/aktoolkit/}}
    }

GIDOC: GIMP-based Interactive transcription of old text DOCuments

A computer-assisted transcription prototype for handwritten text in old documents. Developed within the project iTransDoc.

Features:

  • Interactive transcription of old text documents.
  • System training.

Learn more and download: http://gidoc.sourceforge.net/

BibTeX for citations:

    @InProceedings{Serrano10-gidoc,
    author = {N. Serrano and L. Tarazón and O. Ramos~Terrades and A. Juan}
    title = {{The GIDOC prototype}},
    booktitle = {Proc. of the 10th Intl. Workshop on Pattern Recognition in Information Systems (PRIS 2010)},
    pages = {82–89},
    year = {2010},
    address = {Funchal (Portugal)}
    }

jaf_Tools

jaf_MT

Software for training phrase-based Hidden semi-Markov Models for SMT. Learn more and download: http://sourceforge.net/projects/jafmt/

jaf_Kernels

Similarity Word-Sequence Kernels for Sentence Clustering toolkit. Learn more and download: http://sourceforge.net/projects/jafkernels/

jaf_Utils

A C++ library for Statistical Language Processing tasks. Learn more and download: http://sourceforge.net/projects/jafutils/

BibTeX for citations:

    @InProceedings{Andres09-jafmt,
    author = {Jesús Andrés-Ferrer and Alfons Juan},
    title = {{A phrase-based hidden semi-Markov approach to machine translation}},
    booktitle = {Proc. of the 13th Conf. of the European Association for Machine Translation (EAMT 2009)},
    pages = {168–175},
    year = {2009},
    address = {Barcelona (Spain)}
    }

    @InProceedings{Andres10-jafkernels,
    author = {Jesús Andrés-Ferrer and Germán Sanchis-Trilles and Francisco Casacuberta},
    title = {{Similarity Word-Sequence Kernels for Sentence Clustering}},
    booktitle = {Proc. of the 8th Intl. Workshop on Statistical Pattern Recognition (S+SSPR 2010)},
    pages = {610–619},
    year = {2010},
    address = {Cesme (Turkey)}
    }

    @misc{Andres09-jafutils,
    author={Jesús Andrés-Ferrer},
    title={{jaf_Utils: A C++ library for Statistical Language Processing tasks}},
    year={2009},
    howpublished={\url{http://sourceforge.net/projects/jafutils/}}
    }

Bilingual Text Classification

A software package implementing statistical mixture models for bilingual text classification trained with the EM algorithm.

Learn more and download: https://mloss.org/software/view/247/

BibTeX for citations:

    @PhdThesis{Civera08,
    author = {J. Civera},
    title = {{Novel statistical approaches to text classification, machine translation and computer-assisted translation}},
    school = {Universitat Politècnica de València},
    year = {2008},
    note = {Advisors: A. Juan and F. Casacuberta}
    }