[Elsnet-list] ICDAR 2013 Recognition of Handwritten Historical Texts (RHHT) Competition - Deadlines extended!

CD cmartine at dsic.upv.es
Mon Mar 11 09:35:28 CET 2013

Apologies for multiple copies, please distribute!

Notice extended deadlines!


ICDAR 2013 Recognition of Handwritten Historical Texts (RHHT) Competition

The "ICDAR 2013 Recognition of Handwritten Historical Texts (RHHT)"
competition is organised in the framework of the ICDAR 2013 competitions
by the PRHLT group. Its goal is to bring together the different
researchers working on off-line handwritten text recognition (HTR) and
let them work on the specific application of off-line HTR techniques to
historical texts, in order to provide them a suitable benchmark to compare
their techniques. In this competition, the recognition of a database of
historical handwritten text is proposed. The database corresponds to a
manuscript written in Spanish during the 16th century by a single writer.


The participant systems must try to obtain the most accurate recognition
results of the test partition. The available data for achieving this task
will consist of:

- Image for each line of the training, validation, and test sets (see example 
- Corresponding transcriptions for each line (only for training and validation 

Training partition is about 10,000 lines, whereas validation and test
partitions are about 5,000 lines each. An extra test partition of a
similar text, but from a different database, may be provided to check the
robustness of the systems against small changes in language and
handwritting style. A baseline system based on HTK and SRILM is provided,
along with a set of scripts that performs a baseline training and test
experiment. The participants can use this baseline system as an initial
approach to their own systems, where they will be allowed to improve this
baseline by using:

- different feature extraction techniques
- different recognition systems
- different types of models
- ...

The only restriction will be related to the final training data, that must 
pertain to the training and validation sets provided by the organisers.

Several submissions per participant will be allowed and all the results will be 
considered when presenting the competition results. In each submission, the 
participant must provide a brief description of the characteristics of the 
submitted system, emphasising the main differences between the submitted system 
and the baseline system. The final goal is to analyse the different proposals 
of the participants.


The evaluation will be performed on the final result of the whole system (from 
preprocess to recognition). The evaluation metric is based on final word 
recognition, and Word Error Rate (WER) will be used to determine the 
performance of the systems. The winner will be the one that obtain the less WER 
on the test set. A web-based platform will be available for the participants to 
check their validation and test results.

The description of the methods and the evaluation scores will be presented 
during a dedicated ICDAR 2013 session. A report on the competition will be 
published in the ICDAR 2013 conference proceedings. Participants on this 
contest are not obliged to attend the ICDAR 2013 conference, although their 
presence in the presentation the evaluation and the posterior discusion will be 
very much appreciated.


To inscribe in this contest send an e-mail to cmartine_AT_dsic_DOT_upv_DOT_es 
with the subject "ICDAR 2013 RHHT competition inscription". In the message you 
must provide the following data:

- Group name and acronym
- Institution
- Participants and e-mail
- Contact person

A username and password will be given to each registered participant. Each 
participant will have access granted to the data and evaluation page by using 
that username and password.


- Feb, 15th, 2013 Competition opens, start of inscription period, training and 
validation data available, baseline system available.
- Apr, 1st, 2013 (EXTENDED!) Registration deadline (no more participants 
would be admitted).
- May, 1st, 2013 (EXTENDED!) Test data available
- May, 15th, 2013 Deadline for systems results


Carlos-D. Martínez-Hinarejos
Nicolás Serrano
PRHLT group - Universitat Politècnica de València

More information about the Elsnet-list mailing list