[Elsnet-list] Reminder: CFP: 3rd Workshop on Example-Based Machine Translation

Mikel L. Forcada mlf at dlsi.ua.es
Mon Sep 7 11:50:26 CEST 2009

REMINDER: Call for papers
3rd Workshop on Example-Based Machine Translation
"Going open-source to revive EBMT"

Hosted by the Centre for Next Generation Localisation (CNGL)
Dublin City University, Dublin, Ireland
November 12–13, 2009



    * Mikel L. Forcada (DLSI, Universitat d'Alacant and CNGL, Dublin City
    * Andy Way (CNGL, Dublin City University)

Following two successful EBMT workshops in 2001 at the MT Summit VIII in
Santiago de Compostela, Spain, and in 2005 at the MT Summit X in Phuket,
Thailand, this is the third workshop of its kind. Many things have happened
since 2005. The last few years have witnessed a decline in example-based machine
translation (EBMT) research and statistical machine translation (SMT) has almost
completely taken over the corpus-based machine translation arena, with many EBMT
practitioners moving into hybrid approaches integrating EBMT with other
approaches, mostly (but not only) SMT. Not having a clear definition of what
EBMT is has also contributed to this lack of visibility. In fact, research that
would have been considered EBMT has been published without the EBMT label.

Is the success of SMT due to the fact that it is the best way to do corpus-based
machine translation or is it because many SMT software packages are readily
available to researchers under free/open-source licences that allow use as well
as collaborative improvement? Shouldn't EBMT practitioners start to think about
putting together their tools, their engines and their data and releasing them
under open licenses to extend their use both in academia and in industry?

The pressure on machine translation researchers to prove their results through
detailed empirical evaluation is growing. But the validity of empirical results
hinges on reproducibility. Turning our experimental research into packages and
tools that other researchers can use and improve is a challenge but it is not
infeasible as SMT practitioners have shown.

Papers addressing this or other aspects related to a prospective strengthening
of research on EBMT and real-world applications of it would be very welcome. As
in previous editions, we expect papers on:

    * descriptions of 'pure' EBMT systems
          o knowledge resources used
          o representation of numeric and symbolic knowledge
    * descriptions of 'hybrid' systems
          o integration of EBMT with rule-based methods
          o integration of EBMT with statistical methods
    * (semi-)automatic preparation of existing mono/bi/multilingual corpora for EBMT
          o extraction of bi/multilingual texts from the web
          o preparation of treebanks for EBMT
          o bi/multilingual alignment/bracketing/parsing
          o inference of mono/bi/multilingual grammars
          o inference of bi/multilingual transfer rules
    * evaluation of EBMT results and/or comparison with other MT systems
    * considerations on domain-(in)dependence of EBMT systems
    * computational and/or system complexity of EBMT systems

In addition, we plan to have a panel session focusing on the main theme of the
conference, that is, open-sourcing of existing EBMT software to strenghten EBMT
research and usage.

Important Dates

    * Paper Submission: 18 September 2009
    * Notification of acceptance: 30 September 2009
    * Camera Ready Papers due: 22 October 2009
    * Workshop takes place: 12–13 November 2009

Paper submission

Please submit papers of up to 8 pages in PDF format through the Easychair
system, conforming to the format of the Easychair LaTeX template (A4 paper
size). Templates for other word-processing software may be provided on request.
Authors are kindly requested to keep their papers anonymous and submit the
remaining information through the website.


In addition to being published as part of the (printed and online) Workshop
proceedings, revised, extended versions of accepted papers will be eligible for
publication in a Special Issue of the journal Machine Translation.

Attendance Fee

Details of registration procedures, including registration fees, will be
included here when they have are available.

Programme committee

    * Sivaji Bandyopadhyay, Jadavpur University, Kolkata, India
    * Ralf Brown, Carnegie Mellon University, Pittsburgh, USA.
    * Michael Carl, Copenhagen Business School, Denmark
    * David Farwell, ICREA-UPC, Barcelona, Spain
    * Mikel Forcada, Universitat d'Alacant, Spain and Dublin City University,
    * Declan Groves, Traslán, Ireland
    * John Hutchins, Norwich, U.K.
    * Sadao Kurohashi, Kyoto University, Japan.
    * Yves Lepage, Université de Caen, France.
    * Harold Somers, Dublin City University, Dublin, Ireland
    * Eiichiro Sumita, ATR, Kyoto, Japan.
    * Andy Way, Dublin City University, Ireland

Mikel L. Forcada <mlf at dlsi.ua.es>
Dept. Llenguatges i Sistemes Informàtics
Universitat d\\\'Alacant, E-03071 Alacant (Spain)
Tel.: +34 96 590 9776    Fax: +34 96 590 9326

More information about the Elsnet-list mailing list