[Elsnet-list] VACANCIES: two computer linguists

Katrien Depuydt depuydt at inl.nl
Thu Dec 20 14:07:36 CET 2007

    Vacancies for two computer linguists

The Institute for Dutch Lexicology has two vacancies for experienced 
computer linguists for the development of Named Entity Processing tools 

/IMPACT/ is a new European research project in the field of informatics 
for the humanities. The project will start on 1 january 2008. In IMPACT 
15 National libraries and research institutes from Europe, Israel and 
Russia will work together.

The main purpose of IMPACT is to obtain a significant improvement of the 
accessibility of historical documents.

To achieve this, the following will be tackled:

   1. Current OCR-software is not suitable for mass digitisation of
      historical documents. Within the project, OCR software will be
      developed that will significantly improve the accuracy of
      state­-of-the-art systems, so as to enable for the first time,
      reliable full text mass digitisation of historical documents.
   2. Information in historical documents is not easily accessed by
      modern users because of the historical language barrier. Within
      the project, historical lexica and linguistic processing tools
      will be developed that will enable enriched indexing to provide
      access historical material with contemporary query.

To be effective the lexica will also have to contain Named Entity data 
and tools for NE recognition and NE classification for historical 
language material will have to be developed.


The NE specialists will be responsible for the development of a toolbox 
for NE lexicon building and NE lexicon deployment to tackle historical 
language material to be used for the improvement of OCR of historical 
texts and for better retrieval on historical text material. The work 
will imply the implementation as well as the design of relevant algorithms.


- relevant background in computational linguistics, computer science or 
applied mathematics (master level, preferably PHD level)

- sufficient knowledge and experience with the development and 
implementation of NLP algorithms, preferably in the field of NE processing

- sufficient experience in developing complex software systems; 
preferably proficiency in C, C++ and/or Java

- knowledge of Dutch language is required, preferably knowledge of 
historical Dutch language


An INL contract for two years. According to the 
cao–Onderzoekinstellingen the salary scale indicated for this job is 11 
max., with a maximum of € 4.138, - gross per month on the basis of a 40 
hour week. In addition you will be entitled to 42 days holiday per year 
plus holiday pay.


Contact Katrien Depuydt (Taalbank) INL, Postbus 9515, 2300 RA, Leiden

tel. (+31 (0)71 527 2479), email: depuydt at inl.nl. <mailto:depuyd at inl.nl>

Send your application to Dr. Jeannine Beeken, INL, Postbus 9515, 2300RA 
Leiden, email: secretariaat at inl.nl <mailto:secretariaat at inl.nl>

*Closing date:* 02-01-2008

Katrien Depuydt
Instituut voor Nederlandse Lexicologie
(Institute for Dutch Lexicology)
(Language Database Dept.)
Postbus 9515
NL-2300 RA Leiden

tel.: +31 71 5272479
mail: depuydt at inl.nl

More information about the Elsnet-list mailing list