[Elsnet-list] research internship @ Xerox Research Centre Europe

Matthias Gallé mgalle at gmail.com
Thu Jan 30 18:04:09 CET 2014

An open internship position on a mix of string algorithms, grammatical
inference and statistical machine learning


We are looking for a motivated intern for a project involving the use of
sequential patterns for the inference of grammars.

The Smallest Grammar Problem is the problem of finding the smallest
context-free grammar that generates exactly one given sequence. We plan to
generalize this in order to find grammars which generate a set of natural
language documents with a strong but hidden structure. This structure will
then be converted into additional features (through tree kernels for
example) in our analytics pipeline or alternatively as a starting template
for existing multilingual authoring tools.
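
As a rough illustration of what "a small grammar generating exactly one
sequence" means, here is a minimal sketch of a greedy pair-replacement
heuristic in the style of Re-Pair. It is only an example of one well-known
approximation, not the method planned for the internship. The function names
(repair, expand, grammar_size) and the convention of using integers for
nonterminals and single characters for terminals are assumptions made for
this sketch.

```python
from collections import Counter


def repair(sequence):
    """Greedy Re-Pair-style heuristic (illustrative only).

    Repeatedly replaces the most frequent adjacent pair of symbols with a
    fresh nonterminal. Nonterminals are ints, terminals are the characters
    of the input. Returns (start, rules), where start is the right-hand
    side of the start rule and rules maps each nonterminal to a pair.
    """
    seq = list(sequence)
    rules = {}
    next_id = 0
    while True:
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so nothing left to factor out
        rules[next_id] = pair
        # Left-to-right, non-overlapping replacement of the chosen pair.
        out = []
        i = 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        next_id += 1
    return seq, rules


def expand(symbol, rules):
    """Expand a symbol back into the terminal string it derives."""
    if isinstance(symbol, int):
        return "".join(expand(s, rules) for s in rules[symbol])
    return symbol


def grammar_size(start, rules):
    """Grammar size measured as the total length of all right-hand sides."""
    return len(start) + sum(len(rhs) for rhs in rules.values())


if __name__ == "__main__":
    text = "abcabcabcabc"
    start, rules = repair(text)
    print("start rule:", start)
    print("rules:", rules)
    print("size:", grammar_size(start, rules), "vs input length", len(text))
```

The grammar derives exactly the input string and nothing else, which is the
defining constraint of the problem. The greedy choice gives no guarantee of
minimality, and counting overlapping pairs (as in "aaa") can overestimate how
often a pair can actually be replaced.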


   - Research-oriented master's student or PhD candidate in computer science
   - Knowledge of standard text algorithms and data structures
   - Knowledge of formal grammars (for example, a course covering the
   Hopcroft & Ullman book or equivalent)
   - Knowledge of statistical machine learning applied to text is a strong
   plus
   - Fluency in either C, C++ or Java is a plus

The intern will work closely with researchers in a very international
environment, and will be strongly encouraged to produce scientific
publications.

*Duration*: 5-6 months
*Start Date*: March-April 2014

*Application instructions*

Informal inquiries are welcome and can be made at
matthias.galle at xrce.xerox.com .

To submit an application, please send your CV and cover letter to both
xrce-candidates at xrce.xerox.com and matthias.galle at xrce.xerox.com .

Ideally, you will also include in your CV people we can contact for letters
of recommendation.

