[Elsnet-list] ECML/PKDD 2012 Discovery Challenge on Large Scale Hierarchical Text Classification

Ion Androutsopoulos ionandr at gmail.com
Mon Apr 2 21:09:38 CEST 2012

ECML/PKDD 2012 Discovery Challenge: Third Challenge on Large Scale 
Hierarchical Text Classification

                   Web site: http://lshtc.iit.demokritos.gr/
                     Email: lshtc_info at iit.demokritos.gr

This year's discovery challenge hosts the third edition of the 
successful PASCAL challenges on large scale hierarchical text 
classification. The challenge comprises three tracks and it is based on 
two large datasets created from the ODP web directory (DMOZ) and 
Wikipedia. The datasets are multi-class, multi-label and hierarchical. 
The number of categories ranges between 13,000 and 325,000 roughly and 
the number of documents between 380,000 and 2,400,000.

The tracks of the challenge are organized as follows:

1. Standard large-scale hierarchical classification
   a) On collection of medium size from Wikipedia
   b) On a large collection from Wikipedia
2. Multi-task learning, based on both DMOZ and Wikipedia category systems
3. Refinement-learning
   a) Semi-Supervised approach
   b) Unsupervised approach

In order to register for the challenge and gain access to the datasets you
must have an account at the challenge Web site.

Important dates:

- March 30, start of the challenge
- April 20, opening of the evaluation
- June 29, closing of evaluation
- July 20, paper submission deadline
- August 3, paper notifications

- Ion Androutsopoulos, AUEB, Athens, Greece
- Thierry Artieres, LIP6, Paris, France
- Patrick Gallinari, LIP6, Paris, France
- Eric Gaussier, LIG, Grenoble, France
- Aris Kosmopoulos, NCSR "Demokritos" & AUEB, Athens, Greece
- George Paliouras, NCSR "Demokritos", Athens, Greece
- Ioannis Partalas, LIG, Grenoble, France

More information about the Elsnet-list mailing list