Workshop on
CORPUS-BASED NATURAL LANGUAGE PROCESSING
17th Dec 2001-2nd Jan 2002
ANNA UNIVERSITY
CHENNAI
Organised by
The AU-KBC Research Centre, Anna University
Jointly with
The Language Technology Research Centre, IIIT Hyderabad
The National Centre for Software Technology, Mumbai
The School of Computer Science & Engineering, Anna University
&
The Tamil University, Thanjavur
The Background
The world is passing through an Information Revolution in which the ability
to generate disseminate, access and utilise information is becoming a key
determinant of development and progress, for individuals as well as nations.
Since humans express and absorb information and knowledge best through
their own ?natural? languages, language processing by machines has come
to the centre stage. Example applications are computers translating from
one human language to another, humans and computers communicating with each
other through written and spoken forms of human languages, etc.
This workshop will introduce the participants to some emerging techniques
of machine processing of natural languages based on Language Corpora, with
emphasis on Statistical Processing and Machine Translation(MT).
The Goals of the Workshop
The following are the goals of the Workshop:
i. Provide an understanding of NLP in the context
of Machine-Translation, Multilingual Information Retrieval and other applications
related to Indian Languages and English
ii. Introduce the use of Statistical Techniques on Lexical
Resources to refine rule-based methods.These will be used to develop
the NLP applications mentioned earlier.
iii. Provide training in the use of Tools and Resources in the
domain of Statistical processing
iv. Identify and define a set of problems to be pursued
by the participants in future in the context of the NLP of Indian
languages.
The Workshop Faculty
The Workshop would be conducted primarily by the following:
- Prof.Aravind K Joshi, University of Pennsylvania, USA
- Dr.B.Srinivas, AT&T Research, New Jersey, USA.
Additionally, faculty from the Organising Institutions
would also be contributing to the Workshop through lectures, labs and tutorials.
The Workshop Coverage
The following topics would be covered in detail in lectures, tutorials
and labs :
**Finite State Automata and Finite State Transducers
**Linguistic Formalisms and Computational Grammar
**Hidden Markov Models
**Parsing
**Machine Learning
**Word Sense Disambiguation
**Statistical Machine Translation
**Information Retrieval
In addition to the hands-on lab classes, the teams of
participants would also be involved in executing certain real-life
Projects as a part of the workshop, broadly towards realising an English-Indian
language MT solution, and/or a cross-lingual Information Retrieval
system. Necessary multilingual corpora and other resources are being created
,and is expected to be made available to the workshop ,in Tamil, Hindi and
Telugu, in addition to English. The training imparted should enable the participants
to pursue state-of-the-art NLP work in their home institutions .
The Target Audience
Persons belonging to the following two academic streams
would benefit from the workshop:
--Language and Linguistic streams.
--Science and Engineering streams such as Computers, Communications,
Maths and
Statistics.
Typically the following persons belonging to either of these two streams
could participate:
--Masters and Ph.D. students planning to specialise in NLP.
--Staff working in NLP-related Projects in Universities, Research labs
and Industry.
--Faculty members ,Researchers and Product Developers engaged in NLP
work, or intending to do so.
While some familiarity with NLP work would be most helpful, we
would also be admitting a limited number of candidates without any prior
exposure to it, with the condition that they undergo a Special Preparatory
Program of three days during the 14th-16th Dec. conducted at the Wokshop
venue. The background necessary to benefit from the Workshop would
be provided through this program.
The Workshop fees and other details
There would be a fee of Rs.5000/- per candidate, which
would cover the Workshop materials, Tea and Lunch. The candidates have to
make their own arrangements for travel ,stay, breakfast and dinner.
Assistance in arranging these would however be available from
the organisers on prior request, including limited on-campus accommodation.
Some fee reduction could be considered under exceptional circumstances
for a small number of candidates on their request, at the discretion of
the Organisers.
The application should give full details about the candidate?s
educational background and experience, with special emphasis
on NLP-related areas, if any. Also to be stated clearly is your computer
background,if any. Based on the information provided ,a decision would be
taken by the organisers on whether the candidate should go through the Special
Preparatory Program during the 14th-16th Dec. or not.
Your Application should reach the organisers by the 30th of
Sept. 2001.
On being selected, the Workshop fees should be paid by the 15th of Nov.2001
Payments are to be made in the name of "The Co-ordinator, NLP Workshop".
Correspondence
All correspondence to be addressed to:
The Co-ordinator,
NLP Workshop, AU-KBC Research Centre
M I T Campus of Anna University
Chromepet,Chennai-600044.
Ph.044-2232711/2234885
Fax:044-2213034.
e-mail: info4all@au-kbc.org
with "NLP Workshop" as the Subject.
We would prefer e-mail correspondence as much as possible.