- English Compound Noun Compositionality Dataset
- Hindi POS Tagger
- Hindi Dependency Parser
- Hindi WordNet in Python
- Kannada POS Tagger
- Telugu POS Tagger
- Indonesian and Malay Tools
POS Taggers, Corpora, Lemmatizers, Morph Analyzers for Indian Languages
Most of these tools were developed using the methods described in Reddy and Sharoff (2011, CLIA @ IJCNLP). Some of the taggers are built using cross-lingual resources and others using monolingual resources. Please read the README of each tool for additional information.
If you need resources for any other Indian languages, please contact me.
Indonesian and Malay Morphological Analyzer, Part-of-Speech (POS) Tagger, and Machine Translation System
With support from Sketch Engine, I have made a few contributions to the Apertium Indonesian-Malay language pair. All the tools can be downloaded from http://sourceforge.net/projects/apertium/files/apertium-id-ms/