- GraphParser: An Ungrounded and Grounded Semantic Parser
- English Compound Noun Compositionality Dataset
- Hindi POS Tagger
- Hindi Dependency Parser
- Hindi WordNet in Python
- Kannada POS Tagger
- Telugu POS Tagger
- Indonesian and Malay Tools
GraphParser: An Ungrounded and Grounded Semantic Parser
Graph Parser is a semantic parser which converts Natural Language Sentences/Questions to predicate-argument graphs, which can in-turn be converted to logical queries and executed on Freebase knowledge-graph. Download the code. An improved version will be released in February 2015. Please read more about it in our paper Large-scale Semantic Parsing without Question-Answer Pairs.
POS Taggers, Corpora, Lemmatizers, Morph Analyzers for Indian Languages
Most of these tools are developed by the methods described in Reddy and Sharoff (2011, CLIA @ IJCNLP). Some of the taggers are built using cross-lingual resources and some using mono-lingual resources. Please read corresponding README's of each tool for additional information.
If you need resources for any other Indian languages, please contact me.
Indonesian and Malay morphological analyzer, part-of-speech (POS) tagger, Machine Translation System
With support from Sketch Engine, I have made few contributions to the Apertium Indonesian-Malay language pair. All the tools can be downloaded from http://sourceforge.net/projects/apertium/files/apertium-id-ms/