Insight Language Machine

The Malayalam
Lexical Universe

മലയാള ഭാഷയുടെ ആഴം, 17 ഭാഷകളിൽ

A diachronic lexical knowledge graph spanning classical to contemporary Malayalam. 136,629 lemmas, 122,230 senses — built for AI training, NLP research, and linguistic scholarship.

മലയാളംEnglishहिन्दीالعربيةதமிழ்ಕನ್ನಡతెలుగుसंस्कृतবাংলাमराठीاردوଓଡ଼ିଆDeutschFrançaisРусский中文日本語

Knowledge Graph

Concept-centric architecture modelled on WordNet and ConceptNet. Lemmas, senses, relations, and etymological layers — all linked.

Diachronic Depth

From Gundert's 1872 classical dictionary to contemporary Malayalam. Traces semantic drift across 150 years of the language.

AI-Ready

JSONL format, HuggingFace-hosted, gated-access. Built for fine-tuning language models, NLP pipelines, and academic citation.