writing

long-form pieces on the thing i keep coming back to: getting models to produce structure you can actually trust.

a 4-part series · neuro-symbolic LLM systems

Creating the first morphological analyser for Spoken Tamil

Spoken Tamil is hard for language models to understand. Come see why, and how to get an LLM to break a Spoken Tamil word into its parts without inventing parts that don't exist. The series builds from “what is this even?” to a working neuro-symbolic system, traced end to end. The pages are interactive, so you don't need to read Tamil script or even know what morphology is.