Ranit Karmakar | Portfolio

Why My LLM Can’t Count the “R”s in Strawberry?

April 11, 2025 · 8 min read · AI LLMs Limitations Tokenization

Nephew vs LLM — A game my 6-year-old nephew wins against frontier LLMs.

TL;DR: Your LLM can’t reliably count the r’s in strawberry because its entire pipeline, from tokenization to vectorization to attention, optimizes for meaning over mechanics. Counting is a symbolic operation that requires character-level access and an explicit algorithm; standard LLMs have neither by default. Give them character-level visibility or a small tool to do the job, and the problem largely disappears.
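To make the "small tool" idea concrete, here is a minimal sketch in Python. The function names `count_letter` and `spell_out` are hypothetical and chosen only for illustration; they are not part of any LLM library. The first does the counting deterministically, the kind of job a model could hand off to a tool. The second spaces the letters out, which is one simple way to give a model character-level visibility, on the assumption that spaced-out letters are more likely to be tokenized individually.

```python
def count_letter(text: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in text."""
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    return text.lower().count(letter.lower())


def spell_out(text: str) -> str:
    """Put a space between characters so each one is visible on its own.

    Assumption: a tokenizer is more likely to treat spaced-out letters as
    separate tokens, but this depends on the tokenizer in use.
    """
    return " ".join(text)


print(count_letter("strawberry", "r"))  # 3
print(spell_out("strawberry"))          # s t r a w b e r r y
```

The counting itself is trivial for ordinary code because the string is a sequence of characters. The model, by contrast, sees token IDs rather than letters.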

Ranit Karmakar, PhD
