Krish Dalsaniya codeW-Krish

What I do

Software engineer. Stack-agnostic. I build applied AI systems end to end not demos, not notebooks, not "proof of concepts" that live permanently in a Notion doc. The full problem: architecture decisions, backend design, user experience, pipeline engineering, cost routing, retrieval accuracy, and making sure the thing keeps working once real users get their hands on it. The model is one component. The system around it is the actual work.

Currently deep in Vision Transformers after finishing a structured empirical comparison between CNNs and ViTs that might become a paper. Also building production LMS and document AI products where the actual engineering challenge is the system design around the model, not the model itself.

What I'm building

DocuChat — Multi-document RAG with real-time passage highlighting and character-level source attribution. Next.js + Flask microservice + PHP CodeIgniter. Live →
UpSkill AI — AI course platform that generates structured content and gates progression behind quiz checkpoints. Multi-provider LLM routing (Groq + Cerebras + Gemini). 50 subtopics in 24 seconds. Walkthrough →
CNN vs ViT Study — Empirical comparison across CNN baselines, ResNet variants, and Vision Transformer (scratch + pretrained) on CIFAR-10. 95.4% with fine-tuned ViT. Working toward a paper.

Stack

GitHub Stats

Commits are the honest number. Stars are a marketing metric. The rest catches up when the products ship.

What I find interesting

Why most AI product demos are impressive and most AI products are not
The gap between "it works in a notebook" and "it works at 3am under real user load"
Building things I wish existed, not because the idea is unique, but because I want them to exist
Reinventing the wheel on purpose to understand the architecture from scratch, then putting something new on top. The wheel isn't the goal. What you build after understanding it is
Why ViTs beat CNNs at scale but CNNs still win on small datasets — and what inductive bias has to do with it
System design decisions that cut LLM API costs without touching output quality (the UpSkill video pipeline is a good example of this)
Character-level source attribution and how it changes the trust dynamic in RAG systems
Stay curious. Learn new things. Pick up whatever the problem needs, not whatever you already know

Currently

Agent architectures → what actually makes multi-step reasoning reliable
CNN vs ViT paper → writing up findings, might publish
Vision Transformer training run → in progress
LLM inference internals → KV cache, speculative decoding
Always something new on the side — that's the point

Let's talk

If you're building something where the hard parts are system design, backend architecture, or the AI layer that has to actually work in production or if you just want to talk about something interesting you're working on, reach out.

_{Building at the intersection of "how does this actually work" and "how do we make this actually ship."}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Krish Dalsaniya codeW-Krish

Achievements

Achievements

Highlights

Block or report codeW-Krish

What I do

What I'm building

Stack

GitHub Stats

What I find interesting

What I find interesting

Currently

Let's talk

Pinned Loading

Uh oh!