Tech | Chirag Hasija

Mar 2026 • Tech

Running LLMs Locally with Ollama in 2026 - A Complete Guide

Running LLMs on your own hardware has gone from a novelty to a legitimate production strategy. Ollama turned what used to require a PhD in …

Mar 2026 • Tech

Stop Copy-Pasting from ChatGPT - How to Actually Learn from AI Code

There is a pattern that has become endemic among developers who use AI tools: paste a problem into ChatGPT, copy the output, run it, fix the …

Mar 2026 • Tech

The AI Agent Framework Landscape in 2026 - LangChain, CrewAI, Claude Agent SDK, and What Actually Works

Every AI startup in 2025 shipped an “agent” demo. Most of those agents broke in production within the first week. The gap between a …

Mar 2026 • Tech

The AI Coding Workflow That Senior Engineers Actually Use in 2026

The gap between engineers who use AI effectively and those who do not is no longer about prompt cleverness. It is about workflow structure. …

Mar 2026 • Tech

The CLAUDE.md Guide That Actually Makes Claude Code Useful in 2026

Claude Code reads a file called CLAUDE.md before every interaction. This file is the difference between an AI that writes code you …

Mar 2026 • Tech

The Complete Guide to WebRTC in 2026 - P2P Video That Actually Works

WebRTC promises peer-to-peer video, but the reality involves STUN servers, TURN relays, and architecture decisions that determine whether …

Mar 2026 • Tech

The Context Window Is the Most Expensive Resource in AI Coding - How to Manage It

Every AI coding session has a hidden resource that most developers ignore until it breaks: the context window. It is the total number of …

Mar 2026 • Tech

The MCP Servers Every Developer Should Install for Claude Code in 2026

MCP (Model Context Protocol) servers give Claude Code the ability to interact with external systems - browsers, databases, APIs, cloud …

Mar 2026 • Tech

The Real Cost of Running LLMs in Production in 2026

Every team building with LLMs eventually hits the same wall: the demo costs $0.02 per request, but production costs $0.50. The gap between …

Mar 2026 • Tech

Vector Databases in 2026 - Pinecone vs Weaviate vs pgvector Compared

The vector database market exploded in 2023, consolidated through 2024-2025, and has now settled into a clear hierarchy. If you are building …

Mar 2026 • Tech

When NOT to Use AI for Code - The Tasks That Still Need a Human in 2026

AI code generation in 2026 is genuinely impressive. Models write working implementations, catch bugs, refactor with precision, and produce …

Mar 2026 • Tech

Why Every Backend Team Is Moving to Event-Driven Architecture in 2026

Event-driven architecture is not a buzzword anymore. It is how teams decouple services, handle eventual consistency, and build systems that …

Mar 2026 • Tech

Why SQLite Is Replacing Postgres for More Use Cases Than You Think

The conventional wisdom is simple: SQLite is for development and mobile, Postgres is for production. This was true for a long time. It is …

Mar 2026 • Tech, System Design

The Real Cost of Running Production Systems

You just shipped a new service. It works. The demo went great. Leadership is happy. Then the bill arrives - not just the AWS invoice, but …

Mar 2026 • Tech, System Design

PostgreSQL Is All You Need (Until It Isn't)

You’re three months into a new project. You have a PostgreSQL database for your core data, Redis for caching, Elasticsearch for search, …

Mar 2026 • Tech, System Design

System Design Roadmap - What to Learn in What Order

You have 47 browser tabs open. One is a YouTube video on consistent hashing. Another is a blog post about CAP theorem. Somewhere in the mix …

Mar 2026 • Tech, Ai

How to Integrate the ChatGPT API - A Complete Guide for Developers

You’ve used ChatGPT through the web interface. Now you want to build it into your own app - a customer support bot, a code review tool, a …

Mar 2026 • Tech, System Design

How Redis Went from Single-Threaded to 3.5 Million Ops/Sec

“Redis is single-threaded.” You’ve heard this in every system design interview, every blog post, every tech talk. It was true in 2009. It’s …

Mar 2026 • Tech, System Design

Redis Internals - Clustering, Sentinel, Sharding, and Pipelining Explained

You spin up a single Redis instance, throw your session data in it, and everything works great. Then your app grows. One day Redis goes down …

Mar 2026 • Tech, System Design

Database Ops/Sec and Memory Limits - When to Shard and When Not To

You’re in a system design interview. You say “we’ll use PostgreSQL” and immediately follow it with “and we’ll shard it across 16 nodes.” The …