Discover in-depth
technical insights
Notes on system design, DevOps, and software engineering β written as I learn.

Deep Understanding of AI Large Language Models π€
Join us on a deep dive into the architecture, mathematics, and implementation of modern Large Language Models (LLMs)....

Token Embedding π’
Token embedding is the foundational step in any Natural Language Processing (NLP) pipeline. It involves converting di...

Text to Numbers π’
Before an AI model can

10. Translating Between Tokenizers οΏ½οΏ½
name=

Preparing Text for Tokens π
In real-world applications, data rarely arrives in a clean, perfectly formatted state. Before we can convert text to ...

Coding Challenge: Make a Tokenizer π οΈ
Now it

Coding Challenge: Tokenizing
In this challenge, we step up our game by processing a full-length book: H.G. Wells

5. Byte Pair Encoding (BPE) Concepts π§¬
Byte Pair Encoding (BPE) is a cornerstone of modern NLP. It

Coding Challenge: Byte Pair Encoding (BPE) π§¬
In this challenge, we implement a manual loop for Byte Pair Encoding (BPE). BPE is a subword tokenization method that...

GPT-4 Tokenizer
name=

BERT Tokenizer
name=

9. CodeChallenge: BERT Character Counts π§©
name=

Containers
- Containers provide a way of creating an isolated environment, sometimes called a sandbox, in which applications and...

JavaScript Interview Masterclass π
This is a comprehensive guide to JavaScript interview questions, mirrored directly from the Top JavaScript Interview ...

π JavaScript Complete Refresher
---

React Interview Masterclass βοΈ
This is a comprehensive guide to React interview questions, mirrored directly from the Top React Interview Questions ...

βοΈ React Complete Refresher
---

Git Basics
- Version control is a system that records changes to a file or set of files over time so that you can recall specifi...

Git Branches
What is the use of Branches?

Git for Professionals
- File staging helps in better commits, (easier to manage, review code and adding commit messages).

Configuration Management
- It helps to maintain infrastructure using software (like providing resources)

Pod
- Pods are the smallest deployable units of computing that you can create and manage in Kubernetes.

Worker Node
A Kubernetes cluster consists of a set of worker machines, called nodes, that run containerized applications.

Please edit the object below. Lines beginning with a
Minikube is a single node k8s cluster which can be run on virtual box in local machine (used for testing purposes).

Kubernetes Configuration File
Each configuration file has 3 parts:

Mongo Express K8s Cluster
$ kubectl get all

Intro
- Code has zero autonomy and is 100% deterministic

Elements
- The State is a shared data Structure that holds current information or context of an application.

Drafter Agent π
- Our company is not working efficiently! We spend way too much time writing emails and documents and this need to be...

Reflection Agent π§
1. Basic Reflection Agents

Reflexion Agent π§
Reflection Agent System consists of a generator and a reflector component

Hello World Graph π
1. Understand and define the AgentState structure

Multiple Input Graph π’
1. Define a more complex AgentState

Sequential Graph βΎοΈ
1. Create multiple Nodes that sequentially process and update different parts of the state.

Conditional Graph π§
1. Implement conditional logic to route the flow of data to different nodes.

Looping Graph π
1. Implement looping logic to route the flow of data to different nodes.

Simple Bot Agent π€
1. Define state structure with a list of Human Message objects.

Chatbot Agent π§
1. Use different message types - HumanMessage and AIMessage.

ReAct Agent π±
Think

MERN Stack Tutorial
This includes Repository includes my personal notes, and cover important topics that you need to know about MERN stac...

Node JS
- Node JS Installer

Express JS
Docs : https://expressjs.com/en/guide/routing.html

Mongo DB
- MongoDB is a source-available cross-platform document-oriented database program.

RESTful API
Mozzila Docs

Rust
Rust is a general-purpose programming language emphasizing performance, type safety, and concurrency. It enforces mem...

Introduction
fn main() {

Traits
pub struct NewsArticle {

Lifetimes
- A dangling reference is a pointer that references a location in memory that may have been given to someone else, or...

Testing
- create a new library project named adder.

Mini Grep CLI
use std::env;

Closures
- Clousures are like functions, expect that are anonymous that can be stored at variables

Iterators
- Iterators are provided by the standard library

Smart Pointer
- A pointer is a variable that stores the memory address, which points to any other data in the memory.

Concurrency
- Concurrency programming is different parts of your program executes

Programming Concepts
fn main() {

Memory Management
Ways to manage memory:

Structs
- Allows grouping related data of different types

Enums
- Enums are a way to define a type by enumerating a list of variants.

Module System
- cargo new creates a new package

Vectors
fn main() {

Error Handling
- Rust has a robust error handling system

Generics
- Generics are a way to define functions, structs, enums, and methods that work with any type

SQL for Data Analytics π
Welcome to the comprehensive guide on using SQL for data analytics. This series covers everything from the basics of ...

Page 1: Introduction β Why SQL Is the Analyst
SQL is the cornerstone of data analysis. While newer tools and languages emerge, SQL remains the universal language o...

Page 2: Preparing Your Data β The Foundation of Good Analysis ποΈ
The quality of your analysis is directly proportional to the quality of your data. Data preparation, often called

Page 3: Time Series Analysis β Tracking Trends Over Time π
Time is perhaps the most important dimension in any analytical model. Understanding how metrics evolve over days, mon...

Page 4: Cohort Analysis β Understanding Groups Over Time π₯
A cohort is a group of users who share a common characteristic over a specified period. The most common cohort is bas...

Page 5: Text Analysis with SQL β Mining Meaning from Words π
While SQL isn

Page 6: Anomaly Detection β Finding What Doesn
Anomaly detection is the process of identifying data points that deviate significantly from the

Page 7: Experiment Analysis β Testing Hypotheses with SQL π§ͺ
A/B testing is how modern companies make decisions. While statisticians use specialized tools, the core data for expe...

Page 8: Building Complex Data Sets β Advanced SQL Techniques βοΈ
As projects grow, your SQL becomes more complex. Maintaining readability, performance, and accuracy requires moving b...

Large Scale Deployment π
Deployment is not a one-time activity; it is a continuous process of delivering value through upgrades and bug fixes....

System Design ποΈ
Master the core concepts of scalable, reliable, and efficient system design.

Software Architecture Patterns ποΈ
Software architecture is the high-level structure of a software system. It defines the components, their relationship...

Event-Driven Architecture (EDA)
Event-Driven Architecture (EDA) is a software design pattern where decoupled components communicate through events ra...

Microservices Architecture π
Microservices architecture is a software design pattern where applications are structured as a collection of small, l...

Multi-Tier Architecture
Multi-Tier Architecture is a software design pattern that structures applications into multiple physical or logical l...

Design a Chat Application (WhatsApp) π¬
Designing a real-time chat application requires handling millions of persistent connections, ensuring sub-second mess...

Design a News Feed (aka Twitter) π±
A news feed is a constantly updating list of posts and updates from people you follow. It is the core feature of plat...

Design a Notification System π
A notification system delivers critical updatesβlike messages, shipping alerts, or security codesβacross multiple cha...

Design a Ticketing System (BookMyShow) ποΈ
A ticketing system is a high-concurrency platform that allows users to browse, book, and manage tickets for massive e...

Design a URL Shortener (TinyURL) π
A URL shortening service converts long URLs into short, unique links for easy sharing and tracking.

Networking & Communication π
Explore the fundamentals of how systems communicate over a network, from basic client-server models to advanced globa...

What is the Client-Server Model?
The client-server model is a computing model where clients request services and servers provide them. This is the fou...

Networking Basics & IP Addresses
Networking is the backbone of any distributed system. Every system relies on data exchange between components to enab...

DNS in Distributed Systems
DNS (Domain Name System) is the

Communication Patterns
Understanding how data travels between systems is crucial for designing scalable applications.

What is a Proxy?
A proxy is an intermediary server that acts as a gateway between a client and another server (usually on the internet...

Load Balancing
Load balancing is the process of distributing incoming network traffic across multiple backend servers to ensure effi...

API Gateway
An API Gateway is a server that acts as an intermediary between clients and backend services. It serves as a <Highlig...

Content Delivery Networks (CDNs)
A Content Delivery Network (CDN) is a globally distributed network of servers that work together to deliver internet ...

System Performance: Speed, Capacity & Efficiency β‘
Performance is a multi-dimensional goal. It

Caching for Speed & Optimization π°
Caching is the technique of storing frequently accessed data in a faster storage layer (usually in-memory) to reduce ...

Concurrency & Parallelism π§΅
In modern system design, the terms Concurrency and Parallelism are often used interchangeably, but they represent two...

Database Performance Optimization ποΈ
Optimizing database performance is a critical task in system design. As data grows, the way we store, retrieve, and m...

Performance Measurement: SLAs, SLOs & Percentiles π
To optimize performance, you must first be able to measure it accurately. Averages are often misleading in distribute...

Messaging & Queues for Decoupling π¬
In distributed systems, components often need to communicate without being tightly coupled. Asynchronous messaging al...

Performance Testing & Monitoring π
Performance isn

Network Protocols π‘
Protocols are the sets of rules that govern how data is transmitted across a network. Explore the fundamental protoco...

TCP vs. UDP: The Core Protocols
---

Introduction to HTTP
---

REST & RESTful API Design
---

Introduction to Real-Time Communication
---

Modern API Protocols: Beyond REST
---

System Reliability π‘οΈ
Reliability is the ability of a system to operate continuously without failure for a specified period and under certa...

Backup & Recovery πΎπ
No matter how highly available or fault-tolerant a system is, data loss can still occur due to extreme circumstances....

Disaster Recovery (DR) πͺοΈπ‘οΈ
While High Availability (HA) keeps the system running through minor failures (like a single server crash), Disaster R...

High Availability, Fault Tolerance & Failover ποΈ
Building a reliable system at scale requires achieving High Availability (HA) and Fault Tolerance. This ensures that ...

Scalability in System Design π
Scalability is the ability of a system to handle an increasing amount of work, or its potential to accommodate growth...

Autoscaling: The Elastic Power of Cloud βοΈ
Autoscaling is the automatic adjustment of compute resources based on real-time load. It ensures that your applicatio...

Load Balancing: The Traffic Controller βοΈ
Load balancing is the critical process of distributing incoming network traffic across a group of backend servers, al...

Scaling Strategies: Horizontal, Vertical & Diagonal π
Choosing the right scaling strategy is one of the most important decisions in system design. It determines how your a...

Authentication & Authorization ππ
In any distributed system, the first line of defense is ensuring that users are who they say they are (Authentication...

Data Protection & Secure Communication ππ‘
In a world of constant data breaches and strict regulations (GDPR, HIPAA), protecting data at every stage of its life...

Network & Infrastructure Security ππ§±
Building a secure distributed system requires more than just encrypting data and checking passwords. You must also pr...

Storage Basics: The Foundation of Data ποΈ
In system design, storage is not just about

The CAP Theorem: Balancing Distributed Data βοΈ
In any distributed system, you can only fully guarantee two out of the following three properties at any given time. ...

Database Scaling, Replication & Sharding π
As applications grow from thousands to millions of users, a single database server becomes a bottleneck. To handle ma...

Distributed File Systems & Big Data π
A File System defines how data is stored and retrieved on disk. While traditional systems (like NTFS or ext4) are gre...

Object Storage: Scalable Data for the Modern Web π¦
Object Storage is a storage architecture that manages data as discrete units called

SQL vs. NoSQL: Choosing the Right Database ποΈ
A database is a structured way to store, retrieve, and manage data. It is a core component of backend systems, servin...

Linux & Terminal Commands
A shell is a command-line interface which allows to perform certain tasks using command

Should you start a startup ?
Should You Start A Startup? | Startup School

How to Get and Evaluate Startup Ideas ?
How to Get and Evaluate Startup Ideas | Startup School

Startup Business Models and Pricing
Startup Business Models and Pricing | Startup School

How to get First Customers
How to Get Your First Customers | Startup School

How to build an MVP
β βLaunch Quickly, then Iterateβ

The Best Way to Launch your Startup
The Best Way To Launch Your Startup | Startup School

Tips for Technical Startup Founders
- Evolve into leading a small team of engineers