Concepts
Concepts
AI & agentic AI explained — plain-language entries for newcomers and intermediates.
AI Foundations
- What is AI, ML & Deep Learning?AI, machine learning, and deep learning are nested circles, not synonyms — and which one you are looking at predicts how a system fails.
- Neural Networks, IntuitivelyA neural network is a big stack of adjustable knobs that turns numbers into numbers; learning is just nudging those knobs toward less error.
- What Is a Large Language Model?An LLM is a huge next-token predictor; scale turned that one simple objective into abilities nobody explicitly programmed.
- Training vs InferenceTraining builds the model’s frozen weights once; inference runs them per request and never changes them — which answers most cost and privacy questions.
- Tokens & TokenizationModels see integers, not text; the hidden tokenization step explains your bill, your context limit, and odd failure modes.
- Embeddings: Meaning as GeometryEmbeddings turn things into points in space so that similar becomes close — the engine behind search, recommendations, and RAG.
- Transformers, at a High LevelSelf-attention lets every token look at every other token; that one idea fixed long-range memory and unlocked GPU-scale training.
- Generation & Sampling: TemperatureThe model returns a probability distribution, not an answer; temperature reshapes it, and temperature 0 is low-variance, not deterministic.
Agentic AI
- What Is an AI Agent?An agent is a model placed in a loop with tools, choosing each next action toward a goal — the core mental model.
- The Agent LoopReason → act → observe → repeat: tracing a tool call from the model through the harness into the environment and back.
- Autonomy LevelsA five-rung ladder from suggest to fully autonomous, and why the right level is a per-action engineering decision.
- Agents vs Chatbots vs WorkflowsOne question — who decides the next step — sorts any LLM system into chatbot, pipeline, workflow, or agent.
- Tools, Actions & EnvironmentsWhat a tool really is, read vs write actions, and why the environment — not the model — is where agents become dangerous.
- Goals, Planning & TerminationThe planning spectrum from reactive to deliberative, and the under-appreciated hard problem of knowing when an agent is done.
- When to Use an AgentThe three properties a task needs to justify an agent, the cheaper patterns that solve most cases, and clear do-not cases.
- Risks & Limits of AgentsThe four characteristic loop failure modes, the security shift autonomy brings, and what "safe agent" honestly means.
Building Blocks
- Prompting basicsThe four levers that move output quality: instruction, context, examples, output shape.
- System vs user promptsMessage roles, the instruction hierarchy, and never letting data act as instructions.
- Few-shot prompting & examplesWhen examples beat instructions, how to choose/order them, and where they stop paying off.
- Context windows explainedThe finite shared token budget, the three limit failures, and managing it actively.
- Tool / function calling explainedThe model proposes, your code disposes: the request/response shape, the loop, the safety rules.
- Retrieval-augmented generation (RAG) explainedRetrieve→augment→generate, RAG vs alternatives, and debugging it in two halves.
- Chunking & vector search intuitionWhy we chunk, embeddings as coordinates, nearest-neighbour search, hybrid + reranking.
- Structured outputsFrom "ask for JSON" to schema-constrained decoding, plus schema design and defensive parsing.
AI Ecosystem
- The model landscape: families & providersA vendor-neutral map of the major model families and a durable mental model for placing any new release.
- Open-weight vs closed modelsControl, cost, privacy, licensing and lock-in — the real engineering trade-offs, without the marketing.
- Modalities & multimodal modelsText, vision, audio, code: input vs output modalities and why "multimodal" is a spectrum, not a checkbox.
- Cost, quality & latencyModel size and the trade-off triangle that dominates production model economics, and how to engineer around it.
- Reasoning vs non-reasoning modelsWhat inference-time "thinking" actually does, when it helps or wastes money, and why it is now a dial.
- Agent frameworks & orchestrationLangChain, LlamaIndex, provider SDKs and the broader landscape — by category and trade-off, not by brand.
- Serving & access: APIs, local, gatewaysModel choice and serving choice are orthogonal: first-party APIs, cloud catalogs, inference providers, self-host, gateways.
- Reading benchmarks criticallyWhy leaderboard rank rarely predicts your task, and why a small custom eval set beats every public number.
- Choosing a model: a checklistA repeatable, constraint-first decision procedure that synthesizes the whole topic and survives a fast-moving field.