Reference

Agent-building glossary

The core vocabulary for building real AI agents — in plain English.

Agent
An LLM given a goal, tools, and a loop: it reasons, calls tools, observes the results, and repeats until the task is done.
Tool
A function an agent can call — search, a calculator, an API. The model emits a structured call; your code runs it and returns the result.
Tool call
The model's structured request to invoke a tool, with arguments — the bridge between reasoning and real action.
Context window
The finite span of tokens a model can attend to at once; everything the agent “knows” in a turn must fit inside it.
Memory
State an agent carries across turns or sessions — facts, preferences, history — beyond what fits in the context window.
RAG
Retrieval-Augmented Generation: fetching relevant documents (usually by embedding similarity) and adding them to the prompt so the model answers from real, current data.
Embedding
A vector representation of text where similar meanings sit close together, enabling semantic search.
Vector store
A database of embeddings that returns the nearest matches to a query vector — the retrieval half of RAG.
Planning
Decomposing a goal into ordered steps before or while acting, so an agent handles complex tasks reliably instead of improvising.
Multi-agent
Multiple specialized agents that hand off or collaborate — e.g. a planner, a coder, and a reviewer.
Eval
A repeatable test of agent quality on held-out cases — the difference between “seems to work” and “works.”
LLM-as-judge
Using a model to score another model's output against a rubric — for evals at scale where exact-match comparison won't do.
Guardrails
Input and output checks that constrain what an agent accepts or emits — validation, filtering, and refusals.
Prompt injection
An attack where untrusted input smuggles in instructions that hijack the agent — the top safety risk for tool-using agents.
Human-in-the-loop
Requiring human approval before an agent takes a consequential action, like a payment or a delete.
Hallucination
A confident but fabricated answer; mitigated with retrieval, tool use, and evals.
System prompt
The standing instructions that define an agent's role, tools, and constraints on every turn.
Token
The unit a model reads and writes (roughly ¾ of a word); cost and context limits are both measured in tokens.
MCP
Model Context Protocol — an open standard for exposing tools and resources to agents so any client can use them.
Temperature
A sampling control: low values make output focused and deterministic, high values make it more varied and creative.

← The Agent Marketplace  ·  Learn to build these, live →