L37 · Operator~25 min · free · cc-by 4.0

Embeddings: meaning as numbers

An embedding is a list of numbers that captures the meaning of text · learn the shape and you unlock semantic search, deduplication, and clustering.

::TL;DR · the whole lesson in three lines

MOVEAn embedding is a list of numbers that captures the meaning of text · learn the shape and you unlock semantic search, deduplication, and clustering.
DRILLYou will embed a small set of your own text snippets and run a real semantic search to feel how meaning-distance behaves.
WINYou ran an embeddings API call from your own machine.

jump to drill ↓or read the full concept first →

::concept · what's actually happening

An embedding is a fixed-length vector (usually 256 to 3072 numbers) produced by a model that has been trained to put similar meanings near each other in that high-dimensional space. The number 0.42 has no meaning on its own · the geometric distance between two embeddings is what matters.

read full concept · 4 more paragraphs →

Semantic search is the canonical use · you embed every document, embed the query, and find the documents whose embeddings sit closest to the query's embedding. The match is by meaning, not by literal word match. 'How do I cancel my subscription' finds the doc titled 'Account closure procedures' even though they share no keywords.

Beyond search, embeddings unlock dedup ('these two support tickets are basically the same complaint'), clustering ('group my customers' open-text feedback by theme'), classification ('is this incoming email more like a support ticket or more like a sales inquiry'), and anomaly detection ('this log message is unusually far from anything we have seen before').

Embedding models are not free · API calls cost real money at scale, and embedding 100K documents adds up. The good news: embeddings are computed once and cached forever (until the underlying model changes), so the cost amortizes across every future query.

The mental model that helps: think of embeddings as a coordinate on a vast meaning-map. Words with similar meanings cluster together. Documents about cooking sit near each other. Documents about plumbing sit elsewhere. You are doing geography on meaning.

::drill · do the thing

You will embed a small set of your own text snippets and run a real semantic search to feel how meaning-distance behaves.

::L37 drill · copy-paste into any AI chat

I want to feel how embeddings work with my own data. Walk me through the smallest possible end-to-end demo: 1) recommend one specific embedding model + API I should use for hobby-scale work (cost-aware), 2) give me a 20-line Python (or Node, whichever I pick · I prefer [LANGUAGE]) script that embeds these five short text snippets I will paste in: [SNIPPET 1 · e.g. about a topic] [SNIPPET 2 · related topic] [SNIPPET 3 · unrelated topic] [SNIPPET 4 · related to 1 and 2] [SNIPPET 5 · totally different domain], 3) computes which snippet is most similar to a query I will provide, 4) prints the similarity scores. Include the exact pip/npm install command. No 'just use LangChain' · I want to see the actual API call.

I want to feel how embeddings work with my own data. Walk me through the smallest possible end-to-end demo: 1) recommend one specific embedding model + API I should use for hobby-scale work (cost-aware), 2) give me a 20-line Python (or Node, whichever I pick · I prefer [LANGUAGE]) script that embeds these five short text snippets I will paste in: [SNIPPET 1 · e.g. about a topic] [SNIPPET 2 · related topic] [SNIPPET 3 · unrelated topic] [SNIPPET 4 · related to 1 and 2] [SNIPPET 5 · totally different domain], 3) computes which snippet is most similar to a query I will provide, 4) prints the similarity scores. Include the exact pip/npm install command. No 'just use LangChain' · I want to see the actual API call.

::or open one in a new tab — then paste

Claude↗ChatGPT↗Gemini↗

::steps

01Pick 5 short text snippets you have lying around (notes, ticket titles, emails).
02Run the prompt with your language preference filled in.
03Install the dependencies and run the script.
04Pick a query and see which snippets score highest.
05Try a query you would NOT expect to match anything · see what scores 0.2 vs 0.8.
06Note: this is the same primitive that powers most production AI search.

::outcome · what should be true

You ran an embeddings API call from your own machine.
You saw real similarity scores for a real query against your own data.
You can articulate the difference between keyword search and semantic search.
You know what one embedding API call costs you (likely a fraction of a cent).

::trap · the most common failure

Operators learn the concept of embeddings but never compute one. The concept feels obvious until you see your two 'related' snippets score 0.31 and have to think about why · only then do you understand what the model considered similar.

::end of the curriculum

You're at Pilot level. There's no Level 6.

The next move is doing the work, not another lesson. If you want operator-grade infrastructure, that's /orangebox. If you want the lab's working journal, /founders-view. If you want to collaborate on the curriculum itself, the source is public on GitHub.

::other lessons at Operator level

L10~30 min

← back to /learn full lesson library →

Embeddings: meaning as numbers

You're at Pilot level. There's no Level 6.

Local AI · Ollama — privacy, offline, and the limit of free

Model routing — switching between Claude, GPT, Gemini mid-task

MCP servers — the plug socket that turned AI into a real tool

Agent mode — when AI takes action, not just answers

Computer use — when AI takes the mouse and keyboard

What AI cannot replace — taste, judgment, relationships

Agents 101: model plus tools plus loop

MCP: structured tools for AI

Skill primers: teach a session your context in 30 seconds

Local models with Ollama

Vision models: when to use them

Audio and Whisper transcription

RAG vs long context: when to retrieve, when to dump

Fine-tuning vs prompt engineering

AI safety in personal use

Multimodal prompting: combining text, image, audio

Chain-of-thought: making the model show its work

Tool use and structured output

Cost optimization: tokens, caching, model selection