built throughORANGEBOX·see what it ships·$1 →

AtomEons / Learn / L33

L33 · Operator~30 min · free · cc-by 4.0

Local models with Ollama

Run Llama, Qwen, or Mistral on your own laptop · no API, no logs, no monthly bill for the work that should stay home.

::TL;DR · the whole lesson in three lines

  • MOVERun Llama, Qwen, or Mistral on your own laptop · no API, no logs, no monthly bill for the work that should stay home.
  • DRILLYou will install Ollama, pull one model, and run it on a real privacy-sensitive task you would otherwise have sent to the cloud.
  • WINOllama is running on your machine and a model file is downloaded.

::concept · what's actually happening

Ollama is a runtime that lets you download an open-weight model and run it locally with one command. You get a chat or API surface on localhost, your data never leaves the machine, and the model file lives in a folder you own. That is the whole pitch.

read full concept · 4 more paragraphs

Local models trade capability for sovereignty · a 7B-parameter model running on your laptop will not match GPT-5 or Claude on hard tasks. It will however match them on the simple, repetitive 80% of work where the cost of round-tripping to a cloud is the actual bottleneck.

The hardware math matters: a 7B model needs roughly 5-8GB of RAM, a 13B needs 12-16GB, a 70B needs 40GB+ or aggressive quantization. Apple Silicon with unified memory punches above its weight here. Modest hardware is fine for modest models.

Privacy-critical work has an obvious home here · medical questions, legal drafts, NDA-covered code, journal entries, anything you would not want logged to a cloud provider. The legal blast radius of 'my therapist's chat became training data' is too high for hosted models on sensitive content.

The honest limitation: open-weight models in 2026 are still behind frontier closed-weight models on hard reasoning, long-context coherence, and agentic tool use. The gap has narrowed but it has not closed. Use local for what local is good at.

::drill · do the thing

You will install Ollama, pull one model, and run it on a real privacy-sensitive task you would otherwise have sent to the cloud.

::L33 drill · copy-paste into any AI chat

Walk me through installing Ollama on [YOUR OS · macOS / Windows / Linux] and pulling one mid-tier model suitable for my hardware: I have [RAM AMOUNT] of RAM and [APPLE SILICON / NVIDIA GPU / CPU ONLY]. Recommend one specific model name to start with (pin the version), give me the exact pull command, and the exact run command to start a chat. Then give me one privacy-sensitive prompt I should try first to feel the difference · something I would not want logged to a cloud API. Skip the marketing about open source · just the install steps and the first real use.

::or open one in a new tab — then paste

::steps

  1. 01Run the prompt to get install steps tailored to your machine.
  2. 02Install Ollama and pull the recommended model (this takes 5-15 min on broadband).
  3. 03Run the model with `ollama run <model>` and confirm you get a prompt.
  4. 04Try the suggested privacy-sensitive task · feel the latency and quality.
  5. 05Compare the same prompt to a cloud model to calibrate the gap.
  6. 06Decide which of your recurring tasks will move local.

::outcome · what should be true

  • Ollama is running on your machine and a model file is downloaded.
  • You have completed one real task end-to-end with the local model.
  • You can articulate the capability gap versus the cloud honestly.
  • You have a written list of which tasks you will route local going forward.

::trap · the most common failure

Operators install Ollama, run 'hello world,' get bored, and never use it again. The win only shows up when you route a real task through it · usually a privacy-critical one. Without that first real use, local stays a demo.

::end of the curriculum

You're at Pilot level. There's no Level 6.

The next move is doing the work, not another lesson. If you want operator-grade infrastructure, that's /orangebox. If you want the lab's working journal, /founders-view. If you want to collaborate on the curriculum itself, the source is public on GitHub.

::other lessons at Operator level

L10~30 min

Local AI · Ollama — privacy, offline, and the limit of free

At Operator level you need an honest opinion about local-only AI. Even if you don't use it daily, you should have run it once.

L11~25 min

Model routing — switching between Claude, GPT, Gemini mid-task

Operators don't pick one AI. They route each task to the model that does it best. Knowing the strengths is the skill.

L15~25 min

MCP servers — the plug socket that turned AI into a real tool

Model Context Protocol is the standard plug. Knowing what plugs in changes what your AI can actually touch — your files, your inbox, your calendar, your repos.

L16~20 min

Agent mode — when AI takes action, not just answers

The frontier of useful AI is agents that DO things — browse, click, file, send. The actual skill is the safety pattern, not the magic.

L26~22 min

Computer use — when AI takes the mouse and keyboard

Claude in Chrome, ChatGPT Atlas, computer-use beta — the frontier is AI that drives your browser like a human. Knowing the safety pattern is the actual skill.

L27~22 min

What AI cannot replace — taste, judgment, relationships

The operators winning in 2026 are the ones who learned what AI is for and what is theirs. Knowing the line is more valuable than any prompt.

L30~20 min

Agents 101: model plus tools plus loop

An agent is a model with tools running in a loop until done · know when you need one and when you don't.

L31~25 min

MCP: structured tools for AI

Model Context Protocol is the USB-C of AI tooling · learn the shape before you wire anything.

L32~25 min

Skill primers: teach a session your context in 30 seconds

A skill is a reusable file that primes a fresh AI session with your project, voice, and rules · stop re-explaining yourself.

L34~20 min

Vision models: when to use them

Vision lets the model see images · powerful for screenshots and diagrams · weak for precise spatial work · know the line.

L35~25 min

Audio and Whisper transcription

Whisper turns audio into text · meetings, voice memos, interviews · the AI-era replacement for note-taking.

L36~25 min

RAG vs long context: when to retrieve, when to dump

RAG fetches the right slice of your data at query time · long context stuffs everything in · know which problem you actually have.

L37~25 min

Embeddings: meaning as numbers

An embedding is a list of numbers that captures the meaning of text · learn the shape and you unlock semantic search, deduplication, and clustering.

L38~20 min

Fine-tuning vs prompt engineering

For individuals, fine-tuning is almost never worth it · know exactly when it actually is.

L39~20 min

AI safety in personal use

PII, NDAs, financial data, and other people's secrets · know the rules of what you do not paste.

L40~20 min

Multimodal prompting: combining text, image, audio

The strongest prompts use the medium that fits the question · sometimes you describe, sometimes you show, sometimes you do both.

L42~15 min

Chain-of-thought: making the model show its work

Asking the model to reason step-by-step before answering raises accuracy on hard problems · know when it earns its cost.

L43~25 min

Tool use and structured output

Function calling makes the model return JSON your code can use · know the contract before you build on it.

L44~25 min

Cost optimization: tokens, caching, model selection

AI is metered · the operators who stay profitable measure what they spend and choose the model that fits the task.

::part of the AtomEons /learn curriculum · 45 lessons · 5 levels · cc-by 4.0

LAB · ATOMEONS · MARCO ISLAND FLÆONS RESEARCH · 12 PAPERS · CC-BY 4.0ORANGEBOX v1.0.0-beta · TURBO-OPTIMIZE CLAUDE · SHIPPED 2026-05-30B00KMAKR v3.2.0 · AI PUBLISHING COCKPIT · MAC + WINDOWSFREE LAUNCH WEEK · ENDS JUNE 6 · §4A NO-SAAS LOCKFOUNDER'S VIEW · NEXT BROADCAST IN ...CITE THE WORK · FORWARD THE LINK · NO ALGORITHMLAB · ATOMEONS · MARCO ISLAND FLÆONS RESEARCH · 12 PAPERS · CC-BY 4.0ORANGEBOX v1.0.0-beta · TURBO-OPTIMIZE CLAUDE · SHIPPED 2026-05-30B00KMAKR v3.2.0 · AI PUBLISHING COCKPIT · MAC + WINDOWSFREE LAUNCH WEEK · ENDS JUNE 6 · §4A NO-SAAS LOCKFOUNDER'S VIEW · NEXT BROADCAST IN ...CITE THE WORK · FORWARD THE LINK · NO ALGORITHM