Claude Code Multi-Agent Orchestration with Opus 4.6, Tmux and Agent Sandboxes

Source: https://youtu.be/RpUTF_U4kiw
Author: Andy Devdan
Duration: 24:02
Saved on: 2026-02-10

TL;DR

Andy demonstrates the new multi-agent orchestration feature in Claude Code (Opus 4.6), combined with Tmux and agent sandboxes (E2B). He shows how a primary agent can create teams of specialized agents that work in parallel on separate tasks, each in its own context and isolated sandbox. The demo includes 8 full-stack applications one-shotted by Opus 4.6, then automatically redeployed by two teams of 4 agents each.

Key message: The limitation is no longer the model (Sonnet 4.5 and later are capable enough). The limitation is US: our skill at prompt engineering, context engineering, and building reusable agentic systems.

Key points

1. Multi-Agent Orchestration = Next Level

New workflow: Create team → Create tasks → Spawn agents → Work in parallel → Shutdown → Delete team
New tooling: team_create, team_delete, task_create, task_list, task_get, task_update, send_message (inter-agent communication)
Tmux = visualization: each sub-agent runs in its own pane; you can watch 4-8 agents working simultaneously in real time
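The six-step lifecycle above can be sketched as a toy in-memory model. This is only an illustration: the `Team` and `Task` classes and `run_workflow` are hypothetical, the method names merely echo the tool names listed in the video, and nothing here calls Claude Code. The "work" step also runs sequentially to keep the sketch short.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Task:
    task_id: int
    description: str
    status: str = "pending"            # pending -> completed
    owner: Optional[str] = None


@dataclass
class Team:
    """Toy in-memory stand-in for an agent team (hypothetical, not a real API)."""
    name: str
    tasks: Dict[int, Task] = field(default_factory=dict)
    messages: List[str] = field(default_factory=list)

    def task_create(self, description: str) -> Task:
        task = Task(task_id=len(self.tasks) + 1, description=description)
        self.tasks[task.task_id] = task
        return task

    def task_list(self) -> List[Task]:
        return list(self.tasks.values())

    def task_update(self, task_id: int, status: str, owner: str) -> None:
        task = self.tasks[task_id]
        task.status, task.owner = status, owner

    def send_message(self, sender: str, text: str) -> None:
        self.messages.append(f"{sender}: {text}")


def run_workflow(codebases: List[str]) -> List[str]:
    """Walk the six lifecycle steps; return the messages the primary agent received."""
    team = Team(name="setup-team")                           # 1. create team
    for name in codebases:                                   # 2. create tasks
        team.task_create(f"Summarize setup for {name}")
    agents = {t.task_id: f"agent-{t.task_id}"                # 3. spawn one agent per task
              for t in team.task_list()}
    for task in team.task_list():                            # 4. work (sequential here)
        team.task_update(task.task_id, "completed", agents[task.task_id])
        team.send_message(agents[task.task_id], f"done: {task.description}")
    report = list(team.messages)                             # keep results before teardown
    agents.clear()                                           # 5. shutdown
    team.tasks.clear()                                       # 6. delete team
    return report
```

Copying the messages out before teardown mirrors the point of the workflow: the primary agent keeps only the summaries, while the team and its tasks are discarded.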

2. Agent Sandboxes (E2B) = Secure Playground

Complete isolation: agents work off-device, with zero risk to the local machine
Reusable: a /reboot command redeploys applications into fresh sandboxes
Scalable: Andy had 24 sandboxes active at the same time

3. Multi-Agent Observability = Critical

A custom logging system captures every event: session start/end, tool calls, task updates, messages
You can follow each agent individually: which tools it ran, how many tokens it used, its status
No observability = vibe coding (you don't know what is happening → you can't improve it)
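A per-agent view like the one described can be derived from a flat event stream. The sketch below is an assumption about how such a rollup might look, not the system shown in the video: the `Event` fields, the event kind strings, and `summarize` are all hypothetical names.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Event:
    agent: str
    kind: str            # assumed kinds: session_start, tool_call, task_update, message, session_end
    detail: str = ""     # for tool_call events: the tool name
    tokens: int = 0


def summarize(events: List[Event]) -> Dict[str, dict]:
    """Roll a flat event list up into one row per agent: tools run, tokens, status."""
    summary = defaultdict(lambda: {"tools": [], "tokens": 0, "status": "unknown"})
    for event in events:
        row = summary[event.agent]
        row["tokens"] += event.tokens
        if event.kind == "tool_call":
            row["tools"].append(event.detail)
        elif event.kind == "session_start":
            row["status"] = "running"
        elif event.kind == "session_end":
            row["status"] = "finished"
    return dict(summary)
```

For example, feeding it a `session_start`, two `tool_call` events, and a `session_end` for one agent yields a single row with both tool names, the summed token count, and status `finished`.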

4. Real Constraint = You & I

The models can already do far more than we know how to get out of them
The real limitations:
1. The tools available (agent sandboxes, tmux, observability)
2. Ourselves: prompt engineering + context engineering
Every engineer is limited by: knowledge of tools + ability to engineer prompts/context

5. Practical Workflow Demo

Step 1: Opus 4.6 creates 8 full-stack applications (one-shot, no intervention)
Step 2: The primary agent receives the task: "Build agent team for each codebase, summarize setup"
Step 3: The primary agent opens 8 tmux panes and spawns 8 Haiku agents (cheaper, specialized)
Step 4: Each agent explores 1 codebase, does the setup, and returns a summary
Step 5: The primary agent compiles everything into a final report (context usage: only 31% of the limit!)
Step 6: Repeat for the redeploy into E2B sandboxes (2 teams × 4 agents)
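Steps 3 to 5 are a fan-out/fan-in pattern: one coordinator starts many workers, waits for all of them, and keeps only their short summaries. A minimal sketch with `asyncio`, assuming a stub worker (`explore_codebase` is hypothetical and only sleeps instead of calling a model):

```python
import asyncio
from typing import List


async def explore_codebase(name: str) -> str:
    """Stub worker: a real one would drive a model; this one only simulates latency."""
    await asyncio.sleep(0.01)
    return f"{name}: setup summarized"


async def orchestrate(codebases: List[str]) -> str:
    # Fan out: one worker per codebase, all running concurrently.
    summaries = await asyncio.gather(*(explore_codebase(c) for c in codebases))
    # Fan in: the coordinator only ever sees the short summaries.
    return "\n".join(summaries)


def build_report(codebases: List[str]) -> str:
    return asyncio.run(orchestrate(codebases))
```

`asyncio.gather` returns results in the order the workers were passed in, so the report lines match the order of `codebases`. Keeping only summaries in the coordinator is one plausible reason its context stays small, as in the 31% figure noted above.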

6. Best Practices

Specialized agents > generalist: one agent = one specific task → finish → shutdown
Reset context: after each team workflow, delete the team and its tasks → fresh context
Iterative: if something fails (e.g. 2 apps are missing data), spin up an ad-hoc team to fix it
Observability first: you can't scale if you don't know what the agents are doing

Key quotes

"The game on the field is changing. It's no longer about what the models allow us to do. As of Sonnet 4.5, these models can do much more than you and I give them credit for. The true constraint of agentic engineering now is twofold: it's the tools we have available and it's you and I."

"The true limitation is you and I. It is our capabilities. It's our ability to prompt engineer and context engineer the outcomes we're looking for and build them into reusable systems."

"How can you prompt engineer and context engineer with great powerful models to get more engineering work done than ever with confidence? We want to be building systems of trust with our agents."

"You want to spin up specialized agents that do one thing extraordinarily well. They focus on one task and then they finish."

"This whole idea that engineers are going to be replaced by this technology to me is absurd. Engineers are the best positioned to use agentic technology."

"Every engineer is limited by their tools and their knowledge of their tool. That's why multi-agent observability is super key."

"It all comes back to things we talk about on the channel all the time: Core Four — Context, Model, Prompt, Tools. Everything boils down to that."

"Scale our compute to scale our impact." (recurring theme)

Takeaways for Marius

1. Multi-agent orchestration = applicable to ROA

Potential scenarios:
- A primary agent coordinates 3-4 agents: one for DB migration, one for testing, one for documentation
- Complex feature (e.g. a new ROA report) → agent team: one for UI design, one for backend logic, one for ANAF validations
- Maintenance tasks: one agent checks logs, another runs backups, another does health checks

2. Observability = critical for trust

If you deploy agents in production for clients → you need to know EXACTLY what they did
Simple approach: log every tool call + result to a DB/file → complete audit trail
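One way to get such an audit trail with no extra dependencies is Python's built-in `sqlite3` module. This is a minimal sketch under assumptions: the `tool_calls` table layout and the function names are made up for illustration, and arguments are stored as JSON text.

```python
import json
import sqlite3
import time
from typing import List, Tuple


def open_audit_log(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the audit database; ':memory:' is handy for experiments."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS tool_calls (
               id        INTEGER PRIMARY KEY AUTOINCREMENT,
               ts        REAL NOT NULL,
               agent     TEXT NOT NULL,
               tool      TEXT NOT NULL,
               arguments TEXT NOT NULL,
               result    TEXT NOT NULL
           )"""
    )
    return conn


def log_tool_call(conn: sqlite3.Connection, agent: str, tool: str,
                  arguments: dict, result: str) -> None:
    """Append one tool call; commit immediately so a crash loses nothing."""
    conn.execute(
        "INSERT INTO tool_calls (ts, agent, tool, arguments, result) VALUES (?, ?, ?, ?, ?)",
        (time.time(), agent, tool, json.dumps(arguments), result),
    )
    conn.commit()


def audit_trail(conn: sqlite3.Connection, agent: str) -> List[Tuple[str, dict, str]]:
    """Return everything one agent did, in the order it happened."""
    rows = conn.execute(
        "SELECT tool, arguments, result FROM tool_calls WHERE agent = ? ORDER BY id",
        (agent,),
    )
    return [(tool, json.loads(args), result) for tool, args, result in rows]
```

Passing a file path instead of `":memory:"` keeps the log across runs. Committing after every insert trades some throughput for durability, which seems the right default for an audit trail.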

3. Context engineering > model size

You don't have to use Opus for everything
Pattern: the primary agent (Opus/Sonnet) coordinates, sub-agents (Haiku) execute simple tasks
Savings: Haiku is much cheaper, a good fit for repetitive tasks
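The coordinator/worker split can be expressed as a small routing rule. The sketch below is hypothetical throughout: `AgentTask`, its `needs_planning` flag, and the tier labels are invented for illustration, and mapping a label to a concrete model is left to the caller.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class AgentTask:
    description: str
    needs_planning: bool = False   # hypothetical flag, set by whoever creates the task


def pick_tier(task: AgentTask) -> str:
    """Send planning/coordination to the larger tier, everything else to the cheap one."""
    return "coordinator" if task.needs_planning else "worker"


def plan_assignments(tasks: List[AgentTask]) -> Dict[str, str]:
    return {task.description: pick_tier(task) for task in tasks}
```

In the pattern from the notes, "coordinator" would correspond to Opus or Sonnet and "worker" to Haiku. A real rule would likely need more signals than a single boolean.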

4. Sandboxing = safety net

If you have agents that modify code or databases, run them in isolated containers (Docker/LXC)
Zero risk to the main system
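As a rough illustration of the Docker variant, the helper below only builds a `docker run` command line and leaves executing it (for example with `subprocess.run`) to the caller. The function name, the `/work` mount point, and the example image and script are assumptions; the flags themselves are standard `docker run` options.

```python
from pathlib import Path
from typing import List


def sandbox_command(workdir: str, image: str, agent_cmd: List[str]) -> List[str]:
    """Build a docker run invocation that confines an agent to one mounted directory."""
    return [
        "docker", "run", "--rm",                   # remove the container when it exits
        "--network", "none",                       # no network access from inside
        "--read-only",                             # container filesystem is read-only...
        "--tmpfs", "/tmp",                         # ...except a scratch /tmp
        "-v", f"{Path(workdir).resolve()}:/work",  # only this directory is writable
        "-w", "/work",
        image,
        *agent_cmd,
    ]
```

For example, `sandbox_command("./project-copy", "python:3.12-slim", ["python", "agent.py"])` confines a hypothetical `agent.py` to a copy of the project. An agent that must call a hosted model API needs network access, so `--network none` would have to be relaxed in that case.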

5. Fresh context after each workflow

Delete the team + tasks once the work is done → avoids "context pollution"
Every large task = fresh start

Tools mentioned

Claude Code (Opus 4.6, Sonnet 4.5, Haiku)
Tmux (multi-pane visualization for parallel agents)
E2B (cloud-based agent sandboxes)
Multi-agent observability system (custom logging, link in the video description)
Agent sandbox skill (custom skill for managing E2B)
Tactical Agentic Coding (Andy's course, promoted in the video)

Possible actions

Test multi-agent orchestration in Claude Code (experimental flag)
Build simple observability for the ROA agents (log tool calls → SQLite)
Explore Docker/LXC sandboxing for agents that modify code
Study the pattern: Primary (Opus/Sonnet) + Workers (Haiku) to save cost
Implement a "fresh context" workflow: delete the history once a task is finished
