Files

Marius Mutu d175d5ba5a chore: working-tree state — anaf snapshots, cron state, KB notes, tools

Pre-existing uncommitted changes swept in with the STT work:
anaf-monitor snapshots/versions, cron job + newsletter state, 9 youtube KB
notes, tools/ocr_bon.py, and tools/tts.py.

Note: the tts.py change breaks 2 truncation tests in test_voice_normalize.py
(sanitize word-count) — flagged for a separate follow-up.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

2026-06-27 18:16:31 +00:00

3.3 KiB

Raw Blame History

Google Just Dropped a Masterclass on Agentic Engineering

URL: https://youtu.be/zbmuiaPuiNM Data: 2026-06-25 Durata: 21:55 Tags: @work @growth

TL;DR

Google a publicat un ghid de 51 de pagini despre AI-driven SDLC (Software Development Life Cycle). Concluzia centrală: harness-ul (regulile, workflow-urile, tool-urile, guardrails) contează 90%, modelul LLM doar 10%. Vibe coding e ok pentru prototipuri, dar agentic engineering — cu spec-uri clare, teste automate și un harness bine inginerit — este singura cale spre cod fiabil și cost-eficient pe termen lung.

Puncte cheie

AI coding e un spectru, nu un switch
- Vibe coding → prompt casual, validare vizuală
- Structured AI assisted → prompts detaliate, spot-checking manual
- Agentic engineering → harness inginerit, evals automate, CI gates
Harness = 90% din sistem
- Instrucțiuni, MCP servers, guardrails, hooks, skills (workflows), sub-agenți, observabilitate
- Modelul LLM e doar 10% — poți face Sonnet să performeze ca Opus cu harness-ul potrivit
Factory model
- Tu proiectezi sistemul, agentul produce codul
- Planning agent separat de coding agent (evitare context rot + bias)
- Human review la final (cel puțin PR review)
Static vs Dynamic context
- Static = reguli de bază, system prompt — încărcat mereu (fiabil dar costisitor)
- Dynamic = skills, conventions per folder — încărcat on-demand (eficient dar riscul că agentul uită să le ceară)
- Trend: un singur agent generalist + skills specializate, nu zeci de sub-agenți specializați
System evolution mindset
- La fiecare problemă: nu doar fix-ul, ci și îmbunătățirea harness-ului
- Echivalentul tasks/lessons.md la scară industrială
Token economics
- Vibe coding: CAPEX mic, OPEX mare (arzi tokens pe cod slab)
- Agentic engineering: CAPEX mare inițial, OPEX mic — crossover-ul vine repede
- 3-10x mai fiabil și mai ieftin pe termen lung
Conductor vs Orchestrator
- Conductor = micro-management la nivel de fișier (modul vechi)
- Orchestrator = dai taskuri mari, revizuiești outcome-uri, agenți în paralel
- Google zice că mergi între cele două; autorul crede că cu harness bun rămâi mereu Orchestrator

Quote-uri

"The harness matters as much as the model." — Anthropic (citat în video)

"The model is only 10%. Everything else — instructions, tools, context, guardrails, orchestration, observability — makes up the other 90%." — Google

"Rather than embedding every piece of specialized knowledge into the agent system prompt, skills allow the agent to remain a lightweight generalist that flexes into specialist roles on demand through progressive disclosure."

"Specification quality is the new bottleneck." — Google

"Every single time you go through this process, you're making it more and more reliable. The harness is worth investing your time into."

Relevanță pentru Echo / ROA

Confirmă arhitectura Echo: personality/*.md (static) + skills gstack (dynamic) + lessons.md (evolution)
Ralph workflow = factory model în practică: PRD → planning agent → coding agent → review
tasks/lessons.md = system evolution mindset aplicat
Validare că abordarea cu un singur agent + skills e corectă (nu zeci de sub-agenți)

3.3 KiB Raw Blame History