Headless cron enrichment system + progress checkpoint at 32%

OS cron fires enrich_wave.sh twice nightly (post 23:00 UTC reset); each wave
caps at ~700 keys (~75% window) via enrichment_wave.py --prepare. Fully
headless: one claude -p per batch via xargs, flock-guarded, idempotent.
DB updated to 9541 activities; .gitignore covers enrichment intermediates.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Claude Agent
2026-06-01 21:26:35 +00:00
parent d6971e47f8
commit f7a37f91ec
6 changed files with 619 additions and 7 deletions
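
The commit message describes each wave as lock-guarded, idempotent, and capped at roughly 700 keys. The following is a minimal Python sketch of that pattern, not the contents of enrich_wave.sh or enrichment_wave.py, which are not shown here. The names `single_instance` and `plan_wave` and the batch size of 50 are hypothetical; only the ~700-key cap comes from the message. The real script reportedly uses flock and xargs from the shell.

```python
import fcntl
import os
from contextlib import contextmanager

WAVE_CAP = 700  # per-wave key cap, taken from the commit message


@contextmanager
def single_instance(lock_path):
    """Yield True if this run holds the exclusive lock, False if another run does.

    Hypothetical helper: mirrors what a flock(1) guard does in a shell script.
    """
    fd = os.open(lock_path, os.O_CREAT | os.O_RDWR, 0o644)
    try:
        try:
            # Non-blocking: fail immediately instead of queueing behind a running wave.
            fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            yield False
            return
        try:
            yield True
        finally:
            fcntl.flock(fd, fcntl.LOCK_UN)
    finally:
        os.close(fd)


def plan_wave(pending_keys, done_keys, cap=WAVE_CAP, batch_size=50):
    """Skip keys that are already enriched, cap the wave, and split it into batches.

    Filtering on done_keys is what makes a re-run idempotent in this sketch;
    batch_size=50 is an assumed value, not one stated in the commit.
    """
    done = set(done_keys)
    todo = [k for k in pending_keys if k not in done][:cap]
    return [todo[i:i + batch_size] for i in range(0, len(todo), batch_size)]
```

A cron-driven entry point would wrap its work in `with single_instance(path) as acquired:` and exit early when `acquired` is false, so that an overlapping second invocation does nothing.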

.gitignore (12 lines changed)

@@ -165,9 +165,14 @@ cython_debug/
*.db.backup
*.db.bak
*.db.tmp
*.db.prefreeze*
*.sqlite.backup
*.sqlite3.backup
# Agent runtime locks
.claude/scheduled_tasks.lock
.claude/*.lock
# Temporary files
*.tmp
*.backup
@@ -179,6 +184,13 @@ data/sources/
data/chunks/
data/extracted/
# Enrichment pipeline intermediates (LLM output; final result lands in data/activities.db)
data/enrichment_prompts/
data/enrichment_parts/
data/enrichment_batches/
data/enrichment_wf/
data/enrichment.json
# Keep main production database, the hand-written index, and committed golden set
!data/activities.db
!data/INDEX_MASTER_JOCURI_ACTIVITATI.md