Headless cron enrichment system + progress checkpoint at 32%
OS cron fires enrich_wave.sh twice nightly (post 23:00 UTC reset); each wave caps at ~700 keys (~75% window) via enrichment_wave.py --prepare. Fully headless: one claude -p per batch via xargs, flock-guarded, idempotent. DB updated to 9541 activities; .gitignore covers enrichment intermediates. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
12
.gitignore
vendored
12
.gitignore
vendored
@@ -165,9 +165,14 @@ cython_debug/
|
||||
*.db.backup
|
||||
*.db.bak
|
||||
*.db.tmp
|
||||
*.db.prefreeze*
|
||||
*.sqlite.backup
|
||||
*.sqlite3.backup
|
||||
|
||||
# Agent runtime locks
|
||||
.claude/scheduled_tasks.lock
|
||||
.claude/*.lock
|
||||
|
||||
# Temporary files
|
||||
*.tmp
|
||||
*.backup
|
||||
@@ -179,6 +184,13 @@ data/sources/
|
||||
data/chunks/
|
||||
data/extracted/
|
||||
|
||||
# Enrichment pipeline intermediates (LLM output; final result lands in data/activities.db)
|
||||
data/enrichment_prompts/
|
||||
data/enrichment_parts/
|
||||
data/enrichment_batches/
|
||||
data/enrichment_wf/
|
||||
data/enrichment.json
|
||||
|
||||
# Keep main production database, the hand-written index, and committed golden set
|
||||
!data/activities.db
|
||||
!data/INDEX_MASTER_JOCURI_ACTIVITATI.md
|
||||
|
||||
Reference in New Issue
Block a user