Preflight: untrack generated data, fix dangerous .gitignore patterns

Per plan E2/E3: ignore regenerated extraction data (sources, chunks,
extracted, carti-camp-jocuri) and replace the *test*.py / *debug*.py /
*temp*.py / *test*.db patterns that would silently hide the test suite.
Keep activities.db, the hand-written index, golden set and test fixtures.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Claude Agent
2026-05-19 17:25:23 +00:00
parent a19ddf0b71
commit c68dda6c87
102 changed files with 17 additions and 89932 deletions

25
.gitignore vendored
View File

@@ -161,23 +161,32 @@ cython_debug/
# VS Code
.vscode/
# SQLite databases (keep main database, ignore backups and tests)
# SQLite databases (keep main database, ignore backups and temp build files)
*.db.backup
*test*.db
*debug*.db
*.db.bak
*.db.tmp
*.sqlite.backup
*.sqlite3.backup
# Temporary and debug files
*test*.py
*debug*.py
*temp*.py
# Temporary files
*.tmp
*.backup
*.bak
# Keep main production database
# Generated extraction data (regenerated by the pipeline, not version-controlled)
data/carti-camp-jocuri/
data/sources/
data/chunks/
data/extracted/
# Keep main production database, the hand-written index, and committed golden set
!data/activities.db
!data/INDEX_MASTER_JOCURI_ACTIVITATI.md
!data/golden/
# Test fixtures must stay tracked despite the patterns above
!tests/fixtures/
!tests/fixtures/**
# Windows
desktop.ini