feat(ocr): Implement persistent worker pool with SQLite job queue

Major OCR infrastructure improvements:
- Add persistent SQLite-based job queue for OCR tasks
- Implement worker pool with process isolation and auto-restart
- Add OCR engine selector dropdown (Tesseract/PaddleOCR) in upload zone
- Optimize Tesseract preprocessing based on benchmark results (8x faster)
- Add recognize_cif_optimized() with multi-strategy CIF extraction
- Add Romanian CIF checksum validation
- Increase Telegram long polling timeout from 10s to 30s

Squashed commits:
- feat(ocr): Implement persistent worker pool with SQLite job queue
- feat(ocr): Add OCR engine selector dropdown to upload zone
- perf(telegram): Increase long polling timeout from 10s to 30s
- perf(ocr): Optimize Tesseract preprocessing based on benchmark results

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
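
The message mentions Romanian CIF checksum validation, but the code is not part of the diff shown here. As a rough illustration only, the sketch below applies the commonly documented control-key scheme (key 753217532, sum times 10, modulo 11, with 10 mapped to 0). The name `is_valid_cif` and the handling of an `RO` prefix are assumptions, not taken from this commit.

```python
import re

# Control key for Romanian CIF/CUI numbers, aligned to the right of the
# digits that precede the check digit.
CIF_KEY = (7, 5, 3, 2, 1, 7, 5, 3, 2)


def is_valid_cif(raw: str) -> bool:
    """Return True if `raw` looks like a CIF with a correct check digit.

    Hypothetical helper: the name and the optional "RO" prefix handling
    are assumptions made for this sketch.
    """
    text = raw.strip().upper()
    if text.startswith("RO"):
        text = text[2:].strip()

    # A CIF has between 2 and 10 digits, the last one being the check digit.
    if not re.fullmatch(r"[0-9]{2,10}", text):
        return False

    body, control = text[:-1], int(text[-1])

    # Left-pad the body to 9 digits so it lines up with the key.
    padded = body.rjust(9, "0")
    total = sum(int(digit) * weight for digit, weight in zip(padded, CIF_KEY))

    expected = (total * 10) % 11
    if expected == 10:
        expected = 0
    return expected == control
```

A validator like this could act as a filter on OCR candidates, so that strings which merely look numeric are rejected before they reach downstream processing.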
@@ -70,6 +70,8 @@ opencv-python>=4.8.0
 pytesseract>=0.3.10
 pdf2image>=1.16.0
 numpy>=1.24.0
+# Process management for OCR worker pool (Windows orphan cleanup)
+psutil>=5.9.0
 
 # ============================================================================
 # TELEGRAM MODULE - Bot SDK