fix(ocr): Improve CUI matching and vendor name extraction

- Add CUI variant matching for Romanian fiscal codes (handles "RO22891860",
  "RO 22891860", and "22891860" formats) in both sync_service and validation
- Fix vendor name extraction to distinguish the "SC." company prefix
  (Societate Comercială) from "SC" meaning staircase in addresses
- Remove problematic TVA pattern that was incorrectly matching percentage values
- Add docTR Plus engine option to dropdown with "(recomandat)" label
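
The CUI variant matching in the first bullet could look roughly like the sketch below. This is not the code from sync_service or validation: `cuiVariants` and `cuiMatches` are hypothetical names, and the 2 to 10 digit length check is an assumption about valid fiscal codes.

```javascript
// Hypothetical helpers illustrating CUI variant matching; names and the
// digit-length check are assumptions, not taken from this commit.

// Return the three spellings of a CUI, or [] if the input is not a CUI.
function cuiVariants(raw) {
  // Drop whitespace, then strip an optional leading "RO" (case-insensitive).
  const digits = String(raw)
    .replace(/\s+/g, '')
    .toUpperCase()
    .replace(/^RO/, '');
  // Assumed rule: a CUI is 2 to 10 digits once the prefix is removed.
  if (!/^\d{2,10}$/.test(digits)) {
    return [];
  }
  return [digits, `RO${digits}`, `RO ${digits}`];
}

// Two values match when they reduce to the same bare digit string.
function cuiMatches(a, b) {
  const va = cuiVariants(a);
  const vb = cuiVariants(b);
  return va.length > 0 && vb.length > 0 && va[0] === vb[0];
}
```

Comparing on the bare digits means an OCR result of "RO 22891860" matches a stored "22891860" without listing every pairing explicitly.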

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-04 05:34:31 +02:00
parent f1f6760bef
commit 2f7ef55868
4 changed files with 61 additions and 9 deletions

@@ -113,6 +113,10 @@ const engineConfig = {
 label: 'docTR',
 desc: 'Rapid, bună acuratețe'
 },
+'doctr_plus': {
+label: 'docTR Plus',
+desc: '2 treceri secvențiale · (recomandat)'
+},
 'paddleocr': {
 label: 'PaddleOCR',
 desc: 'Cea mai bună calitate'
@@ -123,7 +127,7 @@ const engineConfig = {
 },
 'hybrid': {
 label: 'Hybrid',
-desc: 'docTR+Tess paralel · Recomandat'
+desc: 'docTR+Tess paralel'
 },
 'hybrid-quality': {
 label: 'Hybrid Calitate',