Author here. Built this after implementing similar workflows for a few SMEs.
The surprising part: LLM extraction (Claude/GPT-4 Vision) is reliable enough for ~90% of standard invoice formats now. The validation layer catches most hallucinations.
Happy to discuss edge cases or alternative approaches.