Document & vision AI for a post-breach response firm
After enterprise breaches, our client needed to identify exactly which individuals were affected — across tens of thousands of leaked tax forms, financial records, and scanned documents in inconsistent layouts. We built the document analysis pipeline (PDF parsing, OpenCV recognition, LLM entity extraction), then re-architected it for parallelism on ARM64 Graviton hardware. A 30,000-document run that used to take hours now finishes in under three minutes.
Throughput gain on the 30K-doc breach analysis pipeline