Grooper enhances images displayed to users throughout your workflow, removes image artifacts known to interfere with OCR, and provides crisp versions for permanent archival.
It also analyzes page structure to assist with automated decision making downstream.
Image Processing – the secret for achieving near-perfect OCR
Good OCR (optical character recognition) starts with images free of non-text artifacts. Grooper has dozens of features that remove everything that isn’t text to ensure you get distraction-free OCR. Let’s look at a few examples.
Safe and Clean Halftone Removal
Dithering and other halftone patterns are a direct result of legacy document imaging platforms poorly converting color images to black and white.
These artifacts must be eliminated to prevent massive errors in OCR results, particularly with punctuation like periods and commas.
Imagine the benefits of the best image processing in your organization. Let us know how we can help you!
Before Halftone Removal:
Halftone artifacts completely surround text we’d like to capture. OCR stands very little chance at seeing these characters.
After Halftone Removal:
Grooper recognizes dithered patterns and safely removes them without eliminating legitimate punctuation that is close to letters on the page.
Seriously Brilliant Border Removal
Borders have commonly been very tricky to remove when the black region doesn’t extend all the way to the edge of the page.
Grooper understands how to address a variety of uncommon border scenarios to cleanly remove them.
You work with full-color documents every day. So why shouldn’t your digital image processing do the same, no matter the image format you are using?
This is no problem for Grooper. It’s object removal and editing breaks out of the realm of black-and-white processing with full color recognition and editing.
Use this feature to digitally restore damaged or unknown parts of an image using information from nearby pixels.
Pixel-Perfect Line Detection & Removal
For humans, lines are needed to provide visual cues that increase readability. Lines of all sizes are common and frequent in standardized forms, table structures, and pages with “fill-in-the-blank” comb boxes.
However, these lines, particularly the short, vertical ones, are commonly and mistakenly read by OCR engines as letters or numbers.
But Grooper computer vision can erase these lines with ease…