Automated Document Classification
Quickly regain control of sprawling, unstructured document collections by automatically organizing them into logical groups based on similarity rankings in models that you train and control.
Feature Collection with ESP
Grooper’s ESP engine identifies the distinguishing features of each page to group collections of images together as classified documents. ESP uses three key feature collection mechanisms:
Natural language processing examines the language of the complete document to understand context.
Find unique key words or phrases that positively identify a document, like a title or section heading.
Computer vision identifies structured forms based on what they look like without having to read from OCR.
Provide document examples and watch as Grooper begins to learn the correct Doc Type for each instrument provided. When doing batch testing, unclassified items (those with low confidence scores) can be flagged and sent to a queue for additional training.