Amazon Textract provides you the ability to customize the pretrained Queries feature and improve extraction accuracy on your business specific document types while you maintain control and ownership of your data. Through the AWS Console you can upload as few as ten sample documents, annotate the data, and customize the pretrained Queries feature within a few hours.
Amazon Textract features
Why Amazon Textract
Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, layout elements, and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables. All extracted data is returned with bounding box coordinates—polygon frames that encompass each piece of identified data, such as a word, a line, a table, or individual cells within a table. Amazon Textract also returns a confidence score for everything it identifies so you can make informed decisions about how to use the results.