Table extraction can be used to identify tables in a document and extract their
contents. For example, if a PDF receipt contains a table that includes the taxes and total
amount, Document Understanding identifies the table and extracts
the table structure.
Document Understanding provides the number of rows and
columns for the table and the contents in each table cell. Each cell has a confidence score.
The confidence score is a decimal number. Scores closer to 1 indicate a higher confidence in
the extracted text, while lower scores indicate lower confidence score. The range of the
confidence score for each label is from 0 to 1.
Supported features are:
Table extraction for tables with and without borders
Bounding polygons
Confidence score
Single request
Batch request
Limitations are:
English language only
Table Extraction Example
An example of table extraction use in Document Understanding.