In this brief guide, I’ll show you how to use Cradl AI to automate the process of extracting data from PDF tables. Whether you’re handling transaction lists, inventory reports, or any other PDF or image file with tables, Cradl AI streamlines the table extraction process with minimal setup. Let's dig in.
To start things off, we need to create or clone a Cradl AI model. If you haven't done that before, check out the first section of this guide. A few clicks is all it takes.
To enable our AI model to recognise tabular sections in our documents, we need to add the «Line Items» field to it. To do this, simply select the «Line Items» option from the «Type» dropdown.
When processing a table, the AI model goes through each row and captures the values in the relevant columns as key-value pairs.
For instance, if you’re working with a list of purchased goods on an invoice, a single «Line Items» field captures multiple columns per row, so that one line item row will hold information such as "description," "unit price," "quantity," and "VAT amount."
In other words, one «Line Items» field can capture hundreds of data points systematically.
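To make the row-and-column idea concrete, here is a minimal Python sketch of how one «Line Items» field could be represented once extracted. The list-of-dictionaries shape, the column names, the sample values, and the helper functions are all assumptions made for illustration; they are not Cradl AI's actual output format or API.

```python
# Hypothetical shape of one extracted "Line Items" field: a list of rows,
# where each row maps a column name to its value (key-value pairs).
# This is an illustrative assumption, not Cradl AI's real output format.
line_items = [
    {"description": "Widget", "unit_price": "4.00", "quantity": "3", "vat_amount": "3.00"},
    {"description": "Gadget", "unit_price": "10.00", "quantity": "1", "vat_amount": "2.50"},
]


def count_data_points(rows):
    """Count the individual key-value pairs across all rows."""
    return sum(len(row) for row in rows)


def column(rows, name):
    """Collect one column's values across all rows, skipping rows that lack it."""
    return [row[name] for row in rows if name in row]


print(count_data_points(line_items))
print(column(line_items, "description"))
```

With two rows of four columns each, this single field already holds eight data points, and a table with a hundred rows would hold four hundred.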
Save your changes and let's extract table data from a document! We trigger the data extraction process by simply uploading a document. Wait for the parsing to finish, and inspect the outcome.
A key feature of Cradl AI is the interactive data-location mapping. When you click on an extracted data field, the corresponding area in the original document is highlighted. This saves time, because you can check each extracted value against its source without searching the document yourself.
And that's how you extract data from PDF tables with Cradl AI!
From here, you can integrate with other tools to automate your workflows and further streamline internal data processing.
Cradl AI supports input and output integrations with a variety of popular automation tools and apps, such as Power Automate, UiPath, Zapier, Excel, Google Sheets, email, APIs, webhooks, and more.
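If you receive the extracted rows in your own code, for example through an API or webhook, turning them into something a spreadsheet can open takes only a few lines. The sketch below assumes the rows arrive as a list of dictionaries, which is an assumption for illustration rather than Cradl AI's documented payload; `line_items_to_csv` is a hypothetical helper, and it relies only on Python's standard `csv` module.

```python
import csv
import io


def line_items_to_csv(rows):
    """Serialize a list of row dicts to CSV text.

    The header is the union of all keys, in first-seen order, so rows
    that are missing a column simply get an empty cell.
    """
    fieldnames = []
    for row in rows:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical sample rows; the second row has no "quantity" value.
sample_rows = [
    {"description": "Widget", "quantity": "3", "vat_amount": "3.00"},
    {"description": "Gadget", "vat_amount": "2.50"},
]
print(line_items_to_csv(sample_rows))
```

The resulting text can be saved with a `.csv` extension and opened in Excel or imported into Google Sheets. Building the header from the union of keys is a deliberate choice here, since real tables often have rows with blank cells.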
Extracting data from PDF tables with Cradl AI is quick and straightforward. Start by creating or cloning one of our AI models, then add the "Line Items" field to enable the model to recognize tabular sections in your documents. Next, upload a document to extract data directly from its tables. Once the data is extracted, you can automate the entire pipeline and transform PDF table data into structured formats using our integrations with popular automation tools and apps.
We’ll help you get started on your document automation journey.
Schedule a demo with our team today!