Kavian Braanaas

Content Writer

Reading time: 3 min.
January 10, 2025

Automate Data Extraction From Just About Any PDF With Cradl AI

In this post, we’ll walk you through how to use Cradl AI to quickly set up an automated data extraction workflow for almost any PDF document in just 3 simple steps. We’ll show you how easy it can be to automate document workflows and eliminate the need for manual data entry, giving your team more time to focus on what matters.

Create a Cradl AI data extraction model

Before we begin, make sure you’ve created a Cradl AI account.

Once you're inside the app, your first step is to create an AI model that understands your documents. In Cradl AI, creating an AI model simply means defining a list of the data points you want to extract from your documents.

Create a model from scratch or customise a template model

For instance, if your documents are bills of lading, you'd define data points such as BoL number, carrier name, vessel name, port of loading, and so on. Alternatively, if you have a common document type, like invoices, clone the corresponding template model, then add or remove fields according to your needs.

Screenshot comparing the AI model configuration of two different Cradl AI models: a bill of lading model and an invoice model
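If it helps to picture a model as data, here is a minimal Python sketch of the idea that a model is just a list of fields. You define all of this in the Cradl AI app, not in code; the `Field` class, the field identifiers, and the `customise` helper are hypothetical names used for illustration and are not part of Cradl AI.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    """One data point to extract (hypothetical structure, not Cradl AI's)."""
    name: str         # key used for the value in the extracted output
    description: str  # human-readable label for the data point


# Field list for a bill of lading, using the data points named above.
BILL_OF_LADING_FIELDS = [
    Field("bol_number", "BoL number"),
    Field("carrier_name", "Carrier name"),
    Field("vessel_name", "Vessel name"),
    Field("port_of_loading", "Port of loading"),
]


def customise(template, add=(), remove=()):
    """Return a copy of a template field list with fields added or removed."""
    removed = set(remove)
    kept = [field for field in template if field.name not in removed]
    return kept + list(add)
```

Customising a template then amounts to something like `customise(BILL_OF_LADING_FIELDS, add=[Field("consignee", "Consignee")], remove=["vessel_name"])`, which mirrors adding and removing fields in the app.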

Extract data from your first PDF document

With your AI model set up, you're ready to extract data from your first document:

  1. Click «Run» on your dashboard and upload your documents.
  2. Wait for your AI model to process them. Once processed, you can review the extracted data in the Validator.
  3. Make any corrections if necessary, and finalise the extracted output as JSON by clicking «Validate».

Below is an example of data extracted from a bill of lading. The source location of each extracted value is highlighted on the PDF. Notice the orange confidence scores. These flag the fields where the AI requires human review (human-in-the-loop) before it finalises the data export.

Screenshot of the document and the data extracted from it inside Cradl AI
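To make the role of confidence scores concrete, here is a small Python sketch that lists the fields falling below a review threshold. The JSON shape (a `value` and a `confidence` per field), the sample values, and the `0.9` threshold are all assumptions made for illustration; check your own validated output for the actual structure.

```python
import json

# Assumed cut-off for illustration only; not a Cradl AI default.
REVIEW_THRESHOLD = 0.9


def fields_needing_review(extracted, threshold=REVIEW_THRESHOLD):
    """Return the names of fields whose confidence is below the threshold."""
    return [
        name
        for name, field in extracted.items()
        if field["confidence"] < threshold
    ]


# Made-up sample output; the real JSON structure may differ.
sample = json.loads("""
{
  "bol_number": {"value": "BOL-0001", "confidence": 0.99},
  "carrier_name": {"value": "Example Carrier", "confidence": 0.95},
  "vessel_name": {"value": "Example Vessel", "confidence": 0.62}
}
""")

print(fields_needing_review(sample))
```

In this made-up sample, only `vessel_name` falls below the threshold, so it is the one field a reviewer would need to check.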

Connect your Cradl AI model to your documents

Manual document upload becomes increasingly time-consuming at scale. Besides, we want to send our extracted JSON data somewhere, such as an Excel sheet or an ERP system. Fortunately, Cradl AI makes it incredibly easy to integrate its data extraction models into most workflows without writing a single line of code.

Cradl AI follows a conventional workflow pattern with third-party triggers and export integrations. Data extraction from a document is triggered by an external event, such as an inbound email attachment or one of our third-party integrations.

Screenshot of the Trigger and Export options interface in Cradl AI


Once data has been extracted from a document, Cradl AI uses the extraction confidence scores to decide whether human review is required, and then exports the data to a variety of destinations, such as webhooks, APIs, or the aforementioned third-party integrations.
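If you choose a webhook as your export, something on your side has to receive the data. Here is a minimal sketch of such a receiver using only Python's standard library. It assumes the export arrives as an HTTP POST with a JSON body; the port, the in-memory `RECEIVED` list, and the payload contents are illustrative assumptions, not Cradl AI's documented webhook format.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# In-memory store for illustration; a real receiver would forward the data
# to a spreadsheet, database, or ERP system instead.
RECEIVED = []


class ExportHandler(BaseHTTPRequestHandler):
    """Accept JSON documents sent via HTTP POST."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length))
        except ValueError:
            # Body was empty or not valid JSON.
            self.send_response(400)
            self.end_headers()
            return
        RECEIVED.append(payload)
        self.send_response(204)  # accepted, nothing to return
        self.end_headers()

    def log_message(self, format, *args):
        pass  # silence the default per-request logging


def make_server(port=8000):
    """Create a receiver bound to localhost on the given port."""
    return HTTPServer(("127.0.0.1", port), ExportHandler)
```

Calling `make_server().serve_forever()` starts the receiver locally. To accept exports from an external service, it would need to be reachable at a public HTTPS URL and should verify that requests come from a trusted sender.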

That's all it takes to set up end-to-end, automated data extraction from just about any PDF with Cradl AI.

Summary

By creating an AI model that understands your PDFs and connecting it to your inbound source of documents, Cradl AI can increase the degree of automation for just about any business that still relies on manual data entry.

Get started for free

We’ll help get you started with your document automation journey.

Schedule a free demo with our team today!