How to Extract data from PDFs in UiPath with AI

In this step-by-step guide, we'll create a production-ready UiPath data extraction workflow for PDFs and images in just four simple steps. We'll use Cradl AI’s accurate AI OCR (optical character recognition) models to extract key information from any PDF, and seamlessly export the extracted data back into UiPath, letting you automate data extraction with minimal effort.

Closed Beta

Cradl AI's UiPath connector is currently in closed beta and not publicly available. Get in touch for access.

Choosing Between UiPath Document Understanding & Cradl AI

UiPath's Document Understanding is a built-in solution for AI-powered data extraction, and enables users to extract data from a variety of common document formats.

Cradl AI is a specialised, standalone SaaS tool designed to seamlessly integrate into any document workflow, offering a best-in-class data extraction experience. It features integrated error handling, a train-as-you-go models that adapt to changing documents, and tools for easy collaboration and monitoring, making it ideal for dynamic and complex document processing needs.

Create a data extraction model in Cradl AI

Before we begin, make sure you’ve created a Cradl AI account.

Once you're inside the app, your first step is to create an AI model that understands your documents. In Cradl AI, creating an AI model simply consists of listing the fields you want to extract, and the AI parsing engine will automatically understand what you need.

Cloning a template model

If you're working with invoices, clone the invoices template. If you're working with purchase orders, clone the purchase order template. You get the idea. Template models offer the quickest way to to great data extraction accuracy due to their prior training on these documents. They're also fully editable.

Creating a model from scratch

Many industries operate in environments where document layouts vary a great deal. For these cases, creating a custom model is the way to go. Cradl AI's engine excels at extracting data from such documents and continuously improves over time, learning from your usage to deliver ever increasing accuracy.

Connect Cradl AI to UiPath

In this post, we’re using UiPath Studio Web. If you’re using the desktop version of UiPath Studio, check out this guide.

  1. Go to the Admin Panel in UiPath Studio Web
    • Click on your tenant in the admin panel.
  2. Set up Integration Services
    • Under Services, select Integration Services.
    • In the connectors pane, search for Cradl AI, and choose Connect to Cradl AI.
    • Authenticate and grant permissions as needed to complete the connection.


Send a document to your AI model for data extraction

Once the connection to your Cradl AI model is set up, you’re ready to send your automation.

  1. Set a Trigger in UiPath Studio Web
    • Go to Studio Web tab and Create new project
    • We're using the Manual Trigger in this guide to keep things simple.
  2. Add the Upload Document Activity
    • Click on Add Activity and search for Cradl AI
    • From the list of Cradl AI activities, select Upload Document.
    • Use the previously configured Cradl AI connection, and select the AI model you created.

Parse the document with human-in-the-loop

In addition to automatically uploading documents to Cradl AI, we also want to automatically extract data from them. The Parse document with human-in-the-loop activity let's us do just that, with the added benefit of automatically flagging uncertain predictions for human review, ensuring data integrity.

  1. Add the Parse document with human-in-the-loop Activity
    • Click on Add Activity and search for Cradl AI
    • From the list of Cradl AI activities, select Parse document with human-in-the-loop.
    • Use the previously configured Cradl AI connection, and select the AI model you created.
    • Test the automation by clicking Test on cloud. Inside Cradl AI, you'll see the parsed document.

Export the parsed data back into UiPath

Once the data has been extracted and validated in Cradl AI, it’s time to send it back to UiPath Studio Web and use it in other automated processes.

To export the extracted data from Cradl AI to UiPath, create a new automation project in UiPath Studio Web. This time, however, we replace the default trigger with Cradl AI's Document Parsing Completed trigger. Use the previously configured Cradl AI connection, and select the AI model you created.

In this example, we print the data in a message box to show how easily Cradl AI output can be used in your workflow. You can write the data to a database, send it to an API, or save it locally. Happy document parsing!

Summary

Integrating Cradl AI with UiPath automates document processing by connecting custom AI models to workflows. Create a Cradl AI account, design a model by listing fields or cloning a pre-trained template, and connect it to UiPath via Integration Services. Cradl AI handles data extraction and validation, including human-in-the-loop reviews for uncertain predictions, before exporting parsed data to UiPath. This integration streamlines workflows and offers flexibility for handling diverse documents.

You might also be interested in

Try for free today

We’ll help get you started with your document automation journey.

Schedule a free demo with our team today!