Parse & Extract Invoice data from Emails to Sheets in Zapier

Email is the most common method for receiving third-party invoices, but it comes with it’s share of drawbacks, such as manual data entry from PDFs, the need to download attachments, and high layout variability that complicates automated parsing and data extraction. In this article, we will explore approaches to overcoming these challenges using Zapier and no-code invoice parsing tools. Additionally, we will provide an example of how to leverage the newest AI-powered OCR solutions to extract invoice data from emails and seamlessly send it to Google Sheets.

Best approaches for parsing email invoices in Zapier

Automating invoice extraction from emails in Zapier requires the right parsing tool. The best approach depends on factors like invoice format, volume, and the need for accuracy. Below, we compare three main solutions: AI-powered OCR tools, PDF.co’s invoice OCR, and Zapier’s built-in Email Parser.

AI-Powered OCR tools for invoice parsing automation at scale

These tools streamline data extraction for mid to high-volume invoice inflows with varied layouts. They feature AI-powered OCR, error handling, data formatting, and pre-built integrations for seamless automation. Examples with Zapier connectors include Cradl AI, Docparser and Nanonets.

Benefits

These tools offer pre-trained AI models for instant invoice parsing, with retraining options to improve accuracy over time. Built-in validation and error handling ensure safe, scalable invoice parsing automation with minimal human intervention. They also support various document types beyond invoices.

Drawbacks

For invoices with predictable structures and lower volumes, these tools may be more advanced than necessary. In such cases, simpler, rule-based tools might be a more cost-effective alternative.

When to use

The best option for businesses processing mid-to-high high volumes of invoices from multiple vendors with varying formats. Their built-in validation features, such as human-in-the-loop review, make them a reliable choice for automating business-critical document workflows while ensuring data accuracy.

PDF.co’s invoice OCR for smaller invoice volumes

PDF.co provides a range of PDF-related automation tools, including OCR for extracting invoice data from PDFs. Unlike most of PDF.co’s functionality, which relies on predefined templates, its AI-powered invoice OCR model allows data extraction from invoices with variable layouts.

Benefits

While PDF.co is largely template-based, its AI-powered invoice OCR is an exception, making it more flexible for invoices that do not follow a fixed format. This allows businesses to extract invoice data without needing to manually define templates for each invoice.

Drawbacks

PDF.co’s AI model lacks a mechanism for users to refine accuracy, making it difficult to address inconsistent results. Without built-in validation, it becomes unreliable for processing large volumes of financial documents, as errors may go unchecked and lead to critical inaccuracies in essential data.

When to use

Best suited for businesses handling smaller invoice volumes where manual verification of extracted data is feasible. If accuracy is a primary concern, extracted data should be reviewed before being used in financial workflows.

Email Parser by Zapier for extracting plain text from email bodies

Zapier offers a built-in Email Parser designed to extract structured data from email bodies and use it within automation workflows.

Benefits

Email Parser by Zapier is free and easy to set up. Users can define simple parsing rules to extract key details from structured email content, making it a straightforward solution for specific use cases.

Drawbacks

This tool does not support OCR, meaning it cannot extract text from PDF or image attachments. Additionally, it relies on structured email formats, requiring emails to follow a predictable pattern. If invoice details are presented inconsistently or sent as attachments, this method will not work.

When to use

This is a viable option for extracting structured plain-text invoices directly from email bodies. However, in practice, invoices are rarely sent as plain text, making this tool impractical for most real-world invoice processing needs.

Which invoice parsing approach is right for you?

Approach Best For Limitations
AI-powered OCR automation tools (Cradl AI, Rossum, Docparser) Mid to high-volume invoice processing with variable layouts and built-in validation More advanced than necessary for predictable, low-volume invoices
PDF.co invoice OCR Smaller invoice volumes where manual review is feasible No control over AI accuracy, lacks built-in validation
Email Parser by Zapier Extracting structured plain-text invoices from email bodies Cannot process PDF or image attachments, requires predictable email format

How to extract PDF invoices from email to Google Sheets

Let’s see an actual Zapier automaton example of how to extract invoice data from emails to Google Sheets.

  • Database: A Google Sheet stored in Google Drive.
  • Data extraction tool: Cradl AI to automate data extraction any document, including tables.
  • Orchestration: Zapier to transfer extracted data from Cradl AI into the Excel sheet.  

1. Specify the data you want to extract from your invoices

Before we begin, make sure you’ve created a Cradl AI account.

Once you’re logged into Cradl AI, create your first AI model with just a few clicks by cloning the invoice model.

Customise the model to suit your needs by adding or removing fields based on the data you want to extract from your invoices. Save your model, and that's all you need to do to create an AI model that understands your invoices.

Screenshot of the model configuration user interface inside Cradl AI

Got tables or line items?

Got tabular data or line items in your invoices that you also want to extract? No problem - just use the Line Items field. For more information on that, this article goes into detail about data extraction from tables.

2. Import invoices from your mailbox to your AI model

There are several ways to import your invoices into Cradl AI for data extraction.

Email forwarding

Connecting your mailbox to Cradl AI with a forwarding address is an easy and efficient way to automate your imports. Cradl AI auto-generates an email address you can forward your invoice emails to - any email attachment that arrives at this address will be automatically parsed.

Manual bulk upload

Manually downloading your invoices and uploading them to Cradl AI in bulk is always an option for those who prefer a simple and manual approach. This works particularly well if you only process invoices occasionally.

Mailbox monitoring

Zapier can monitor your mailbox (i.e., with Gmail's New Email trigger) for trigger events—such as emails moved to a specific folder or titles containing certain keywords—and send them to the AI tool for processing. If you choose this approach, you want to add Cradl AI's Parse Document or Parse Document With Human-In-The-Loop actions.

In this example, we'll go with the Email forwarding approach.

  1. Select Email from the list of available triggers in Cradl AI.
  2. Click Save to generate a dedicated email address.

Test the integration by sending an email with a document attached to the newly generated address. The document will be automatically processed, and you can view the extracted data inside the Validator in Cradl AI.

Screenshot of the the email trigger user interface inside Cradl AI

3. Create a Sheet in Google Drive to store data

Now that your AI model is working, we'll connect it with a Google Sheet.

Head over to your Google Drive and create a blank spreadsheet. Add headers that correspond with the fields you want to extract from your documents. In Google Sheets you add headers by simply typing values into the topmost cells:

Screenshot of a Google Sheets sheet with headers configured

4. Sending parsed invoice data to Google Sheets automatically

  1. Head over to Zapier and create a free account.
  2. Use Cradl AI’s pre-built integration with Google Sheets and follow the step-by-step instructions in Zapier.
  3. Map Cradl AI’s extracted fields to your spreadsheet headers.
Screenshot of the Zapier user interface with the Google sheets connector configuration pane

Mapping extracted data from Cradl AI to dynamic Zapier values

When you're mapping Cradl AI's extracted data fields to your spreadsheet's headers, you'll notice that you can choose from way more fields than the handful you defined in your spreadsheet's headers.

99% of the time you are looking for those values that are prefixed with Validated Predictions and suffixed with Value, such as Validate Predictions Purchase Date Value, Validate Predictions Total Amount Value, and so on.

Screenshot of the Zapier user interface with the Google sheets connector step 2 configuration pane

Run your invoice data extraction Zap

Once your Zap is activated, the automation is ready to run:

  1. Forward an email with an invoice attachment to your Cradl AI email address.
  2. Cradl AI will automatically process the document and display the extracted data in the Validator interface.
  3. Review the extracted data for accuracy, correct any flagged errors, and click Validate to confirm.


Clicking Validate triggers the Zap, sending the data directly to your Google Sheet. Within seconds, your spreadsheet will update with the extracted values.

If you would like a version of this tutorial that does into more detail, this video has got you covered.

You might also be interested in

Try for free today

We’ll help get you started with your document automation journey.

Schedule a free demo with our team today!