In the evolving landscape of Agentic AI, the core promise is clear: technology should seamlessly and intelligently serve your business needs. The complexity of data processing – whether structured, semi-structured, or the vast amounts of unstructured information, should no longer be a barrier. At the end of the day, data is data, and its type shouldn’t complicate access or utilization for your end users. Your teams, your agents, require immediate access to critical insights, and that’s precisely where the power of Salesforce Data Cloud’s Document AI delivers exceptional value.
Unstructured data accounts for nearly 80% of all enterprise data, yet most organizations struggle to access this data and more importantly derive actionable insights from it. Imagine your agents effortlessly accessing and acting upon information previously locked within PDFs and images. With our groundbreaking Document AI capabilities within Data Cloud, this advanced functionality is now a “tangible reality”.
The Traditional Approach
Consider a customer who has provided an invoice, which typically includes various details like a customer name, a billable amount, a PO number, and line items. Conventionally, a team member would manually review each document, extract relevant information, and input it into disparate systems – a process that is often time-consuming and susceptible to errors.
The Modern Approach: A simple 4-Step Transformation with Data Cloud & Agentforce!
Let’s illustrate this capability with a common business scenario: accessing data from an Invoice PDF (also checkout a supporting video, in the video resource section).
Step 1: Seamless Ingestion into Salesforce Data Cloud from Diverse Sources
Salesforce recognizes that critical customer data resides across various platforms. Whether your customer’s invoices are in a scanned PDF stored on Amazon S3, or their supporting documents are in Google Cloud Storage, Salesforce Data Cloud ensures effortless ingestion of these documents. This data initially enters as Unstructured Data Lake Objects (UDLOs) and Unstructured Data Model Objects (UDMOs). The platform facilitates the smooth flow of all your vital information, irrespective of its original location or format (images, PDFs, JSON), into Data Cloud, making it immediately available for the next transformative step.
Step 2: Intelligent Unstructured Data Processing with Document AI in Salesforce Data Cloud
This is where the true power of Salesforce Data cloud – Document AI comes to life. Our cutting-edge Document AI solution implements Intelligent Document Processing (IDP) to automate the extraction, classification, and analysis of unstructured content. This capability, significantly accelerated by advanced Large Language Models (LLMs), dramatically reduces manual effort and processing time.
Document AI achieves this transformative power through a sophisticated integration of technologies (NLP, ML & AI) to accurately extract key information from your documents.
- It accurately extracts customer names and billable amounts.
- It identifies and retrieves the PO number.
- It extracts the line items and relevant dates.
For Demo purposes, a sample Invoice PDF file is being processed from AWS S3 bucket into Data Cloud as a UDLO.

We then configure Document AI in Data Cloud, filling in the below details.

The outcome? Your previously unstructured UDLOs and UDMOs are transformed into meticulously structured Data Lake Objects (DLOs) and Data Model Objects (DMOs) directly within Data Cloud, ready for immediate utilization. This not only significantly boosts efficiency and reduces costs by eliminating manual data entry but also enhances accuracy and minimizes errors, ensuring high precision and data integrity across your operations.
Step 3 – Harmonize with rest of data in Data Cloud
Once the extracted data from the documents are stored in DLOs, you can join it with the rest of the harmonized data already in the Data cloud in all kinds of compelling ways. For eg: you can create a calculated insight summarizing the spend by product category. You can potentially add the Invoice DMO to the Data graph and thereby enhance the C360 view of the customer. You can potentially segment users based on the extracted data.
As an example, here we are using the Query editor to show how the extracted data can be queried like any other structured data..
With the power of Document AI, we have managed to successfully augment the C360 data with insights from unstructured content.

Step 4: Agentforce: Empowering Your Teams with Instant, Actionable Insights
Once this data is refined and structured within Salesforce Data Cloud as DLOs and DMOs, Agentforce seamlessly integrates to complete the process with unparalleled efficiency. Imagine your team members managing their daily tasks. Through a pre-configured, auto-launched flow, or by using APEX classes, they can effortlessly access all the extracted, structured information.
Instead of navigating multiple PDFs manually, the team member views the customer’s verified details and billable amounts presented clearly within their Salesforce console. Should a specific detail from the original document require verification, a simple Q&A allows the agent to instantly view the relevant information from the original PDF or image. This direct access to previously inaccessible insights within Agentforce empowers your teams, streamlines their workflows, and significantly enhances efficiency by making complex document details instantly accessible and actionable.
The Result? Your Unstructured Data Becomes an Agent’s Strategic Asset!
Eliminate the complexities of disparate data formats and the challenge of locating information buried within documents. Salesforce Data Cloud, powered by Document AI, transforms your unstructured data (UDLOs/UDMOs) into valuable, actionable structured data (DLOs/DMOs), and Agentforce ensures that critical information is consistently at your agents’ fingertips, enabling truly business-friendly AI. Document AI isn’t just about automation—it’s a strategic enabler that helps businesses unlock the full potential of their unstructured data, driving efficiency, reducing costs, and uncovering vital insights.
Ready to revolutionize your data landscape and empower your teams? Let’s connect and explore how Salesforce Data Cloud and Agentforce can optimize your operations and elevate your agents’ capabilities.
Video Resource: https://www.youtube.com/watch?v=H8cgvUP7Ytg