Hero ImageHero Image

Document data extraction and manipulation in Studio Web

In this course, you'll master document data extraction and to use activities like Extract PDF Text and Write Range to efficiently automate workflows and organize data for your automation projects.
Difficulty level
Difficulty level
Intermediate
Content language
Content language
English
Product covered
Product covered
Studio Web
Completion time
Completion time
1 hour 30 minutes
About the Document data extraction and manipulation in Studio Web training

This course is designed to equip learners with the skills and knowledge required to automate workflows involving document data extraction and manipulation Through this course, you’ll understand the different types of PDFs—native PDFs and image-based PDFs—and how to handle each type effectively using Studio Web. You’ll learn how to configure and use key activities like Extract PDF Text and Extract Document Data to extract data such as text, tables, and fields from documents. Additionally, you’ll be introduced to activities such as Write Cell, Write Range, and Read Range to process and structure the extracted data, preparing it for use in reports, Excel files, or other applications.

The course also focuses on a practical use case. You’ll implement an end-to-end automation workflow that extracts data from scanned PDFs and populates Excel workbooks in Studio Web. Along the way, you’ll explore real-world scenarios, such as processing invoices, contracts, and forms, to apply the concepts learned and solve common business problems.

Learning prerequisites

To learn the fundamentals of Studio Web and how to build automation workflows, we would recommend you start with the following and then pursue this course:

  1. Build your first automation in Studio Web.

  2. Repetitive and rule-based tasks in Studio Web.

  3. Data validation and processing in Studio Web.

  4. Email and communication management in Studio Web.

Audience

The Document data extraction and manipulation course is perfect for a wide range of users, from beginners with little to no coding experience to experienced professionals, business users, and citizen developers.

Agenda

The full agenda covers:

  • Differences between native and scanned PDFs and their respective data extraction methods.

  • Learn to configure and use Extract PDF Text and Extract Document Data activities for efficient data retrieval.

  • Process and organize extracted data using Write Cell, Write Range, and Read Range activities.

  • Implement an end-to-end automation use case to solve real-world business problems.

Learning objectives

At the end of the Document data extraction and manipulation in Studio Web course, you should be able to:

  • Differentiate between native PDFs and image-based PDFs and identify appropriate data extraction methods for each.

  • Use and configure Extract PDF Text and Extract Document Data activities to extract relevant data.

  • Process and structure the extracted data for further use in workflows.

  • Implement a complete use case that extracts data from PDFs and populates workbooks in Studio Web.

  • Apply the skills learned to solve common business problems, such as processing invoices, contracts, or forms.

Get your diploma!Start the Document data extraction and manipulation in Studio Web course.
Complete the course to unlock