Build agentic document processing pipelines that convert PDFs into structured Markdown and JSON by extracting text, tables, charts, and forms without losing context from layout.
Instructors: David Park, Andrea Kropp
Explore LandingAI's Agentic Document Extraction (ADE) framework to parse complex files reliably with visual grounding and extract fields accurately through user-defined schemas.
Learn to deploy serverless RAG applications on AWS with event-driven document processing powered by LandingAI's ADE framework.
Join this new short course on Document AI, built with LandingAI and taught by David Park, Senior Director of Applied AI, and Andrea Kropp, Applied AI Engineer at LandingAI.
Much of the world's data is locked in PDFs, JPEGs, and other documents. Traditional OCR extracts text but loses critical information: the layout of tables with merged cells, the relationship between charts and captions, the reading order of multi-column layouts. This course shows you how to build agentic workflows that process documents the way humans do: breaking them into parts, examining each piece carefully, and extracting information through multiple iterations.
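To make the reading-order problem concrete, here is a minimal sketch in Python. It is not taken from the course or from any OCR library: the `Block` structure, the coordinates, and the two-column split at the page midline are all assumptions chosen for illustration. It shows how a plain top-to-bottom sort interleaves two columns, while a column-aware sort keeps each column together.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """A text block and the top-left corner of its bounding box (hypothetical structure)."""
    text: str
    x: float  # left edge, in page units
    y: float  # top edge, in page units (0 = top of page)


def naive_order(blocks):
    # Sort top to bottom, then left to right. On a multi-column page
    # this interleaves lines from different columns.
    return [b.text for b in sorted(blocks, key=lambda b: (b.y, b.x))]


def column_aware_order(blocks, page_width):
    # Assumption: exactly two columns, split at the page midline.
    # Real layout detection has to infer the columns from the page itself.
    mid = page_width / 2
    left = sorted((b for b in blocks if b.x < mid), key=lambda b: b.y)
    right = sorted((b for b in blocks if b.x >= mid), key=lambda b: b.y)
    return [b.text for b in left + right]


# Four blocks on a 600-unit-wide page: two in each column.
blocks = [
    Block("L1", x=50, y=100),
    Block("R1", x=350, y=100),
    Block("L2", x=50, y=200),
    Block("R2", x=350, y=200),
]

print(naive_order(blocks))                         # ['L1', 'R1', 'L2', 'R2']
print(column_aware_order(blocks, page_width=600))  # ['L1', 'L2', 'R1', 'R2']
```

The fixed midline is the weakest part of this sketch. Pages with three columns, full-width headings, or sidebars would need the columns to be detected rather than assumed.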
You'll start by exploring traditional OCR. After understanding its limitations, you'll build agents equipped with additional tools for document processing like layout detection, reading order, and multimodal reasoning models. Next, you'll learn to use the Agentic Document Extraction (ADE) framework from LandingAI to automate this workflow. ADE treats documents as visual objects. It uses custom models to parse complex elements and ground extracted fields to precise locations on the page. You'll integrate ADE into RAG applications and deploy them as production-ready pipelines on AWS.
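As a rough illustration of what schema-driven extraction with visual grounding can look like, here is a minimal sketch in plain Python. It does not use or reproduce the ADE API. Every name in it (`BoundingBox`, `GroundedField`, `INVOICE_SCHEMA`, `validate`) is hypothetical, and the extracted values are made up. The idea it shows is that each extracted value carries the page and box it came from, and a user-defined schema decides which fields are required and how they are typed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Location of a value on a page, in normalized coordinates (hypothetical)."""
    page: int
    left: float
    top: float
    right: float
    bottom: float


@dataclass(frozen=True)
class GroundedField:
    """An extracted value together with where it was found (hypothetical)."""
    name: str
    value: str
    box: BoundingBox


# Hypothetical user-defined schema: field name -> parser for the raw text.
INVOICE_SCHEMA = {
    "invoice_number": str,
    "total": float,
}


def validate(fields, schema):
    """Return typed values plus their source boxes; raise if a schema field is missing."""
    by_name = {f.name: f for f in fields}
    missing = [name for name in schema if name not in by_name]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    result = {}
    for name, parse in schema.items():
        field = by_name[name]
        result[name] = {"value": parse(field.value), "source": field.box}
    return result


# Made-up extraction output for one invoice page.
extracted = [
    GroundedField("invoice_number", "INV-001", BoundingBox(0, 0.10, 0.05, 0.30, 0.08)),
    GroundedField("total", "1250.50", BoundingBox(0, 0.70, 0.90, 0.85, 0.93)),
]

record = validate(extracted, INVOICE_SCHEMA)
print(record["total"]["value"])        # 1250.5
print(record["total"]["source"].page)  # 0
```

Keeping the bounding box next to each value is what lets a reviewer or a downstream application trace a number back to the exact spot on the page it came from.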
In detail, you'll:
Document AI is transforming how organizations unlock value from unstructured data. Whether you're handling financial invoices, medical records, or academic papers, this course gives you the tools and techniques to build systems for intelligent document processing.
AI builders and developers who want to automate the extraction of information from documents. Basic familiarity with Python is recommended to make the most of this course.