Back to Podcast Digest
AI Engineer20m

Structuring the Unstructured - Cedric Clyburn, Red Hat

TL;DR

  • 50x cost savings: A Hugging Face engineer processed Common Crawl PDFs using Docling at 50 times lower cost than VLMs and naive OCR, running entirely on CPU without needing GPUs.

  • Tables and images preserved: Unlike basic PDF parsers that output garbled linear text, Docling extracts tables as dataframes and annotates images using local vision models like Granite.

  • Chunkless RAG pattern: Docling enables retrieval without vector databases by using document outlines as the index, letting LLMs find relevant sections directly from markdown structure.

  • Local and open source: Part of the Linux Foundation, Docling runs entirely on your machine with no data leaving your environment, critical for air-gapped enterprise deployments.

  • Scales via microservices: Docling Serve deploys as a REST API for processing thousands of documents, while an MCP server lets AI agents like Claude Code handle document conversion autonomously.

The Breakdown

Cedric Clyburn from Red Hat demonstrates Docling, an open-source document processing tool that converts PDFs and other unstructured files into LLM-ready formats at 50x lower cost than vision language models, all running locally without GPUs. He shows live demos of table extraction, image annotation, chunkless RAG patterns, and deployment options via REST API and MCP servers.

Was This Useful?

Share