HyperAIHyperAI

Command Palette

Search for a command to run...

Docling: Document Parsing Tool

GitHub
Stars

1. Tutorial Introduction

Docling

Docling is an open-source, versatile document conversion tool launched by IBM in 2024. It aims to simplify and automate the process of converting documents. It supports converting a variety of common file formats (such as PDF, Word, PPTX, Markdown, etc.) into a variety of different output formats, such as text, Markdown, Doctags, JSON, and YAML.

Docling adopts a modular design for document conversion and processing, and different conversion modes can be replaced as needed to meet different requirements.

Key features:

  • Supports conversion of multiple document formats to Text , Markdown , Doctags , JSON , YAML  Format.
  • Supports multiple input formats, including PDF, DOCX, PPTX, MD, ASCIIDOC, etc.
  • It provides a clear and concise interface for easy integration with other applications.
  • Supports building a visual interface through Gradio, allowing users to perform interactive file upload and conversion operations.

Supported file formats:

  • PDF: Can be converted to Text, Markdown, Doctags, JSON and YAML formats.
  • DOCX: Can be converted to Text, Markdown, Doctags, JSON and YAML formats.
  • PPTX: Can be converted to Text, Markdown, Doctags, JSON and YAML formats.
  • Markdown: Can be converted to Text, Markdown, Doctags, JSON and YAML formats.
  • ASCIIDOC: Can be converted to JSON and YAML formats.

2. Operation steps

1. Start the container

通过 API 地址进入 Web 界面
Web Interface

2. File conversion

进入 web 界面后,按照以下步骤进行操作:
Lighting Control Steps

3. Exchange and Discussion

🖌️ If you find a high-quality project, please leave a message in the background to recommend it! In addition, we have also established a tutorial exchange group. Welcome everyone to scan the QR code to join the group, note [SD Tutorial], discuss technical issues with everyone, and share application results!

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Docling: Document Parsing Tool | Tutorials | HyperAI