Abstract
This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware in a small resource budget. The code interface allows for easy extensibility and addition of new features and models.
Community
arXiv explained breakdown of this paper 👉 https://arxivexplained.com/papers/docling-technical-report
Models citing this paper 13
Browse 13 models citing this paperDatasets citing this paper 0
No dataset linking this paper
Cite arxiv.org/abs/2408.09869 in a dataset README.md to link it from this page.