HomeProductsServicesPortfolioContact Us

Data Extraction & OCR

Historical Document Digitization with LayoutParser (Python)

May 8, 2023 – May 16, 2023

Image for Historical Document Digitization with LayoutParser (Python)

Key Technologies

PythonLayoutParserPDF Data Extraction

Key Achievements

  • Successfully digitized complex historical documents for research and analysis.

Role & Contributions

Extracted historical multi-column data from PDFs using LayoutParser in Python.