Howto:Processing d-tpp using Python
Revision as of 14:37, 28 November 2017
This article is a stub. You can help the wiki by expanding it.
Motivation
Modules
Scraping
Downloading
Converting to images
Classification
OCR
Prerequisites
Install the following modules with pip install --user:
- requests
- pdf2image
Code
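The article does not yet provide code, so the following is only a minimal sketch of the downloading and converting-to-images steps using the two modules listed under Prerequisites. The function names (local_name, download_pdf, pdf_to_images), the charts output directory, the 150 dpi default and the example URL are all assumptions made for illustration and are not taken from the article. pdf2image also needs the poppler utilities installed on the system.

```python
from pathlib import Path
from urllib.parse import urlparse


def local_name(url, out_dir="charts"):
    """Map a PDF URL to a local path, keeping only the file name.

    The "charts" default directory is an illustrative assumption.
    """
    name = Path(urlparse(url).path).name
    if not name.lower().endswith(".pdf"):
        raise ValueError(f"not a PDF URL: {url}")
    return Path(out_dir) / name


def download_pdf(url, out_dir="charts", timeout=30):
    """Download one PDF with requests and return the path it was saved to."""
    import requests  # imported here so local_name() works without it

    target = local_name(url, out_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # stop on HTTP errors instead of saving an error page
    target.write_bytes(response.content)
    return target


def pdf_to_images(pdf_path, dpi=150):
    """Render each page of a PDF to a PNG file next to it using pdf2image."""
    from pdf2image import convert_from_path  # needs poppler installed

    pdf_path = Path(pdf_path)
    pages = convert_from_path(str(pdf_path), dpi=dpi)
    outputs = []
    for number, page in enumerate(pages, start=1):
        out = pdf_path.with_name(f"{pdf_path.stem}_p{number}.png")
        page.save(out, "PNG")
        outputs.append(out)
    return outputs


if __name__ == "__main__":
    # Placeholder URL: replace it with the address of a real chart PDF.
    pdf = download_pdf("https://example.com/d-tpp/00000AD.PDF")
    for image in pdf_to_images(pdf):
        print(image)
```

The third-party imports sit inside the functions that use them, so the path helper can be used and tested even when requests or pdf2image is not installed. Scraping, classification and OCR are not covered by this sketch.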
See also
- https://github.com/euske/pdfminer
- https://dzone.com/articles/pdf-reading
- https://automatetheboringstuff.com/chapter13/
- https://www.binpress.com/tutorial/manipulating-pdfs-with-python/167
- https://github.com/pmaupin/pdfrw