Repo for the search and displace ingest module, which takes ODF, DOCX, and PDF files and transforms them into Markdown (.md) for use with search and displace operations.

page_dewarp

Page dewarping and thresholding using a "cubic sheet" model. See the full writeup at https://mzucker.github.io/2016/08/15/page-dewarping.html
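
As a rough illustration of what a "cubic sheet" can mean, the sketch below assumes the page cross-section is a cubic curve over the unit interval that is pinned to zero height at both edges and shaped only by its two end slopes. The function names and the `alpha`/`beta` parameter names are illustrative assumptions, not taken from page_dewarp.py.

```python
def cubic_sheet_coeffs(alpha, beta):
    """Return (a, b, c, d) for f(x) = a*x**3 + b*x**2 + c*x + d.

    The coefficients follow from four constraints (an assumed
    parameterization): f(0) = 0, f(1) = 0, f'(0) = alpha, f'(1) = beta.
    """
    return (alpha + beta, -2.0 * alpha - beta, alpha, 0.0)


def cubic_sheet_height(x, alpha, beta):
    """Evaluate the sheet height at x in [0, 1] using Horner's rule."""
    a, b, c, d = cubic_sheet_coeffs(alpha, beta)
    return ((a * x + b) * x + c) * x + d
```

With both slopes set to zero the sheet is flat, so a flat page is the special case `alpha = beta = 0`.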

Requirements:

  • scipy
  • OpenCV 3.0 or greater
  • Image module from PIL or Pillow
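
A quick way to check whether the requirements above are importable is sketched below. It assumes the usual import names `scipy`, `cv2` (OpenCV), and `PIL` (PIL or Pillow); the helper name `missing_modules` is hypothetical and not part of this repo. It does not check the OpenCV version.

```python
import importlib.util


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed import names for the three listed requirements.
REQUIRED = ["scipy", "cv2", "PIL"]

missing = missing_modules(REQUIRED)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules are importable.")
```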

Usage:

page_dewarp.py IMAGE1 [IMAGE2 ...]
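
To run the script from another Python program, one option is to build the same command line and hand it to `subprocess`. This is a minimal sketch: the helper names are hypothetical, and it assumes page_dewarp.py sits in the current working directory and accepts only the positional image arguments shown in the usage line.

```python
import subprocess
import sys


def build_command(images, script="page_dewarp.py"):
    """Build the argument list: interpreter, script, then one or more images."""
    if not images:
        raise ValueError("at least one image path is required")
    return [sys.executable, script, *images]


def run_dewarp(images, script="page_dewarp.py"):
    """Run the script on the given images and raise if it exits with an error."""
    subprocess.run(build_command(images, script), check=True)
```

For example, `run_dewarp(["scan1.jpg", "scan2.jpg"])` would be equivalent to passing both files on the command line (the file names here are placeholders).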