Commit Graph

  • 067f429dde Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-03-26 15:34:00 -07:00
  • 6895c2d70f Fix Broken Documentation Links (#1275) Daniel Lovegrove 2024-03-22 16:38:52 -05:00
  • 686481982a Fix naming of hOCR rendered files James R. Barlow 2024-03-22 13:27:20 -07:00
  • a9e1d19b78 v16.1.2 release notes v16.1.2 James R. Barlow 2024-03-20 12:56:13 -07:00
  • f95aa63718 Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-03-20 12:26:02 -07:00
  • 855de287b2 Fix test suite failure with Ghostscript >= 10.3 James Barlow 2024-03-19 17:20:33 -07:00
  • feeb9f213f batch example: added archive, small corrections and optimizations (#1277) NilsRo 2024-03-18 21:22:24 +01:00
  • e7eb8fa805 Update Dockerfile.alpine (#1268) Emiel Molenaar 2024-03-13 22:49:42 +01:00
  • 8a747f005a pixels -> megapixels James R. Barlow 2024-02-29 15:31:07 -08:00
  • 16ab4a8b4e Fix error message about missing Python exec James R. Barlow 2024-02-21 23:54:41 -08:00
  • 8d30cff4ef Undo future annotations from watcher.py till Typer fixes its issue James R. Barlow 2024-02-20 19:14:24 -08:00
  • 59d5b0d1bd v16.1.1 release notes v16.1.1 James R. Barlow 2024-02-15 16:56:25 -08:00
  • 9ec0745ab8 Try pypy3.10 James R. Barlow 2024-02-14 14:25:13 -08:00
  • 3a3635f7f9 Python 3.10 cleanup, manual fixes James R. Barlow 2024-02-14 12:48:17 -08:00
  • 6a746a1cbb ruff linting/Python 3.10 cleanup James R. Barlow 2024-02-14 12:41:51 -08:00
  • 906c130f96 Update rust toml settings James R. Barlow 2024-02-14 12:32:26 -08:00
  • 4a78458821 v16.1.0 release notes v16.1.0 James R. Barlow 2024-02-12 01:46:21 -08:00
  • fddf3ce2f4 Clarify warnings filter James R. Barlow 2024-02-12 01:43:47 -08:00
  • 353b34e695 Merge branch 'feature/pageboxes' James R. Barlow 2024-02-12 01:41:56 -08:00
  • 7d63355c3c Use hocr renderer for LTR languages James R. Barlow 2024-02-12 01:41:41 -08:00
  • 42ff7fc842 Fix handling of pages that are restored to correct orientation with /Rotate James R. Barlow 2024-02-12 01:32:26 -08:00
  • 26470fe16a Suppress reportlab deprecation warning James R. Barlow 2024-02-12 01:17:08 -08:00
  • 3b9d4b7f0a Attempt to deal with oddball mediaboxes James R. Barlow 2023-10-31 00:33:10 -07:00
  • 11f53fe9a9 First cut at propagating page boxes James R. Barlow 2023-10-31 00:12:15 -07:00
  • 123c0c766f Mention pipx, install --user --upgrade James R. Barlow 2024-02-08 09:42:00 -08:00
  • 6a9be2142e Advise Homebrew on Linux for Ubuntu 20.04 James R. Barlow 2024-02-07 19:52:50 -08:00
  • 0bc350f55e Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-02-06 01:28:10 -08:00
  • 7a6edf62ba Bump codecov/codecov-action from 3 to 4 (#1247) dependabot[bot] 2024-02-05 03:55:13 -08:00
  • 07b6f06f11 optimize: log images with unclear decode tables James R. Barlow 2024-02-01 15:42:40 -08:00
  • 2005f622bb Update gs dependency & instructions for RHEL (#1228) nisbet-hubbard 2024-01-25 02:07:58 +08:00
  • cca04fd799 Document ability to remove all OCR James R. Barlow 2024-01-23 23:35:43 -08:00
  • 75bf8e4ba2 Add autotools automake libtool and leptonica requirements (#1230) Maxi 2024-01-09 20:58:17 +01:00
  • daabb5b100 Minor english correction in Docs (#1229) Anish Sapkota 2024-01-09 00:03:26 +05:45
  • 035ebea72f Change Windows build to use older Ghostscript v16.0.4 James R. Barlow 2024-01-07 01:56:38 -08:00
  • a499956462 v16.0.4 release notes James R. Barlow 2024-01-07 01:39:06 -08:00
  • 74d2a156c4 Update cache James R. Barlow 2024-01-07 01:35:05 -08:00
  • f87fc7b12d gs: check for text-corrupting versions James R. Barlow 2024-01-07 01:33:49 -08:00
  • 602f5632cb hocr: fix unbound local James R. Barlow 2024-01-07 01:33:36 -08:00
  • 9fbbcf7599 Merge branch 'feature/hocrdebug' James R. Barlow 2024-01-07 01:22:54 -08:00
  • 9498f01f59 hocr: Pass text direction for standalone words James R. Barlow 2024-01-06 16:21:21 -08:00
  • 2c59aca5a1 Fix render_mode ON_TOP when previous content stream does not restore CTM James R. Barlow 2024-01-06 16:20:55 -08:00
  • 51301d69c9 Change default graft render mode back to underneath James R. Barlow 2024-01-06 16:20:30 -08:00
  • 7e608fd1df Use Courier as debug font James R. Barlow 2024-01-06 14:24:25 -08:00
  • ecc79315df Adjust issue template files description James R. Barlow 2024-01-04 00:27:06 -08:00
  • 14365d10b8 Skip testing oom killer on Python 3.12 v16.0.3 James R. Barlow 2024-01-02 16:28:22 -08:00
  • 5e5320020f v16.0.3 release notes James R. Barlow 2024-01-02 02:49:54 -08:00
  • 103c3e0cd6 Drop Ghostscript CVE warning and adjust version down to 9.54 James R. Barlow 2024-01-01 13:02:57 -08:00
  • 7a1c89edd9 fixed a spelling mistake (#1222) Anthony Nabil 2023-12-30 15:53:22 -08:00
  • a5ff3d2f42 Update instructions about custom language packages James R. Barlow 2023-12-27 16:46:10 -08:00
  • b71d16dd96 Draw RTL text with reversed matrix instead of reversing characters James R. Barlow 2023-12-24 01:34:03 -08:00
  • fd593eb5e9 Reversing character order for RTL helps output James R. Barlow 2023-12-24 01:24:44 -08:00
  • a0b98abb94 Improve and externalize debug rendering settings James R. Barlow 2023-12-24 00:10:09 -08:00
  • 18353e1e94 Also fix space rendering James R. Barlow 2023-12-24 00:09:47 -08:00
  • 9adcad84da Overhaul debug rendering to support possible use of Helvetica for testing James R. Barlow 2023-12-23 23:47:35 -08:00
  • f2714586d8 Fix upside down glyphless font James R. Barlow 2023-12-23 23:44:37 -08:00
  • 0b6fb62967 Update issue template v16.0.2 James R. Barlow 2023-12-23 16:21:17 -08:00
  • 1db8b0b943 Temporarily revert default renderer to sandwich while investigating regressions in macOS Preview James R. Barlow 2023-12-23 16:19:44 -08:00
  • f38aebb3d5 Update issue template James R. Barlow 2023-12-23 15:46:43 -08:00
  • 7162c36d37 Name artifacts explicitly v16.0.1.post1 James R. Barlow 2023-12-20 13:53:31 -08:00
  • f4d4ea46c8 Update artifact actions v16.0.1 James R. Barlow 2023-12-20 12:44:43 -08:00
  • 2fd1a0f178 v16.0.1 release notes James R. Barlow 2023-12-20 12:33:41 -08:00
  • 73ed33a086 Tighten dependencies James R. Barlow 2023-12-20 12:33:18 -08:00
  • e6095a9949 Fix text rendering issue with new hOCR text renderer James R. Barlow 2023-12-20 12:26:06 -08:00
  • 16f05af401 Fix release notes - drop rc from version James R. Barlow 2023-12-18 20:08:27 -08:00
  • 1631afc878 Bump actions/setup-python from 4 to 5 (#1205) v16.0.0 dependabot[bot] 2023-12-17 15:35:13 -08:00
  • 63d87fc440 Fix --fast-web-view documentation (#1206) Robin Richtsfeld 2023-12-17 23:54:38 +01:00
  • 9489c01259 Skip test_encrypted on Py3.12 + macOS v16.0.0rc2 James R. Barlow 2023-12-08 00:12:24 -08:00
  • 30d92ad83f Fix build settings to adjust for dropping py39 James R. Barlow 2023-12-07 23:40:45 -08:00
  • a4987733c4 Filter rl_safe_eval deprecation warning James R. Barlow 2023-12-07 23:40:23 -08:00
  • 39eee05230 v16.0.0rc1 release notes v16.0.0rc1 James R. Barlow 2023-12-03 15:44:34 -08:00
  • 5b2f2e6290 Merge branch 'feature/modernhocr' James R. Barlow 2023-12-03 15:17:02 -08:00
  • 445617a1a5 Rebuild cache for hocr default case James R. Barlow 2023-12-03 15:16:18 -08:00
  • f6e90a5934 hOCR renderer is now default James R. Barlow 2023-12-02 19:58:00 -08:00
  • 43618e6b3f Move canvas API to pikepdf and import it James R. Barlow 2023-12-02 19:42:35 -08:00
  • e97f89de3b Refactor font so glyphless isn't as hard coded James R. Barlow 2023-12-02 08:55:01 -08:00
  • 11d3e32f1e Fix hocrtransform CLI James R. Barlow 2023-12-02 08:08:29 -08:00
  • 2affa83efe Remove code that attempted to manage xattrs out of output file James R. Barlow 2023-11-29 23:25:51 -08:00
  • c90d5cd84b Fix Ghostscript installation instructions and add warning v15.4.4 James R. Barlow 2023-11-29 14:10:04 -08:00
  • aacaba3d26 Ignore pypy for now James R. Barlow 2023-11-21 01:05:23 -08:00
  • fec53be841 Remove next major release deprecations James R. Barlow 2023-11-21 00:47:40 -08:00
  • 3f7b540f76 Drop Python 3.9 support James R. Barlow 2023-11-21 00:46:00 -08:00
  • d217856166 Make hocrdebug work, and try to handle CJK spacing better James R. Barlow 2023-11-21 00:33:02 -08:00
  • e2be457e9b Avoid divzero James R. Barlow 2023-11-20 23:08:00 -08:00
  • 4850f486d2 Make text API more like an accessor James R. Barlow 2023-11-20 22:59:50 -08:00
  • 729c7febd9 Fix placement of spaces in debug mode James R. Barlow 2023-11-20 22:44:12 -08:00
  • 6c6aca2f1e Refactor save_state James R. Barlow 2023-11-20 22:29:21 -08:00
  • c69823f496 Refactor; accumulate content stream as bytes rather than discrete pikepdf objects James R. Barlow 2023-11-20 22:11:59 -08:00
  • 73f8f6aac8 Add RTL output - seems to work, but debug does not James R. Barlow 2023-11-20 20:28:07 -08:00
  • d944254e45 hocr: typing cont'd James R. Barlow 2023-11-20 17:07:52 -08:00
  • f7ddffe554 hocr: typing James R. Barlow 2023-11-20 16:52:55 -08:00
  • 8a73ed5d5a Fix JBIG2 not updating progress bar James R. Barlow 2023-11-20 16:21:37 -08:00
  • 03669183d7 Rationalize canvas interface James R. Barlow 2023-11-20 15:31:44 -08:00
  • 74e101a2fa Improve canvas interface with chaining James R. Barlow 2023-11-20 14:42:48 -08:00
  • 532cf18ad3 Restructure hocrtransform submodule to avoid having everything in __init__ James R. Barlow 2023-11-20 00:57:58 -08:00
  • 0b90b697e2 More tidying James R. Barlow 2023-11-20 00:43:43 -08:00
  • 6be7c5f7c8 Fix colors and space box rendering James R. Barlow 2023-11-20 00:30:54 -08:00
  • db2e5132e6 Remove some obsolete parameters James R. Barlow 2023-11-20 00:10:55 -08:00
  • b14f6f778a Tidying new hOCR renderer James R. Barlow 2023-11-19 23:51:27 -08:00
  • 415de77457 imageops: fix annots since not using singledispatch anymore James R. Barlow 2023-11-19 23:51:27 -08:00
  • a9466c4f58 Improve word box positioning James R. Barlow 2023-11-19 23:51:27 -08:00