Commit Graph

  • d9346cc3d8 Bump requests from 2.32.5 to 2.33.0 dependabot/uv/requests-2.33.0 dependabot[bot] 2026-03-26 17:26:22 +00:00
  • 4e974ebd46 Bump version: v17.4.0 main v17.4.0 James R. Barlow 2026-03-21 01:43:13 -07:00
  • 6f2b8408c1 v17.4.0 release notes James R. Barlow 2026-03-21 01:43:03 -07:00
  • 1dba941261 Add cyclopts for dev James R. Barlow 2026-03-21 01:37:48 -07:00
  • ef76625abb Fix text stretching in fpdf2 renderer for widely-spaced words James R. Barlow 2026-03-16 16:00:00 -07:00
  • ca6dda999f Bump tornado from 6.5.4 to 6.5.5 dependabot/uv/tornado-6.5.5 dependabot[bot] 2026-03-12 00:30:42 +00:00
  • 57bb554a70 Fix verapdf NotADirectoryError crash on some platforms James R. Barlow 2026-03-10 02:08:59 -07:00
  • 5b9d6f979e Add --no-overwrite / -n option to prevent overwriting output files James R. Barlow 2026-03-10 01:58:57 -07:00
  • b588e3bfd7 Fix optimize=2/3 crash when using Python API James R. Barlow 2026-03-10 01:51:07 -07:00
  • a35dd1f9ee Bump docker/setup-buildx-action from 3 to 4 dependabot/github_actions/docker/setup-buildx-action-4 dependabot[bot] 2026-03-09 11:18:20 +00:00
  • bf46f4fe35 Bump docker/login-action from 3 to 4 dependabot/github_actions/docker/login-action-4 dependabot[bot] 2026-03-09 11:18:16 +00:00
  • 55b76338a8 Bump docker/setup-qemu-action from 3 to 4 dependabot/github_actions/docker/setup-qemu-action-4 dependabot[bot] 2026-03-09 11:18:10 +00:00
  • 2af7b1c179 Bump actions/download-artifact from 7 to 8 dependabot/github_actions/actions/download-artifact-8 dependabot[bot] 2026-03-02 11:32:46 +00:00
  • 69f4cca9b6 Bump actions/upload-artifact from 6 to 7 dependabot/github_actions/actions/upload-artifact-7 dependabot[bot] 2026-03-02 11:32:40 +00:00
  • 59190ef643 Bump version: v17.3.0 v17.3.0 James R. Barlow 2026-02-21 00:00:26 -08:00
  • 910ccccc7d Fix bump-version James R. Barlow 2026-02-21 00:00:14 -08:00
  • 0c15ff594c v17.3.0 release notes James R. Barlow 2026-02-20 23:52:48 -08:00
  • e19ea653aa Switch to static versioning and two-workflow release model James R. Barlow 2026-02-20 23:34:03 -08:00
  • a899f0d59a Split release_notes into parts for each major release James R. Barlow 2026-02-20 18:19:31 -08:00
  • b4e8e9dac9 Fix Python API ignoring language parameter (fixes #1640) James R. Barlow 2026-02-20 17:03:16 -08:00
  • aca5eb626b Docker: increase alpine version to 3.23 James R. Barlow 2026-02-20 11:06:33 -08:00
  • bd4a74de0e Restore image rendering for hocrtransform James R. Barlow 2026-02-18 18:00:34 -08:00
  • 10b71937c4 Fix OCR text displacement on PDFs with non-zero MediaBox origins James R. Barlow 2026-02-17 23:34:33 -08:00
  • 5890d1855e Fix Python API producing empty OCR due to tesseract_timeout defaulting to 0 James R. Barlow 2026-02-17 21:55:49 -08:00
  • 3da952a23d Fix garbled Arabic/Devanagari text by using HarfBuzz text shaping v17.2.0 James R. Barlow 2026-02-11 01:30:15 -08:00
  • 716ce6324c Update dependencies James R. Barlow 2026-02-11 00:43:01 -08:00
  • 76fe2f7e28 Merge remote-tracking branch 'origin/dependabot/uv/cryptography-46.0.5' James R. Barlow 2026-02-11 00:42:21 -08:00
  • c85c8941d3 Fix pdftotext word spacing by emitting single BT block per line James R. Barlow 2026-02-11 00:38:38 -08:00
  • 9a0dadbd4c Bump cryptography from 46.0.4 to 46.0.5 dependabot[bot] 2026-02-11 02:57:37 +00:00
  • 4d7e398c4b Suppress rendering of text lines with improbable aspect ratios James R. Barlow 2026-02-10 17:42:33 -08:00
  • 56c0b41f97 Fix extreme font sizes for rotated text in fpdf2 renderer James R. Barlow 2026-02-10 17:02:25 -08:00
  • 5c83dab8a7 Fix fpdf text mode in multi-page renderer; add v17.2.0 release notes James R. Barlow 2026-02-10 14:12:49 -08:00
  • e62e73e441 Fix fpdf text mode James R. Barlow 2026-02-09 02:05:23 -08:00
  • d68e2f6e34 Fix OCR text layer misalignment with non-zero mediabox origins James R. Barlow 2026-02-08 23:55:26 -08:00
  • 1684982cde Further adjustments to install docs James R. Barlow 2026-02-06 17:17:44 -08:00
  • 4d97dfd218 Update installation docs for modern tooling James R. Barlow 2026-02-05 15:04:12 -08:00
  • a35fcc9c43 Handle Ghostscript rasterization with DPI below 10 James R. Barlow 2026-01-31 13:01:04 -08:00
  • 3dd4cde7ce Tighten plugin manager return types to non-optional James R. Barlow 2026-01-31 12:12:07 -08:00
  • 92beb474a5 Normalize unpaper_args to list at construction time James R. Barlow 2026-01-31 12:05:37 -08:00
  • 9dcd882c83 Use uv to install docs with dependency groups James R. Barlow 2026-01-31 00:05:27 -08:00
  • 9d8aa5a0c3 v17.1.0 release notes v17.1.0 James R. Barlow 2026-01-30 16:15:50 -08:00
  • e036a902ae Add --tagged-pdf-mode option to control Tagged PDF handling James R. Barlow 2026-01-30 14:01:23 -08:00
  • 0a980fb11b Add Encoding.flate_jpeg to recognize deflated JPEG images James R. Barlow 2026-01-30 12:53:59 -08:00
  • 3abe8f71c7 v17.0.1 release notes v17.0.1 James R. Barlow 2026-01-30 00:15:13 -08:00
  • 64f45b7fdb Fix pypdfium type checking James R. Barlow 2026-01-30 00:14:02 -08:00
  • 7e939ad44d Fix pypdfium rasterizer to respect raster_device colorspace James R. Barlow 2026-01-30 00:04:02 -08:00
  • 297fb786a0 Update uv.lock (for protobuf) v17.0.0 James R. Barlow 2026-01-29 18:33:00 -08:00
  • ad30dd94f7 Merge branch 'release/v17' James R. Barlow 2026-01-29 18:31:11 -08:00
  • e77f79ac6f Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2026-01-29 18:30:54 -08:00
  • c84fc56e45 Update CLI completions to match current options James R. Barlow 2026-01-29 12:41:56 -08:00
  • 0a0756b33e Tidy long lines and unnested with blocks release/v17 James R. Barlow 2026-01-27 15:28:27 -08:00
  • c5d3ef4b17 Tighten ruff rules and modernize style James R. Barlow 2026-01-27 14:04:52 -08:00
  • 6b37583674 Refactor: move ocr_element to a better location James R. Barlow 2026-01-27 13:48:38 -08:00
  • de5f2b80f0 Further patching-out of fonts v17.0.0b1 James R. Barlow 2026-01-21 11:43:54 -08:00
  • d951b4f0f7 Improve font fallback checking James R. Barlow 2026-01-21 10:38:07 -08:00
  • b386d39b3b tests: fix test_page_boxes when verapdf unavailable James R. Barlow 2026-01-21 00:22:26 -08:00
  • ec595a395b tests: little fixes James R. Barlow 2026-01-20 23:23:43 -08:00
  • bd29269c00 Various test fixes, mainly Windows issues James R. Barlow 2026-01-20 22:28:06 -08:00
  • 6fb7c5d95f Additional build fixes James R. Barlow 2026-01-20 21:49:40 -08:00
  • d57552c4f8 test: For Windows, ensure outputs are UTF-8 James R. Barlow 2026-01-20 21:33:31 -08:00
  • f017c982cf watcher: use modern API James R. Barlow 2026-01-20 21:25:12 -08:00
  • 7ac51ac1a7 Fix type alias for Queue causing runtime TypeError James R. Barlow 2026-01-20 20:38:33 -08:00
  • db9f94de14 Ensure Noto font is installed where needed James R. Barlow 2026-01-20 19:50:47 -08:00
  • 37e7131a01 Drop support for Python 3.10, require Python 3.11+ James R. Barlow 2026-01-20 11:54:55 -08:00
  • bc745d4d81 Replace magic Ghostscript raster device strings with StrEnum James R. Barlow 2025-11-11 14:03:37 -08:00
  • c818ad5e75 Drop deprecated NeverRaise exception James R. Barlow 2025-11-11 14:02:45 -08:00
  • 4b16228a4a docs: minor adjustments James R. Barlow 2026-01-20 10:41:55 -08:00
  • d40fca2590 Add verapdf to build for macOS James R. Barlow 2026-01-20 10:41:43 -08:00
  • 99f8106936 Update API documentation for OcrOptions-first calling convention James R. Barlow 2026-01-20 10:30:33 -08:00
  • ef88ba3f95 Add OcrOptions as first-class argument to ocr() function James R. Barlow 2026-01-20 10:20:52 -08:00
  • 2f4280b66c Comprrehensive documentation update in preparation for v17 James R. Barlow 2026-01-16 01:38:47 -08:00
  • 6cf9d1c6ee Update release notes James R. Barlow 2026-01-15 23:29:29 -08:00
  • 6a7164a76c Update release notes with branch changes James R. Barlow 2026-01-15 23:25:51 -08:00
  • 3f328785f0 Fix pypdfium rasterizer to match Ghostscript dimensions James R. Barlow 2026-01-14 14:37:24 -08:00
  • 5acf21651f ruff lint and format James R. Barlow 2026-01-13 01:50:57 -08:00
  • 7bfe3ecd5b Fix double-compression of already-deflated JPEGs James R. Barlow 2026-01-13 01:41:59 -08:00
  • 5371cc5e39 Update test to match new error messag James R. Barlow 2026-01-13 01:33:10 -08:00
  • 4c7086c609 Replace typer with cyclopts CLI library in misc scripts James R. Barlow 2026-01-13 00:43:14 -08:00
  • bf76c8270c Rationalize optional dependencies vs dependency groups James R. Barlow 2026-01-13 00:34:55 -08:00
  • 740f67091c Rename OCROptions to OcrOptions for consistency James R. Barlow 2026-01-12 23:37:54 -08:00
  • 36dea181e6 Update cookbook: Replace --tesseract-timeout 0 with --ocr-engine none James R. Barlow 2026-01-12 23:28:14 -08:00
  • c69f293322 Add --mode/-m CLI argument with ProcessingMode enum James R. Barlow 2026-01-12 15:23:08 -08:00
  • e9fe061c30 Format fix James R. Barlow 2026-01-12 10:25:24 -08:00
  • c9ea07e954 Reduce chattiness of fonttools James R. Barlow 2026-01-12 10:16:58 -08:00
  • 0c3745a1a4 Add OCR engine selection framework and null OCR engine James R. Barlow 2026-01-10 02:23:52 -08:00
  • 664c3e2a8e Update test cache for slow rotation tests James R. Barlow 2026-01-10 16:30:25 -08:00
  • 315d0df0e9 Fix incorrect rotation direction in pypdfium rasterizer James R. Barlow 2026-01-10 16:29:49 -08:00
  • 3c94ada857 Fix tesseract_cache plugin to properly handle cache misses James R. Barlow 2026-01-09 02:10:29 -08:00
  • fcbdbac602 Update test_page_boxes MediaBox expectations for speculative PDF/A James R. Barlow 2026-01-09 01:25:31 -08:00
  • 122450c19e Fix Ghostscript tests after default output type changed to 'auto' James R. Barlow 2026-01-09 01:02:25 -08:00
  • 0c4ee5af4e Add 'auto' output type for best-effort PDF/A without Ghostscript James R. Barlow 2026-01-09 00:56:00 -08:00
  • bdc50e9470 Add explicit word spacing for pdfminer.six compatibility James R. Barlow 2026-01-08 16:32:14 -08:00
  • 4cb488d0fc Skip speculative PDF/A when --pdfa-image-compression is set James R. Barlow 2026-01-08 15:12:35 -08:00
  • bb5238e524 Update tests to use new OcrmypdfPluginManager interface James R. Barlow 2026-01-08 13:09:19 -08:00
  • 900a60fd10 Add verapdf integration for speculative PDF/A conversion James R. Barlow 2026-01-08 10:58:01 -08:00
  • f5617ce44e Refactor OcrmypdfPluginManager to use composition over inheritance James R. Barlow 2026-01-07 17:23:13 -08:00
  • 0e946a7498 Clarify messageabout number of workers James R. Barlow 2026-01-07 16:41:18 -08:00
  • b2b6a7c4b1 Pass OMP_THREAD_LIMIT to Tesseract subprocesses instead of modifying parent env James R. Barlow 2026-01-06 18:43:29 -08:00
  • 75c664793e Don't share claude James R. Barlow 2026-01-06 13:46:40 -08:00
  • bbd263ff48 Add tests for fpdf2 renderer and font infrastructure James R. Barlow 2026-01-06 13:46:11 -08:00