PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python

32 branches 29 tags

https://code.morphllm.com/PaddlePaddle/PaddleOCR.git

ai4science chineseocr document-parsing document-translation kie ocr paddleocr-vl pdf2markdown pdf-extractor-rag pdf-parser pp-ocr pp-structure rag

	.github
	applications
	benchmark
	configs
	deploy
	doc
	docs
	langchain-paddleocr
	mcp_server
	overrides
	paddleocr
	ppocr
	ppstructure
	readme
	skills
	test_tipc
	tests
	tools
	.clang_format.hook		526 B
	.gitignore		462 B
	.lycheeignore		115 B
	.pre-commit-config.yaml		1.3 KB
	.style.yapf		48 B
	awesome_projects.md		6.8 KB
	CNAME		17 B
	LICENSE		11.1 KB
	MANIFEST.in		399 B
	mkdocs-ci.yml		148 B
	mkdocs.yml		17.3 KB
	pyproject.toml		2.3 KB
	README.md		55.5 KB
	requirements.txt		196 B
	setup.py		650 B
	train.sh		185 B