PDF Tools
Status: active
Open-source Python server for PDF processing and OCR text extraction (Arabic and English) over gRPC. It includes an SQLite-backed smart cache, a priority queue, and parallel processing, and is designed to serve Flutter and mobile applications.
Features
PDF Processing
Compress, merge, and convert PDF files at high speed
Arabic & English OCR
Extract text from images and PDFs with high accuracy
Smart Cache
500 MB SQLite cache: a file that was already processed is returned in 0.05 s
Priority Queue
Handles up to 50 concurrent operations with 3 parallel workers
gRPC Protocol
Fast and reliable communication with Flutter and mobile apps
Arabic Support
Full Arabic text support with arabic-reshaper and python-bidi
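To illustrate the Smart Cache feature listed above, here is a minimal sketch of a content-addressed result cache on top of SQLite. It is not taken from the project's source: the class name `ResultCache`, the helper `cached_process`, the table layout, and the least-recently-used eviction policy are all assumptions made for the example. Only the 500 MB size limit comes from the feature list.

```python
import hashlib
import sqlite3


class ResultCache:
    """Hypothetical SQLite-backed cache keyed by file content (illustration only)."""

    def __init__(self, path=":memory:", max_bytes=500 * 1024 * 1024):
        self.max_bytes = max_bytes  # 500 MB, the figure from the feature list
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS results ("
            " key TEXT PRIMARY KEY,"
            " value BLOB NOT NULL,"
            " size INTEGER NOT NULL,"
            " last_used INTEGER NOT NULL)"
        )

    @staticmethod
    def make_key(file_bytes, operation):
        # Same bytes and same operation give the same key, whatever the file name.
        return f"{operation}:{hashlib.sha256(file_bytes).hexdigest()}"

    def _next_tick(self):
        # Logical clock used to order entries by recency of use.
        row = self.db.execute(
            "SELECT COALESCE(MAX(last_used), 0) + 1 FROM results"
        ).fetchone()
        return row[0]

    def get(self, key):
        row = self.db.execute(
            "SELECT value FROM results WHERE key = ?", (key,)
        ).fetchone()
        if row is None:
            return None
        self.db.execute(
            "UPDATE results SET last_used = ? WHERE key = ?",
            (self._next_tick(), key),
        )
        self.db.commit()
        return row[0]

    def put(self, key, value):
        self.db.execute(
            "INSERT OR REPLACE INTO results VALUES (?, ?, ?, ?)",
            (key, value, len(value), self._next_tick()),
        )
        self._evict()
        self.db.commit()

    def _evict(self):
        # Assumed policy: drop least recently used rows until under the limit.
        while True:
            total = self.db.execute(
                "SELECT COALESCE(SUM(size), 0) FROM results"
            ).fetchone()[0]
            if total <= self.max_bytes:
                break
            self.db.execute(
                "DELETE FROM results WHERE key = "
                "(SELECT key FROM results ORDER BY last_used LIMIT 1)"
            )


def cached_process(cache, file_bytes, operation, process):
    """Return (result, was_cache_hit); run `process` only on a miss."""
    key = cache.make_key(file_bytes, operation)
    hit = cache.get(key)
    if hit is not None:
        return hit, True
    result = process(file_bytes)
    cache.put(key, result)
    return result, False
```

Hashing the file bytes rather than the file name means a re-uploaded copy of the same document is served from the cache, which is one plausible way to get the fast repeat lookups the feature list describes.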
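The Priority Queue feature above can be sketched with the standard library as a bounded priority queue drained by a fixed pool of worker threads. This is an assumption about the general shape, not the project's actual implementation: the `JobQueue` class and its `submit` method are hypothetical, and reading "50" as the maximum number of pending jobs is an interpretation of the feature list.

```python
import itertools
import queue
import threading
from concurrent.futures import Future

MAX_PENDING = 50  # assumed meaning of "50 concurrent operations"
NUM_WORKERS = 3   # "3 parallel workers" from the feature list


class JobQueue:
    """Hypothetical bounded priority queue with a fixed worker pool."""

    def __init__(self, max_pending=MAX_PENDING, num_workers=NUM_WORKERS):
        self._queue = queue.PriorityQueue(maxsize=max_pending)
        # Unique sequence number: keeps FIFO order within one priority and
        # ensures tuple comparison never reaches the callable.
        self._counter = itertools.count()
        self._workers = [
            threading.Thread(target=self._run, daemon=True)
            for _ in range(num_workers)
        ]
        for worker in self._workers:
            worker.start()

    def submit(self, priority, func, *args):
        """Queue a job; a lower priority number is served first.

        Raises queue.Full when max_pending jobs are already waiting.
        """
        future = Future()
        item = (priority, next(self._counter), func, args, future)
        self._queue.put_nowait(item)
        return future

    def _run(self):
        while True:
            _, _, func, args, future = self._queue.get()
            try:
                future.set_result(func(*args))
            except Exception as exc:  # report the failure to the caller
                future.set_exception(exc)
            finally:
                self._queue.task_done()
```

A caller would use it along the lines of `future = jobs.submit(1, run_ocr, pdf_bytes)` followed by `future.result()`, where `run_ocr` stands in for whatever processing function the server exposes. Rejecting work with `queue.Full` instead of blocking lets the server return a "busy" response to the client rather than stalling a request handler.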
Fork Guide
Clone the repository:
bash
git clone https://github.com/apexflow-sinan/PDF_Tools