PDF Tools

active

Open-source Python server providing PDF processing and OCR text extraction (Arabic & English) via gRPC protocol. Includes smart SQLite cache, priority queue, and parallel processing. Designed to serve Flutter and mobile applications.

View on GitHub

Features

PDF Processing

Compress, merge, and convert PDF files at high speed

Arabic & English OCR

Extract text from images and PDFs with high accuracy

Smart Cache

500MB SQLite cache — same file processed in 0.05s

Priority Queue

50 concurrent operations with 3 parallel workers

gRPC Protocol

Fast and reliable communication with Flutter and mobile apps

Arabic Support

Full Arabic text support with arabic-reshaper and python-bidi

Fork Guide

Clone the repository:

bash
git clone https://github.com/apexflow-sinan/PDF_Tools