rw-deepseek-ocr/requirements.txt at e82cd2abf010c94dd788cb669fc9cd685aeef6c3 - rw-deepseek-ocr - Brockevitch Gitea

aaron/rw-deepseek-ocr

Files

Claude e578276d3e Add PDF processing and multi-format document conversion

Features added:
- PDF to image conversion with configurable DPI
- Multi-page PDF processing with OCR
- Export to Markdown, HTML, DOCX, and JSON formats
- Automatic image extraction from PDFs
- Formula and formatting preservation
- Real-time progress tracking for multi-page documents

Backend changes:
- New /api/process-pdf endpoint for PDF processing
- pdf_utils.py: PDF conversion and image extraction utilities
- format_converter.py: Document format conversion (MD, HTML, DOCX)
- Updated dependencies: PyMuPDF, img2pdf, python-docx, markdown

Frontend changes:
- File type toggle (Image OCR / PDF Processing)
- PDFProcessor component with format selection
- Updated ImageUpload to support both images and PDFs
- Progress bars for multi-page processing
- Download options for converted documents

Documentation:
- Updated README with PDF processing features
- Added API documentation for /api/process-pdf endpoint
- Added format conversion examples

2025-11-15 14:25:09 +00:00

18 lines

261 B

Plaintext

Raw Blame History

 fastapi>=0.104.0
 uvicorn[standard]>=0.24.0
 python-multipart>=0.0.6
 transformers==4.46.3
 tokenizers==0.20.3
 accelerate>=0.34.2
 einops
 addict
 easydict
 pillow
 safetensors
 torch
 python-decouple>=3.8
 PyMuPDF>=1.23.0
 img2pdf>=0.5.0
 python-docx>=1.1.0
 markdown>=3.5.0