unir/MastersThesis

README.md

OCR Hyperparameter Tuning with Ray Tune

This directory contains the Docker setup for running automated hyperparameter optimization on OCR services using Ray Tune with Optuna.

Prerequisites

Docker with NVIDIA GPU support (nvidia-container-toolkit)
NVIDIA GPU with CUDA support

Quick Start

cd src

# Start the PaddleOCR service and run tuning (images are pulled from the registry)
docker compose -f docker-compose.tuning.paddle.yml up -d paddle-ocr-gpu
docker compose -f docker-compose.tuning.paddle.yml run raytune --service paddle --samples 64

Available Services

Service	Port	Compose File
PaddleOCR	8002	`docker-compose.tuning.paddle.yml`
DocTR	8003	`docker-compose.tuning.doctr.yml`
EasyOCR	8002	`docker-compose.tuning.easyocr.yml`

Note: PaddleOCR and EasyOCR both publish port 8002, so they cannot run at the same time. Stop one before starting the other.
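The port clash can also be checked programmatically. The sketch below is not part of the repository: it copies the table above into a dictionary (the dictionary layout and the function names are assumptions made for illustration) and reports which services cannot be started side by side.

```python
from itertools import combinations

# Values copied from the "Available Services" table; the keys are the
# names accepted by --service.
SERVICES = {
    "paddle": {"port": 8002, "compose_file": "docker-compose.tuning.paddle.yml"},
    "doctr": {"port": 8003, "compose_file": "docker-compose.tuning.doctr.yml"},
    "easyocr": {"port": 8002, "compose_file": "docker-compose.tuning.easyocr.yml"},
}


def port_conflicts(services=SERVICES):
    """Return sorted pairs of service names that share a host port."""
    return sorted(
        (a, b)
        for a, b in combinations(sorted(services), 2)
        if services[a]["port"] == services[b]["port"]
    )


def can_run_together(names, services=SERVICES):
    """Return True if no two of the named services need the same host port."""
    ports = [services[name]["port"] for name in names]
    return len(ports) == len(set(ports))
```

With the table as given, `port_conflicts()` returns `[("easyocr", "paddle")]`, and `can_run_together(["paddle", "doctr"])` is `True`.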

Usage Examples

PaddleOCR Tuning

# Start service
docker compose -f docker-compose.tuning.paddle.yml up -d paddle-ocr-gpu

# Wait until the health check passes (poll with curl)
curl http://localhost:8002/health

# Run tuning (64 samples)
docker compose -f docker-compose.tuning.paddle.yml run raytune --service paddle --samples 64

# Stop service
docker compose -f docker-compose.tuning.paddle.yml down

DocTR Tuning

docker compose -f docker-compose.tuning.doctr.yml up -d doctr-gpu
curl http://localhost:8003/health
docker compose -f docker-compose.tuning.doctr.yml run raytune --service doctr --samples 64
docker compose -f docker-compose.tuning.doctr.yml down

EasyOCR Tuning

docker compose -f docker-compose.tuning.easyocr.yml up -d easyocr-gpu
curl http://localhost:8002/health
docker compose -f docker-compose.tuning.easyocr.yml run raytune --service easyocr --samples 64
docker compose -f docker-compose.tuning.easyocr.yml down

Run Multiple Services (PaddleOCR + DocTR)

# Start both services
docker compose -f docker-compose.tuning.yml up -d paddle-ocr-gpu doctr-gpu

# Run tuning for each
docker compose -f docker-compose.tuning.yml run raytune --service paddle --samples 64
docker compose -f docker-compose.tuning.yml run raytune --service doctr --samples 64

# Stop all
docker compose -f docker-compose.tuning.yml down

Command Line Options

docker compose -f <compose-file> run raytune --service <service> --samples <n>

Option	Description	Default
`--service`	OCR service: `paddle`, `doctr`, `easyocr`	Required
`--samples`	Number of hyperparameter trials	64
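For reference, the two options in the table could be declared with `argparse` roughly as follows. This is a minimal sketch under that assumption, not the actual contents of `run_tuning.py`; the function name `build_parser` and the help strings are illustrative.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented --service and --samples options."""
    parser = argparse.ArgumentParser(prog="raytune")
    parser.add_argument(
        "--service",
        required=True,
        choices=["paddle", "doctr", "easyocr"],
        help="OCR service to tune",
    )
    parser.add_argument(
        "--samples",
        type=int,
        default=64,
        help="number of hyperparameter trials",
    )
    return parser


if __name__ == "__main__":
    # Parse an explicit argument list so the demo does not depend on sys.argv.
    args = build_parser().parse_args(["--service", "paddle"])
    print(args.service, args.samples)
```

Restricting `--service` with `choices` makes a typo such as `--service padle` fail at startup instead of partway through a tuning run.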

Output

Results are saved to src/results/ as CSV files:

raytune_paddle_results_<timestamp>.csv
raytune_doctr_results_<timestamp>.csv
raytune_easyocr_results_<timestamp>.csv
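When several runs accumulate, the newest file for a service can be picked by parsing the timestamp in the filename. The sketch below assumes the timestamp has the form `YYYYMMDD_HHMMSS`, as in the result files named elsewhere in this README. The helper name `latest_results` is hypothetical.

```python
import re
from datetime import datetime

# Assumed pattern: raytune_<service>_results_<YYYYMMDD>_<HHMMSS>.csv
PATTERN = re.compile(
    r"^raytune_(?P<service>[a-z]+)_results_(?P<stamp>\d{8}_\d{6})\.csv$"
)


def latest_results(filenames, service):
    """Return the newest results filename for `service`, or None if none match."""
    dated = []
    for name in filenames:
        match = PATTERN.match(name)
        if match and match.group("service") == service:
            stamp = datetime.strptime(match.group("stamp"), "%Y%m%d_%H%M%S")
            dated.append((stamp, name))
    # max() compares the parsed timestamps first, so the newest run wins.
    return max(dated)[1] if dated else None
```

A typical call would be `latest_results([p.name for p in Path("src/results").glob("*.csv")], "paddle")` with `Path` imported from `pathlib`.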

Correlation Analysis

Correlation tables used in the thesis are derived from the CSV results with a local script:

source .venv/bin/activate
python tem/scripts/compute_correlations_all.py

Outputs are written to src/results/correlations/:

paddle_correlations.csv
doctr_correlations.csv
easyocr_correlations.csv

These files are computed from the corresponding inputs:

src/results/raytune_paddle_results_20260119_122609.csv
src/results/raytune_doctr_results_20260119_121445.csv
src/results/raytune_easyocr_results_20260119_120204.csv
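The correlation script itself is not reproduced here. As an illustration of the kind of computation involved, the sketch below calculates the Pearson correlation between every numeric column of a results CSV and one target column, using only the standard library. Whether the thesis script uses Pearson correlation, and what the CSV columns are called, is not stated in this README, so the target column name is something the caller must supply and all function names are hypothetical.

```python
import csv
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    denom = math.sqrt(sum(d * d for d in dx) * sum(d * d for d in dy))
    if denom == 0:
        return float("nan")  # a constant column has no defined correlation
    return sum(a * b for a, b in zip(dx, dy)) / denom


def _to_float(value):
    """Convert a CSV cell to float, or return None if it is not numeric."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def correlations(rows, target):
    """Correlate every numeric column in `rows` (a list of dicts) with `target`."""
    result = {}
    for column in rows[0]:
        if column == target:
            continue
        pairs = [(_to_float(r[column]), _to_float(r[target])) for r in rows]
        pairs = [(x, y) for x, y in pairs if x is not None and y is not None]
        if len(pairs) >= 2:
            xs, ys = zip(*pairs)
            result[column] = pearson(xs, ys)
    return result


def load_rows(path):
    """Read a results CSV into a list of dicts keyed by column name."""
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle))
```

Non-numeric columns are skipped because their cells fail the float conversion, so categorical hyperparameters would need to be encoded before they can be included.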

Directory Structure

src/
├── docker-compose.tuning.yml          # PaddleOCR + DocTR together
├── docker-compose.tuning.paddle.yml   # PaddleOCR only
├── docker-compose.tuning.doctr.yml    # DocTR only
├── docker-compose.tuning.easyocr.yml  # EasyOCR only
├── raytune/
│   ├── Dockerfile
│   ├── requirements.txt
│   ├── raytune_ocr.py
│   └── run_tuning.py
├── dataset/                           # Input images and ground truth
├── results/                           # Output CSV files
└── debugset/                          # Debug output

Docker Images

All images are pre-built and pulled from the registry:

seryus.ddns.net/unir/raytune:latest - Ray Tune tuning service
seryus.ddns.net/unir/paddle-ocr-gpu:latest - PaddleOCR GPU
seryus.ddns.net/unir/doctr-gpu:latest - DocTR GPU
seryus.ddns.net/unir/easyocr-gpu:latest - EasyOCR GPU

Build locally (development)

# Build raytune image locally
docker build -t seryus.ddns.net/unir/raytune:latest ./raytune

Troubleshooting

Service not ready

Wait for the health check to pass before running tuning:

# Check service health (port 8002 for PaddleOCR and EasyOCR, 8003 for DocTR)
curl http://localhost:8002/health
# Expected: {"status": "ok", "model_loaded": true, ...}
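The wait can be automated by polling the endpoint until it reports readiness. The sketch below assumes the response is JSON containing the `status` and `model_loaded` fields shown in the expected output; the function names, retry count, and delay are illustrative choices, not part of the repository.

```python
import http.client
import json
import time
import urllib.request


def fetch_health(url, timeout=5.0):
    """GET `url` and return the decoded JSON body, or None on any failure."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return json.loads(response.read().decode("utf-8"))
    except (OSError, ValueError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error, or a body that is not JSON.
        return None


def wait_until_ready(url, attempts=30, delay=2.0, fetch=fetch_health, sleep=time.sleep):
    """Poll `url` until it reports status "ok" with the model loaded.

    Returns True as soon as the service is ready, or False after `attempts` tries.
    `fetch` and `sleep` are injectable so the loop can be exercised without a server.
    """
    for attempt in range(attempts):
        body = fetch(url)
        if (
            isinstance(body, dict)
            and body.get("status") == "ok"
            and body.get("model_loaded") is True
        ):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

For example, `wait_until_ready("http://localhost:8002/health")` blocks for up to about a minute with the defaults and can gate the `raytune` run in a wrapper script.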

GPU not detected

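Ensure nvidia-container-toolkit is installed, then confirm that both the host and a container can see the GPU:

nvidia-smi  # Host check: should show your GPU
docker run --rm --gpus all nvidia/cuda:12.4.1-base-ubuntu22.04 nvidia-smi  # Container check: should show the same GPU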

Port already in use

PaddleOCR and EasyOCR both bind port 8002, so stop whichever stack is still running:

docker compose -f docker-compose.tuning.paddle.yml down
docker compose -f docker-compose.tuning.easyocr.yml down
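To confirm that the port is actually free afterwards, a quick TCP probe is enough. This is a minimal sketch using only the standard library; the function name `port_in_use` is hypothetical, and it assumes the service is published on the local host.

```python
import socket


def port_in_use(port, host="127.0.0.1", timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an errno value otherwise.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    for port in (8002, 8003):
        state = "in use" if port_in_use(port) else "free"
        print(f"port {port}: {state}")
```

The probe only tells you that a listener exists, not which container owns it; `docker ps` shows the port mappings of running containers.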