Chapter numbering fix

2025-12-15 23:28:21 +01:00
parent 41a85cf9fc
commit 5f81eaed04
7 changed files with 198 additions and 84 deletions

@@ -50,14 +50,15 @@ config_optimizada = {
```
MastersThesis/
├── docs/ # Thesis chapters in Markdown (matches template structure)
│ ├── 00_resumen.md # Resumen + Abstract
│ ├── 01_introduccion.md # Chapter 1: Introducción
│ ├── 02_contexto_estado_arte.md # Chapter 2: Contexto y estado del arte
│ ├── 03_objetivos_metodologia.md # Chapter 3: Objetivos y metodología
│ ├── 04_desarrollo_especifico.md # Chapter 4: Desarrollo específico (4.1, 4.2, 4.3)
│ ├── 05_conclusiones_trabajo_futuro.md # Chapter 5: Conclusiones
│ └── 06_referencias_bibliograficas.md # Referencias bibliográficas
├── docs/ # Thesis chapters in Markdown (UNIR template structure)
│ ├── 00_resumen.md # Resumen + Abstract + Keywords
│ ├── 01_introduccion.md # 1. Introducción (1.1, 1.2, 1.3)
│ ├── 02_contexto_estado_arte.md # 2. Contexto y estado del arte (2.1, 2.2, 2.3)
│ ├── 03_objetivos_metodologia.md # 3. Objetivos y metodología (3.1, 3.2, 3.3, 3.4)
│ ├── 04_desarrollo_especifico.md # 4. Desarrollo específico (4.1, 4.2, 4.3)
│ ├── 05_conclusiones_trabajo_futuro.md # 5. Conclusiones (5.1, 5.2)
│ ├── 06_referencias_bibliograficas.md # Referencias bibliográficas (APA format)
│ └── 07_anexo_a.md # Anexo A: Código fuente y datos
├── src/
│ ├── paddle_ocr_fine_tune_unir_raytune.ipynb # Main experiment (64 trials)
│ ├── paddle_ocr_tuning.py # CLI evaluation script
@@ -66,8 +67,9 @@ MastersThesis/
│ └── raytune_paddle_subproc_results_20251207_192320.csv # 64 trial results
├── results/ # Benchmark results CSVs
├── instructions/ # UNIR instructions and template
│ ├── instrucciones.pdf # TFE writing guidelines
│ └── plantilla_individual.pdf # Word template (PDF version)
│ ├── instrucciones.pdf # TFE writing guidelines
│ ├── plantilla_individual.pdf # Word template (PDF version)
│ └── plantilla_individual.htm # Word template (HTML version, readable)
├── ocr_benchmark_notebook.ipynb # Initial OCR benchmark
└── README.md
```
@@ -78,15 +80,15 @@ The template (`plantilla_individual.pdf`) requires **5 chapters**. The docs/ fil
| Template Section | docs/ File | Notes |
|-----------------|------------|-------|
| Resumen | `00_resumen.md` (Spanish part) | 150-300 words |
| Abstract | `00_resumen.md` (English part) | 150-300 words |
| Resumen | `00_resumen.md` (Spanish part) | 150-300 words + Palabras clave |
| Abstract | `00_resumen.md` (English part) | 150-300 words + Keywords |
| 1. Introducción | `01_introduccion.md` | Subsections 1.1, 1.2, 1.3 |
| 2. Contexto y estado del arte | `02_contexto_estado_arte.md` | Subsections 2.1, 2.2, 2.3 |
| 3. Objetivos y metodología | `03_objetivos_metodologia.md` | Subsections 3.1, 3.2, 3.3 |
| 4. Desarrollo específico | `04_desarrollo_especifico.md` | Includes 4.1, 4.2, 4.3 |
| 2. Contexto y estado del arte | `02_contexto_estado_arte.md` | Subsections 2.1, 2.2, 2.3 + Mermaid diagrams |
| 3. Objetivos y metodología | `03_objetivos_metodologia.md` | Subsections 3.1, 3.2, 3.3, 3.4 + Mermaid diagrams |
| 4. Desarrollo específico | `04_desarrollo_especifico.md` | Subsections 4.1, 4.2, 4.3 + Mermaid charts |
| 5. Conclusiones y trabajo futuro | `05_conclusiones_trabajo_futuro.md` | Subsections 5.1, 5.2 |
| Referencias bibliográficas | `06_referencias_bibliograficas.md` | APA, alphabetical |
| Anexo A | (create from README) | Repository URL |
| Anexo A | `07_anexo_a.md` | Repository URL + structure |
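
The mapping in this table can be checked mechanically. The following is a minimal sketch and not part of the repository: the helper name `missing_docs` and the constant `EXPECTED_DOCS` are hypothetical, and only the file names are taken from the table.

```python
from pathlib import Path

# File names copied from the template-to-docs mapping table.
EXPECTED_DOCS = [
    "00_resumen.md",
    "01_introduccion.md",
    "02_contexto_estado_arte.md",
    "03_objetivos_metodologia.md",
    "04_desarrollo_especifico.md",
    "05_conclusiones_trabajo_futuro.md",
    "06_referencias_bibliograficas.md",
    "07_anexo_a.md",
]


def missing_docs(docs_dir):
    """Return the expected chapter files that are absent from docs_dir.

    Hypothetical helper: it only checks that each file exists,
    not that its contents follow the template.
    """
    root = Path(docs_dir)
    return [name for name in EXPECTED_DOCS if not (root / name).is_file()]
```

Calling `missing_docs("docs")` from the repository root would return an empty list when every mapped file is present.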
## Important Data Files
@@ -111,10 +113,14 @@ The template (`plantilla_individual.pdf`) requires **5 chapters**. The docs/ fil
## Pending Work
### Completed Tasks
- [x] **Structure docs/ to match UNIR template** - All chapters now follow the template's exact section numbering (1.1, 1.2, etc.)
- [x] **Add Mermaid diagrams** - 4 diagrams added (OCR pipeline, Ray Tune architecture, CER comparison charts)
### Priority Tasks
1. **Validate on other document types** - Test optimal config on invoices, forms, contracts
2. **Expand dataset** - Current dataset has only 24 pages
3. **Complete unified thesis document** - Merge docs/ chapters into final UNIR format
3. **Complete unified thesis document** - Merge docs/ chapters into final UNIR Word format
4. **Create presentation slides** - For thesis defense
### Optional Extensions
@@ -445,6 +451,26 @@ Fuente: American Psychological Association, 2020b.
8. **When in doubt, ask**: If the user requests data that doesn't exist, ask rather than inventing numbers
9. **DIAGRAMS MUST BE IN MERMAID FORMAT**: All diagrams, flowcharts, and visualizations in the documentation MUST use Mermaid syntax. Mermaid diagrams are:
- Version-control friendly (text-based)
- Consistently styled across all chapters
- Easy to edit and maintain
- Rendered natively by GitHub and by many Markdown viewers
**Supported Mermaid diagram types:**
- `flowchart` / `graph` - For pipelines, workflows, architectures
- `xychart-beta` - For bar charts, comparisons
- `sequenceDiagram` - For process interactions
- `classDiagram` - For class structures
- `stateDiagram` - For state machines
- `pie` - For proportional data
**Example:**
```mermaid
flowchart LR
A[Input] --> B[Process] --> C[Output]
```
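
For the comparison charts, the `xychart-beta` block can be generated from data rather than typed by hand. The sketch below is illustrative only: the function `mermaid_bar_chart` is a hypothetical helper, not an existing script in `src/`, and the labels and values passed to it must come from real result files.

```python
FENCE = "`" * 3  # built at runtime so this example does not close its own fence


def mermaid_bar_chart(title, y_label, labels, values):
    """Return Markdown text containing a Mermaid xychart-beta bar chart.

    Hypothetical helper: the y-axis range is left for Mermaid to derive
    from the data, and labels are quoted so they may contain spaces.
    """
    if len(labels) != len(values):
        raise ValueError("labels and values must have the same length")
    x_axis = ", ".join(f'"{label}"' for label in labels)
    bars = ", ".join(str(value) for value in values)
    lines = [
        FENCE + "mermaid",
        "xychart-beta",
        f'    title "{title}"',
        f"    x-axis [{x_axis}]",
        f'    y-axis "{y_label}"',
        f"    bar [{bars}]",
        FENCE,
    ]
    return "\n".join(lines)
```

The returned string can be pasted into a chapter file as-is. Generating the block from the results CSVs keeps the chart consistent with the measured numbers, in line with the rule against inventing data.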
### Common Tasks
- **Adding new experiments**: Update `src/paddle_ocr_fine_tune_unir_raytune.ipynb`