Some checks failed
build_docker / essential (push) Successful in 1s
build_docker / build_paddle_ocr (push) Failing after 5m31s
build_docker / build_easyocr (push) Failing after 7m40s
build_docker / build_doctr (push) Has been cancelled
build_docker / build_doctr_gpu (push) Has been cancelled
build_docker / build_raytune (push) Has been cancelled
build_docker / build_paddle_ocr_gpu (push) Has been cancelled
build_docker / build_easyocr_gpu (push) Has been cancelled
27 lines
1.2 KiB
HTML
27 lines
1.2 KiB
HTML
<section>
|
|
<h2>Motivación</h2>
|
|
<div class="two-columns">
|
|
<div>
|
|
<ul>
|
|
<li>La digitalización documental es una <strong>necesidad estratégica</strong> para organizaciones</li>
|
|
<li>OCR como puente entre el mundo físico y digital</li>
|
|
<li>Documentos en español: caracteres especiales ausentes en conjuntos de entrenamiento internacionales</li>
|
|
<li>Modelos preentrenados: <strong>rendimiento subóptimo</strong> fuera de benchmarks estándar</li>
|
|
<li>Fine-tuning requiere infraestructura costosa y datos etiquetados</li>
|
|
</ul>
|
|
</div>
|
|
<div>
|
|
<h3 style="font-size:0.8em; text-align:center; margin-bottom: 10px;">Errores típicos en español</h3>
|
|
<table class="data-table" style="font-size:0.85em;">
|
|
<thead><tr><th>Original</th><th>OCR</th><th>Error</th></tr></thead>
|
|
<tbody>
|
|
<tr><td>más</td><td>mas</td><td>Pérdida de acento</td></tr>
|
|
<tr><td>año</td><td>ano</td><td>Pérdida de eñe</td></tr>
|
|
<tr><td>¿Cómo</td><td>Como</td><td>Signos especiales</td></tr>
|
|
<tr><td>titulación</td><td>titulacióon</td><td>Duplicación</td></tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
</section>
|