The first public benchmark for document translation with layout preservation. LTB-100 = chrF + Layout IoU + Reading-order Kendall tau.
Updated May 22, 2026 - Python
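The description names three component metrics but does not say how they are weighted or scaled. As a rough illustration only, the sketch below computes a bounding-box IoU and a reading-order Kendall tau from scratch, and takes chrF as an already computed input. The function names, the (x0, y0, x1, y1) box format, and the equal-weight 0-100 combination in `composite_score` are all assumptions made for this example, not the benchmark's published definition.

```python
from itertools import combinations


def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x0, y0, x1, y1)."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix1 - ix0) * max(0.0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def kendall_tau(order_ref, order_hyp):
    """Kendall tau-a between two orderings of the same block ids (no ties)."""
    pos = {block: i for i, block in enumerate(order_hyp)}
    ranks = [pos[block] for block in order_ref]
    pairs = list(combinations(ranks, 2))
    if not pairs:
        return 1.0  # zero or one block: order is trivially preserved
    concordant = sum(1 for x, y in pairs if x < y)
    discordant = len(pairs) - concordant
    return (concordant - discordant) / len(pairs)


def composite_score(chrf, layout_iou, tau):
    """Hypothetical combination: equal-weight mean on a 0-100 scale.

    Assumes chrf and layout_iou are in [0, 1]; tau in [-1, 1] is
    rescaled to [0, 1] before averaging.
    """
    tau01 = (tau + 1.0) / 2.0
    return 100.0 * (chrf + layout_iou + tau01) / 3.0
```

Rescaling tau to [0, 1] keeps all three terms on a common range, so a fully reversed reading order contributes zero rather than a negative amount. A real harness would also need a rule for matching predicted blocks to reference blocks before computing IoU, which this sketch leaves out.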
Open Agentsia Labs benchmark harness for model runners, multi-turn evals, reproducible rubric scoring, proof bundles, and RunRecord output.
Repository for the pearl AI benchmark.