#

open-benchmark

Here are 3 public repositories matching this topic...

Lawrenzho-bit / LayoutTranslateBench

The first public benchmark for document translation with layout preservation. LTB-100 = chrF + Layout IoU + Reading-order Kendall tau.

benchmark ocr translation localization geo leaderboard document-translation document-ai layout-preservation open-benchmark

Updated May 22, 2026
Python

agentsia-uk / assay-harness

Open Agentsia Labs benchmark harness for model runners, multi-turn evals, reproducible rubric scoring, proof bundles, and RunRecord output.

benchmark adtech reproducibility llm-evaluation eval-harness frontier-ai proof-bundles open-benchmark assay-adtech assay-harness runrecord multi-turn-evals iab-tech-lab

Updated Jun 27, 2026
TypeScript

PearlEng / grade

repository for pearl AI benchmark

nlp education benchmark grounding ai-evaluation llm-evaluation education-analytics open-benchmark

Updated Jun 12, 2026
Python

Improve this page

Add a description, image, and links to the open-benchmark topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the open-benchmark topic, visit your repo's landing page and select "manage topics."