# LSFBench
A minimal Luau/Lune benchmark for evaluating LLMs: one model answers the questions, and a second model scores those answers against the reference key.
## Quick Start

### Prereqs
- Install Lune (0.10.x)
- Start Ollama at `http://localhost:11434` and pull the models referenced in `config.luau` (e.g. `qwen3:4b`)
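A typical setup might look like the following. This is a sketch, not taken from the repo: the exact Lune install method depends on your toolchain manager (Rokit is assumed here), and the model name is just the example above.

```shell
# Install Lune via Rokit (assumption: adapt to your own toolchain manager).
rokit add lune-org/lune

# Start the Ollama server; it listens on http://localhost:11434 by default.
ollama serve &

# Pull each model referenced in config.luau (qwen3:4b is the example model).
ollama pull qwen3:4b

# Sanity check: the root endpoint replies "Ollama is running".
curl -s http://localhost:11434
```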
### Notice

The evaluator model must support structured JSON outputs, since scores are parsed mechanically rather than from free-form text.
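As a sketch of what "structured JSON outputs" means here (the request shape, field names, and schema below are illustrative assumptions, not this repo's actual code), an evaluator call through Lune's `@lune/net` library could constrain the reply with Ollama's `format` field:

```luau
-- Hypothetical sketch: the real request lives in this repo's source.
local net = require("@lune/net")

-- Pass a JSON schema via Ollama's `format` field so the evaluator
-- must reply with a machine-parseable verdict.
local response = net.request({
    url = "http://localhost:11434/api/chat",
    method = "POST",
    body = net.jsonEncode({
        model = "qwen3:4b", -- evaluator model; must support structured outputs
        stream = false,
        messages = {
            { role = "user", content = "Score this answer against the key ..." },
        },
        format = {
            type = "object",
            properties = {
                score = { type = "number" },
                reason = { type = "string" },
            },
            required = { "score", "reason" },
        },
    }),
})

-- The model's content is itself a JSON string matching the schema.
local verdict = net.jsonDecode(net.jsonDecode(response.body).message.content)
print(verdict.score, verdict.reason)
```

A model without structured-output support may ignore the schema and return prose, which breaks the decode step above.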