lsf-bench/results.md
2025-09-07 00:12:29 -06:00

Rank  Model                                                                Weighted %  Total Count  Sum Total
1     gpt-oss:20b                                                               89.1%           49        733
2     hf.co/BasedBase/Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2:Q4_K_M       88.2%           49        726
3     hf.co/bartowski/NousResearch_Hermes-4-14B-GGUF:Q4_K_M                     73.6%           49        606
4     hf.co/unsloth/gemma-3n-E4B-it-GGUF:Q8_0                                   66.0%           49        543
5     hf.co/unsloth/Phi-4-mini-reasoning-GGUF:Q8_0                              65.5%           49        539
6     hf.co/unsloth/gemma-3n-E2B-it-GGUF:Q8_0                                   49.9%           49        411
7     hf.co/unsloth/Qwen3-0.6B-GGUF:BF16                                        45.8%           49        377
8     hf.co/unsloth/gemma-3-4b-it-GGUF:Q8_0                                     44.2%           49        364
9     hf.co/bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0                           42.2%           49        347
10    hf.co/unsloth/gemma-3-1b-it-GGUF:BF16                                     35.0%           49        288
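
The Weighted % column appears to be Sum Total divided by the combined category maximum (95 + 198 + 115 + 215 + 95 + 105 = 823, taken from the Max column of the per-category tables). A minimal Python sketch of that calculation, assuming this reading is correct; the names `CATEGORY_MAX` and `weighted_percent` are illustrative and not part of lsf-bench:

```python
# Category maxima copied from the "Max" column of the per-category tables.
# The dictionary and function names here are hypothetical, for illustration only.
CATEGORY_MAX = {
    "syntax": 95,
    "types": 198,
    "advanced": 115,
    "internals": 215,
    "runtimes": 95,
    "compatibility": 105,
}


def weighted_percent(sum_total, category_max=CATEGORY_MAX):
    """Return sum_total as a percentage of the combined category maximum."""
    overall_max = sum(category_max.values())  # 823 for the maxima above
    return round(100 * sum_total / overall_max, 1)


if __name__ == "__main__":
    for model, sum_total in [("gpt-oss:20b", 733), ("gemma-3-1b-it", 288)]:
        print(f"{model}: {weighted_percent(sum_total)}%")
```

Under this assumption the formula reproduces the listed percentages, for example 733 / 823 gives 89.1% and 288 / 823 gives 35.0%.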

Per-category stats

1) gpt-oss:20b

  • Aggregated: 89.1% - Count: 49 - Sum Total: 733
    Category            %  Count  Total  Max
    syntax         100.0%      8     95   95
    types           95.5%     11    189  198
    advanced        93.0%      8    107  115
    internals       88.4%      8    190  215
    runtimes        78.9%      6     75   95
    compatibility   66.7%      8     70  105

2) hf.co/BasedBase/Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2:Q4_K_M

  • Aggregated: 88.2% - Count: 49 - Sum Total: 726
    Category            %  Count  Total  Max
    syntax         100.0%      8     95   95
    types           95.5%     11    189  198
    advanced        93.0%      8    107  115
    internals       88.4%      8    190  215
    runtimes        78.9%      6     75   95
    compatibility   66.7%      8     70  105

3) hf.co/bartowski/NousResearch_Hermes-4-14B-GGUF:Q4_K_M

  • Aggregated: 73.6% - Count: 49 - Sum Total: 606
    Category            %  Count  Total  Max
    syntax          82.1%      8     78   95
    types           80.8%     11    160  198
    advanced        78.3%      8     90  115
    internals       75.8%      8    163  215
    runtimes        57.9%      6     55   95
    compatibility   57.1%      8     60  105

4) hf.co/unsloth/gemma-3n-E4B-it-GGUF:Q8_0

  • Aggregated: 66.0% - Count: 49 - Sum Total: 543
    Category            %  Count  Total  Max
    syntax          76.8%      8     73   95
    advanced        73.0%      8     84  115
    internals       73.0%      8    157  215
    types           59.6%     11    118  198
    runtimes        55.8%      6     53   95
    compatibility   55.2%      8     58  105

5) hf.co/unsloth/Phi-4-mini-reasoning-GGUF:Q8_0

  • Aggregated: 65.5% - Count: 49 - Sum Total: 539
    Category            %  Count  Total  Max
    syntax          76.8%      8     73   95
    internals       76.7%      8    165  215
    types           70.7%     11    140  198
    runtimes        55.8%      6     53   95
    advanced        50.4%      8     58  115
    compatibility   47.6%      8     50  105

6) hf.co/unsloth/gemma-3n-E2B-it-GGUF:Q8_0

  • Aggregated: 49.9% - Count: 49 - Sum Total: 411
    Category            %  Count  Total  Max
    syntax          62.1%      8     59   95
    internals       54.4%      8    117  215
    advanced        48.7%      8     56  115
    types           47.5%     11     94  198
    compatibility   44.8%      8     47  105
    runtimes        40.0%      6     38   95

7) hf.co/unsloth/Qwen3-0.6B-GGUF:BF16

  • Aggregated: 45.8% - Count: 49 - Sum Total: 377
    Category            %  Count  Total  Max
    advanced        60.9%      8     70  115
    internals       49.8%      8    107  215
    compatibility   45.7%      8     48  105
    types           42.4%     11     84  198
    runtimes        36.8%      6     35   95
    syntax          34.7%      8     33   95

8) hf.co/unsloth/gemma-3-4b-it-GGUF:Q8_0

  • Aggregated: 44.2% - Count: 49 - Sum Total: 364
    Category            %  Count  Total  Max
    syntax          58.9%      8     56   95
    advanced        53.0%      8     61  115
    types           47.5%     11     94  198
    internals       44.2%      8     95  215
    compatibility   31.4%      8     33  105
    runtimes        26.3%      6     25   95

9) hf.co/bartowski/Llama-3.2-3B-Instruct-GGUF:Q8_0

  • Aggregated: 42.2% - Count: 49 - Sum Total: 347
    Category            %  Count  Total  Max
    advanced        65.2%      8     75  115
    syntax          46.3%      8     44   95
    internals       40.5%      8     87  215
    runtimes        37.9%      6     36   95
    types           35.4%     11     70  198
    compatibility   33.3%      8     35  105

10) hf.co/unsloth/gemma-3-1b-it-GGUF:BF16

  • Aggregated: 35.0% - Count: 49 - Sum Total: 288
    Category            %  Count  Total  Max
    advanced        43.5%      8     50  115
    runtimes        38.9%      6     37   95
    internals       37.2%      8     80  215
    types           33.3%     11     66  198
    syntax          31.6%      8     30   95
    compatibility   23.8%      8     25  105
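
Each model's Sum Total should equal the sum of its per-category Total values, and each category % should equal Total / Max. The sketch below cross-checks two of the models using the figures listed above; the data layout and the names `ROWS`, `REPORTED_SUM`, `category_percent`, and `check_model` are illustrative assumptions, not part of the benchmark tooling. With the numbers as listed, the category rows under model 1 add up to 726 rather than the reported 733, while the rows under model 3 add up to the reported 606.

```python
# (category, total, max) rows copied from the per-category tables above.
# All names in this sketch are hypothetical, for illustration only.
ROWS = {
    "gpt-oss:20b": [
        ("syntax", 95, 95), ("types", 189, 198), ("advanced", 107, 115),
        ("internals", 190, 215), ("runtimes", 75, 95), ("compatibility", 70, 105),
    ],
    "Hermes-4-14B": [
        ("syntax", 78, 95), ("types", 160, 198), ("advanced", 90, 115),
        ("internals", 163, 215), ("runtimes", 55, 95), ("compatibility", 60, 105),
    ],
}

# "Sum Total" values copied from the "Aggregated" lines.
REPORTED_SUM = {"gpt-oss:20b": 733, "Hermes-4-14B": 606}


def category_percent(total, maximum):
    """Percentage score for one category, rounded to one decimal place."""
    return round(100 * total / maximum, 1)


def check_model(name):
    """Return (sum of category totals, whether it matches the reported Sum Total)."""
    computed = sum(total for _, total, _ in ROWS[name])
    return computed, computed == REPORTED_SUM[name]


if __name__ == "__main__":
    for name in ROWS:
        computed, ok = check_model(name)
        print(f"{name}: computed {computed}, reported {REPORTED_SUM[name]}, match={ok}")
```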