clawbench/profiles
2026-05-02 10:24:03 -07:00
example_research_stack.yaml  ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_deepseek_v4.yaml    feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
frontier_gemini_3_pro.yaml   ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_glm_5_1.yaml        ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_gpt_5_2.yaml        feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
frontier_gpt_5_4.yaml        ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_gpt_5_5.yaml        feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
frontier_kimi_k25.yaml       ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_kimi_k26.yaml       feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
frontier_minimax_m27.yaml    ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_opus_4_6.yaml       ClawBench: 7-model frontier baseline + bake-off tooling   2026-04-10 19:14:11 -07:00
frontier_opus_4_7.yaml       feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
frontier_qwen_3_6.yaml       sweep: per-container state isolation + qwen model-id fix  2026-04-20 19:48:30 -07:00
frontier_sonnet_4_6.yaml     feat(eval): stabilize full-suite adapter runs             2026-05-02 10:24:03 -07:00
local_ollama_gpt_oss.yaml    docs: fix ollama profile guidance                         2026-04-16 19:49:04 -07:00