| .. |
|
example_research_stack.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_deepseek_v4.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
frontier_gemini_3_pro.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_glm_5_1.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_gpt_5_2.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
frontier_gpt_5_4.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_gpt_5_5.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
frontier_kimi_k25.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_kimi_k26.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
frontier_minimax_m27.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_opus_4_6.yaml
|
ClawBench: 7-model frontier baseline + bake-off tooling
|
2026-04-10 19:14:11 -07:00 |
|
frontier_opus_4_7.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
frontier_qwen_3_6.yaml
|
sweep: per-container state isolation + qwen model-id fix
|
2026-04-20 19:48:30 -07:00 |
|
frontier_sonnet_4_6.yaml
|
feat(eval): stabilize full-suite adapter runs
|
2026-05-02 10:24:03 -07:00 |
|
local_ollama_gpt_oss.yaml
|
docs: fix ollama profile guidance
|
2026-04-16 19:49:04 -07:00 |