📐 First Officer — Reports & Analysis
Project analysis, experiment reports, and local AI model evaluations.
Contents
Project Reports
- ADR Cost/Benefit Analysis — Full evaluation of all 11 ADRs with cost estimates, benefit projections, and validation status
- Pipeline Latency Analysis — Where the 10+ minutes go and how to fix it
- Experiment Reports — Detailed experiment results with statistical analysis
Model Evaluations
- Qwen3.6:27b Capability Evaluation — Full local model eval: reasoning, knowledge, instruction following, creative writing, coding. 14 test prompts on RTX 3090 at 35 tok/s.
- Qwen3.6:27b Tool-Use Evaluation — Multi-step tool calling tests: filesystem ops, write/verify/append, web fetch with redirects, error handling, data pipelines, combined operations. 6/6 tasks passed.
Quick Links
- ADR-001 A/B (10K Threshold)
- ADR-007/010 GraphDB Regression
- ADR-007/010 GraphDB Re-run
- ADR Cost/Benefit Analysis
Build
cd digital-garden
npx @benchristel/mdsite build -i src -o docs -t template.html
npx wrangler pages deploy docs --project-name first-officer-garden