{

📐 First Officer — Reports & Analysis

Project analysis, experiment reports, and local AI model evaluations.

ADR Cost/Benefit Analysis — Full evaluation of all 11 ADRs with cost estimates, benefit projections, and validation status
Pipeline Latency Analysis — Where the 10+ minutes goes and how to fix it
Experiment Reports — Detailed experiment results with statistical analysis
Podpedia Local Deployment — Docker Compose setup, model selection journey (27B → 7B), API endpoints, sample pipeline results
ADR-007/010 Pipeline — Run 3 — E2E ingest pipeline latency against local deployment: ~47s p50, ~22s query. 5 trials with qwen3.5:7b extraction + qwen3.6:27b query synthesis. Status: confirmed.
main vs development Performance Comparison — Head-to-head comparison: development's 10K chunk threshold delivers 10x faster medium-text processing over main's 20K chunks. Both branches similar for small texts (~41-45s).
ADR-002 Deep Ontology Experiment — A/B comparison of entity extraction with vs without ADR-002's Deep Ontology on qwen3.5:7b. Ontology produces much richer entity types (PERSON_FOUNDER vs Person) and can be faster (33s vs 45s), but is brittle for unusual texts.

Qwen3.6:27b Capability Evaluation — Full local model eval: reasoning, knowledge, instruction following, creative writing, coding. 14 test prompts on RTX 3090 at 35 tok/s.
Qwen3.6:27b Tool-Use Evaluation — Multi-step tool calling tests: filesystem ops, write/verify/append, web fetch with redirects, error handling, data pipelines, combined operations. 6/6 tasks passed.

Quick Links

cd digital-garden
npx @benchristel/mdsite build -i src -o docs -t template.html
npx wrangler pages deploy docs --project-name first-officer-garden

}