Assistant evals now run in CI

Assistant evals now run in CI. A new workflow runs on pushes to master and on PRs labeled run-evals, then reports improvements and regressions across tool usage, SQL validity, goal completion, conciseness, and completeness. Two local scripts are also included: pnpm evals:run and pnpm evals:upload.
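
As a rough illustration of what reporting improvements and regressions per metric could look like, here is a minimal sketch. It is not the project's actual eval code: the `Scores` shape, the `compareScores` helper, the 0-to-1 score range, and the `tolerance` threshold are all assumptions made for the example. Only the five metric names come from the entry above.

```typescript
// Hypothetical sketch, not the project's real implementation.
// The five metric names mirror the dimensions listed in this entry.
type Metric =
  | "toolUsage"
  | "sqlValidity"
  | "goalCompletion"
  | "conciseness"
  | "completeness";

// Assumption: each metric is summarized as a single score in [0, 1].
type Scores = Record<Metric, number>;

type Verdict = "improved" | "regressed" | "unchanged";

// Compare a run's scores against a baseline, metric by metric.
// `tolerance` (an assumed parameter) absorbs small run-to-run noise.
function compareScores(
  baseline: Scores,
  current: Scores,
  tolerance = 0.01,
): Record<Metric, Verdict> {
  const result = {} as Record<Metric, Verdict>;
  for (const metric of Object.keys(baseline) as Metric[]) {
    const delta = current[metric] - baseline[metric];
    if (delta > tolerance) result[metric] = "improved";
    else if (delta < -tolerance) result[metric] = "regressed";
    else result[metric] = "unchanged";
  }
  return result;
}

// Made-up numbers, for illustration only.
const baseline: Scores = {
  toolUsage: 0.8,
  sqlValidity: 0.95,
  goalCompletion: 0.7,
  conciseness: 0.6,
  completeness: 0.5,
};

const current: Scores = {
  toolUsage: 0.9,
  sqlValidity: 0.9,
  goalCompletion: 0.7,
  conciseness: 0.605,
  completeness: 0.75,
};

const report = compareScores(baseline, current);
console.log(report);
```

With a tolerance like this, a change smaller than the threshold (conciseness here) is reported as unchanged rather than flagged, which keeps noisy evals from producing spurious regressions.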