GitHub is making Copilot CLI look less like a standalone assistant and more like a managed agent layer for software teams. Two changelog updates this week point in the same direction: administrators can now distribute enterprise-managed plugins, while Copilot's Rubber Duck review agent can mix model families for a second opinion.
That combination matters because coding agents are moving from individual experiments into company workflows. Once agents can run commands, touch repositories, call tools and suggest cross-file changes, the hard part is not only model quality. It is governance: which tools are available, which rules are always on, and how teams catch mistakes before code ships.
The CLI is getting company defaults
On May 6, GitHub said enterprise-managed plugins for Copilot CLI are now in public preview. The feature lets administrators define plugin marketplaces and auto-installed plugins through a settings file in `.github-private/.github/copilot/settings.json`. Copilot CLI then applies those settings for licensed Copilot Business and Copilot Enterprise users.
GitHub says plugins can include extensibility points such as custom agents, hooks and MCP configurations. In practical terms, that means a security team could make approved internal tools available by default, while an engineering platform team could standardize setup steps for new developers.
This is a small-looking admin feature with a large operational implication. AI coding tools are no longer only a chat window beside the editor. They are becoming a programmable interface to the developer environment, and companies need a way to make that interface consistent.
Rubber Duck becomes cross-model review
A day later, GitHub expanded Rubber Duck, its Copilot CLI second-opinion agent. When a session is using a GPT model as the orchestrator and experimental mode is enabled, Copilot can now dispatch a Claude-powered critic agent. For Claude-led sessions, GitHub says the second-opinion model has been upgraded to GPT-5.5.
The point is not that one model family is always better than another. It is that different models often fail differently. A second model can spot architectural issues, subtle bugs or cross-file conflicts that the first agent missed.
That makes Rubber Duck closer to an automated code review pattern than a novelty. The orchestrator can propose a path, while a critic model checks assumptions before the developer accepts the change. For teams already trying agentic coding, that workflow is more useful than another benchmark chart.
The enterprise bet
The bigger story is GitHub's attempt to make agentic coding acceptable inside large organizations. Enterprises do not just want faster code generation. They want tool access controls, auditability, repeatable defaults and review loops that fit existing engineering practices.
Managed plugins answer the distribution problem: how to give every developer the same approved agent capabilities without asking each person to configure them manually. Cross-model Rubber Duck answers part of the quality problem: how to reduce blind trust in a single agent run.
Neither feature eliminates the need for human review. A second model can still be wrong, and centrally managed plugins can still expose risky workflows if administrators approve too much. But these updates show where the market is going. The useful coding agent is not just smart; it is constrained, observable and wired into company policy.
What to watch next
The next step will be whether GitHub can make these controls measurable. Enterprises will want to know which plugins were invoked, which agents changed files, which hooks ran and whether a critic model materially reduced defects. Usage metrics and policy logs may become as important as model selection.
For developers, the shift is more immediate. The Copilot CLI experience may start to differ by workplace, with company-approved tools and review agents appearing automatically. That could reduce setup friction, but it also means AI-assisted development will increasingly reflect the rules of the organization, not just the preferences of the individual developer.



