Replace code reviews with verified intentAviator Verify
Aviator

Changelog

What’s new in Aviator

Features, improvements, fixes, and security updates — shipped continuously.

  1. Runbooks: sandboxes, tool controls, and MCP

    Three big additions on the Runbooks side — bring-your-own SSH sandboxes, allow/deny lists for Claude Code tools, and MCP server support — plus a handful of MergeQueue and Releases improvements.

    69d4548ec6584a795ea3dfe8_Screenshot-2026-04-06-at-5.43.47-PM.png

    Three big additions on the Runbooks side — bring-your-own SSH sandboxes, allow/deny lists for Claude Code tools, and MCP server support — plus a handful of MergeQueue and Releases improvements.

    Bring your own sandbox with SSH

    Runbooks can now execute steps inside any environment you can reach over SSH, instead of the Aviator-managed cloud sandbox. Point Runbooks at a host, drop in an SSH private key, and your agents will run on infrastructure you own.

    • **Pick your sandbox type** from the Runbooks settings page — Cloud (managed by Aviator) or SSH (your own infrastructure).
    • **Resilient connections**: automatic retry on SSH failures, reconnection support for long-running sessions, and process-state persistence across reconnects so an interrupted step picks back up cleanly.
    • **Self-hosted GitHub support** is included, so SSH sandboxes work the same whether you're on github.com or GitHub Enterprise Server.

    Learn more about SSH sandboxes .

    Allow lists and deny lists for Claude Code tools

    You can now constrain exactly which Claude Code tools — and which command patterns — are available to Runbooks during planning and execution.

    • **Allow list and deny list** uses Claude Code's native pattern syntax: Read , Edit , Bash(npm run build:*) , Edit(/path/to/dir/*) , and so on.
    • **Denylists take precedence over allowlists**, so a deny rule is always the final word. This makes it safe to start permissive and tighten down as you discover commands you'd rather the agent not touch.
    • Separate controls for the **planning** phase (where the agent generates runbook steps) and the **execution** phase (where steps actually run).

    MCP server support

    Runbooks now speaks MCP (Model Context Protocol), so you can plug your own tools into the agent without waiting for us to build a first-party integration. Configure MCP servers as JSON from the Runbooks settings page:

    • Supports stdio , sse , and http transports.
    • Combine MCP with the tool controls above to give Runbooks access to your internal systems on your terms.

    Also in November

    MergeQueue

    • Refreshed the Timeline page with better density and faster loads.
    • Better handling of GitHub secondary rate limits under sustained load.
    • Alt access tokens can now be used for PR creation and PR comments, so teams can keep bot identities separated from human ones.

    Releases

    • Scheduled crons now evaluate in UTC, removing timezone ambiguity.
    • Non-default-branch deployments now display the branch name, making it easier to tell release candidates and environment-specific builds apart at a glance.
  2. Runbooks - Turn AI Coding Multiplayer with spec-driven development

    Introducing Runbooks - a multiplayer AI coding platform that enables collaborative spec-driven development. Some key features:

    6903981abe664ab7dc78d70e_Screenshot-2025-10-29-at-1.18.05-PM.png

    Introducing Runbooks - a multiplayer AI coding platform that enables collaborative spec-driven development. Some key features:

    Spec-driven development

    Runbooks is purpose-built for complex brownfield projects that involve understanding large codebases. Aviator agents internally run Claude code to analyze the code and prepare multistep plans (called Runbooks).

    6903984f19c874c6bdf9a857_Screenshot-2025-10-29-at-1.13.18-PM.png

    Share and collaborate

    Invite engineers, PMs, and all stakeholders to share and collaborate on these Runbooks. Plan together, get feedback, and reuse existing templates.

    69039883aba882a704e19004_Screenshot-2025-10-29-at-1.13.47-PM.png

    Background agents

    Aviator manages ephemeral sandboxes that check out the repository and run Claude code in the background. These sandboxes can run custom environments and can scale to hundreds of agents instantaneously for parallel execution.

    690398c2bc903e9c70395c08_Screenshot-2025-10-29-at-1.15.30-PM.png

    Institutional knowledge

    Agents manage all learnings into editable context files that are used for future execution. Using these auto-generated institutional knowledge graphs, the agents build up business context and get smarter over time.

    690398eb1fc610ba7d746b0c_Screenshot-2025-10-29-at-1.16.16-PM.png

    Templates

    The Runbooks can be published and reused. Write once, execute N times to standardize common operations or large replicable migrations. It also comes pre-built with a Runbook Library with tried and tested templates.

    69039910d25bde70f6d7539a_Screenshot-2025-10-29-at-1.17.29-PM.png

    Additional Resources

    • Read our docs
    • Why Runbooks

    Interested? Join our beta .

  3. Announcing Aviator Agents (Beta)

    Introducing Aviator Agents - an LLM-based agentic framework to perform large scale code migrations in your repositories.

    67ff4009bfa73e4ad8dc3561_agents-2.png

    Introducing Aviator Agents - an LLM-based agentic framework to perform large scale code migrations in your repositories.

    67ff3fe33f94eb69b7fa22c5_agents-demo.jpg

    How it works

    These agents emulate human patterns to perform migrations by:

    • finding references and documentation
    • understanding dependencies
    • breaking down large chunks of code, and
    • verifying build status.

    Aviator Agents plan out steps to perform a task and iterate to improve accuracy.

    Unlike most LLM based frameworks, Aviator Agents is an end-to-end framework that connects directly with GitHub. This eliminates the need to set up a custom environment and build process.

    Models

    It directly integrates with GitHub and supports Open-AI o1, Claude Sonnet 3.5, Deepseek R1, and Llama 3.1 models.

    Early access

    Aviator Agents is currently in private beta. If you are interested in trying it out please reply back or sign up on: aviator.co/agents

  4. FlexReview in GA - Distributed owners, validation and more

    FlexReview is out of Beta now, and can be directly accessed self-serve. Here’s what’s new:

    FlexReview is out of Beta now, and can be directly accessed self-serve. Here’s what’s new:

    Distributed owners file

    Use aviator-config.yaml to define multi-level ownership definition in a YAML file . Distributed ownership simplifies updates and make it collaborative. Example:

    67ff41566fe06c169bd96b0f_owner-1.png67ff4162c1caddec7426f1f5_owners-2.png

    Validation

    A GitHub status check to ensure that appropriate approvals have been granted before a pull request is considered mergeable:

    • Reviewers are responsible for approving on behalf of the files they own.
    • Reviewers can be selectively dismissed when new code changes are pushed tied to their ownership.
    • Includes a break-glass override for emergency approvals.

    Chrome Extension

    Extension that shows which team is the owner for specific files. Highlights the files that you are responsible to review based on your ownership.

    67f6809aa1d14d0c97740898_owner-dark-1-.jpeg67f680c36106a13dd508473b_owner-light-1-.jpeg

    OOO tracking

    Set up OOO for team members who should be assigned for code reviews.

    67f680f35592d879a8e45d0a_Screenshot-2025-03-18-at-5.57.05-PM-1-.png

    Slack Notifications

    Set up each team’s default Slack channels and customize notifications . Get notified for pending team reviews, subscribe to listen to changes in specific code paths.

    67f6810580aa44f7a2239549_slack-notif-1-.png

    Reach out if you are interesting in trying it out or have any feedback!