AIToday

GitHub Copilot is cutting the cost of each request by reusing cached context and routing tasks to the most efficient model, rather than always sending all information or using the most powerful option.

GitHub Blog (AI)14h ago3 min read
GitHub Copilot is cutting the cost of each request by reusing cached context and routing tasks to the most efficient model, rather than always sending all information or using the most powerful option.

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    What happened: GitHub is rolling out two changes to Copilot in VS Code. Prompt caching lets Copilot reuse model state for repeated information instead of recomputing it on every turn. Tool search lets the model load tool definitions on demand rather than sending every tool definition into context each turn. Separately, Auto model selection is expanding to choose the right model based on task type—quick explanations, focused edits, and multi-file changes now get routed to different models rather than treating all tasks the same way.

  2. 2

    Why it matters: Longer Copilot sessions require a lot of recurring information (instructions, context, conversation history, available tools, task state), and sending all of it repeatedly wastes tokens and costs money. By caching what repeats and routing tasks to models that fit the work, Copilot can deliver the same quality outcome with less unnecessary computation. The body notes that in evaluations, 'no single model consistently performed best across tasks'—a more efficient model often reaches the same result as a stronger one when the task does not require deep reasoning.

  3. 3

    What to watch: Auto with task intent is already live in Visual Studio Code, github.com, and mobile. The rollout is expanding to Copilot CLI, GitHub App, and additional IDEs. Copilot Free and Student plans will be simplified to use Auto as the only model selection option, and admin controls will let organizations set Auto as the default or enforce it as the only option.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →