Skip to content

Create blog post on AKS NAP disruption management#5685

Open
wdarko1 wants to merge 7 commits intoAzure:masterfrom
wdarko1:nap-disruption-blog
Open

Create blog post on AKS NAP disruption management#5685
wdarko1 wants to merge 7 commits intoAzure:masterfrom
wdarko1:nap-disruption-blog

Conversation

@wdarko1
Copy link
Copy Markdown
Contributor

@wdarko1 wdarko1 commented Mar 27, 2026

Added a blog post on managing disruption with AKS Node Auto-Provisioning, covering best practices for Pod Disruption Budgets and consolidation.

Added a blog post on managing disruption with AKS Node Auto-Provisioning, covering best practices for Pod Disruption Budgets and consolidation.
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new AKS blog post focused on managing voluntary disruption when using Node Auto-Provisioning (NAP), with guidance on Pod Disruption Budgets (PDBs), consolidation controls, disruption budgets, and maintenance windows.

Changes:

  • Added a new blog post covering NAP disruption concepts and common pitfalls.
  • Included YAML examples for PDBs and NodePool disruption settings (consolidation policy, budgets, schedules).
  • Added operational guidance on observability and drift/image update considerations.

Comment on lines +26 to +34
---

<!-- truncate -->

:::info

Learn more about how to [configure disruption policies for NAP](https://learn.microsoft.com/azure/aks/node-auto-provisioning-disruption)

:::
Copy link

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Per the repo’s blog post structure, add a hero image immediately after <!-- truncate -->. The post directory currently contains only index.md, so readers won’t get a hero/social image unless you add one (for example ./hero-image.png) and reference it here with descriptive alt text.

Copilot uses AI. Check for mistakes.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings March 28, 2026 04:26
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.

Clarified descriptions of Pod Disruption Budgets and their impact on voluntary evictions. Improved wording for clarity and corrected minor typos.
Copy link
Copy Markdown
Contributor

@sabbour sabbour left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Content review: solid technical depth on a high-value topic. The main blockers are em dashes throughout (banned by style guide), third-person voice in the opening, a cluster of typos in Part 4, and a duplicated troubleshooting section. Nine inline comments with specific fixes.

Copy link
Copy Markdown
Contributor

@sabbour sabbour left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Content review: solid technical depth on a high-value topic. The main blockers are em dashes throughout (banned by style guide), third-person voice in the opening, a cluster of typos in Part 4, and a duplicated troubleshooting section. Nine inline comments with specific fixes.

Copilot AI review requested due to automatic review settings March 31, 2026 08:28
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 2 changed files in this pull request and generated 6 comments.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Ahmed Sabbour <103856+sabbour@users.noreply.github.com>
Copilot AI review requested due to automatic review settings April 2, 2026 19:32
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 2 changed files in this pull request and generated no new comments.

wdarko1 added 2 commits April 2, 2026 12:42
Updated formatting and clarified sections on NAP disruption best practices, including node disruption budgets and observability.
Updated guidance on managing NAP node disruptions, including operational takeaways and common pitfalls with suggested fixes.
Copilot AI review requested due to automatic review settings April 2, 2026 19:59
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 1 out of 2 changed files in this pull request and generated 3 comments.

- Why won’t NAP scale down, even though I have lots of underused capacity?
- Why do upgrades get “stuck” on certain nodes?

This post focuses on **NAP disruption best practices**, not workload scheduling (tools like topology spread constraints, node affinity, and taints). For scheduling best practices, see the NAP scheduling fundamentals post (link TBD).
Copy link

Copilot AI Apr 2, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This sentence contains a placeholder "(link TBD)". Please replace it with a real link to the referenced scheduling fundamentals post or remove the reference before publishing.

Suggested change
This post focuses on **NAP disruption best practices**, not workload scheduling (tools like topology spread constraints, node affinity, and taints). For scheduling best practices, see the NAP scheduling fundamentals post (link TBD).
This post focuses on **NAP disruption best practices**, not workload scheduling (tools like topology spread constraints, node affinity, and taints).

Copilot uses AI. Check for mistakes.

<!-- truncate -->

![Diagram showing two concentric defensive layers protecting workloads during NAP node consolidation](./hero-image.png)
Copy link

Copilot AI Apr 2, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The hero image referenced here is very large (~1.7 MB). Please compress/resize it (ideally <500 KB) to reduce page weight and improve load performance.

Suggested change
![Diagram showing two concentric defensive layers protecting workloads during NAP node consolidation](./hero-image.png)
![Diagram showing two concentric defensive layers protecting workloads during NAP node consolidation](./hero-image.webp)

Copilot uses AI. Check for mistakes.
Comment on lines +13 to +15

- How do I control when scale downs happen, or where it shouldn't?
- How do I control workload disruption so it happens predictably (and not in the middle of business hours)?
Copy link

Copilot AI Apr 2, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This bullet is grammatically incomplete ("where it shouldn't?"). Consider rephrasing to include the missing verb/object (for example, "where it shouldn't happen").

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants