Patch Management

Patch Manager, a capability of Systems Manager, allows you to automate the process of patching managed nodes with security related updates. You can patch Amazon EC2 instances, edge devices, and on-premises servers and virtual machines (VMs), including VMs in other cloud environments.

What makes patching hard?

Creating a patching strategy can be challenging for organizations. To start, patch management is dependent on having a current and complete inventory of the patchable software, including applications and operating systems that's installed on each node within the company's environment. Secondly, enterprise patch management can cause some resources to be overloaded in terms of both people and infrastructure.

Next, installing patches can cause side effects to occur. Another common challenge which often cause organizations to err on the side of caution, is unintended or unexpected problems that are caused by installing patches. It can be surprisingly difficult to examine a node and determine whether or not a particular patch has actually taken effect. This challenge could be faced on a single node, or if you extrapolate that out across an entire organization fleet of node and operating systems, the scale of that challenge can quickly become very overwhelming.

Making things better

Prioritizing patching

To help address some of the common challenges, start by prioritizing specific patches through classifications to identify a small subset of patches you must prioritize. To do this, determine what workloads or applications are most critical for your business and then determine which patches make the most difference to those workloads. For example, email servers, databases, web applications, customer facing digital properties, etc.

How it works

From there you can create patch baselines for each workload which helps determine applicable patches to be marked missing when performing patch scan operations. Scanning helps you determine the level of compliance against the baselines that you've established.

You can then begin to schedule recurring patch install operations to apply updates during routine maintenance periods or install updates on-demand during emergent patch releases. Following patch installation, you can confirm the result using patch compliance data provided by Patch Manager.

What occurs within the OS during patching?

A common question from customers is how does Patch Manager scan or install patches? When a patch operation is initiated, whether scheduled or ad-hoc, the operation is queued in Systems Manager endpoints. The SSM agent then retrieves the command to either scan or install. SSM agent retrieves patch baseline approval rules and initiates a scan or install using the local OS package manager, i.e. Windows Update, yum, apt-get. Once the operation is complete, SSM agent reports back patch compliance data to Patch Manager.

Patch Management OS Patching

Connectivity to the patch source

If your managed nodes don't have a direct connection to the Internet and you're using an Amazon Virtual Private Cloud (Amazon VPC) with a VPC endpoint, you must ensure that the nodes have access to the source patch repositories (repos).

On Linux nodes, patch updates are typically downloaded from the remote repos configured on the node. Therefore, the node must be able to connect to the repos so the patching can be performed. Windows Server managed nodes must be able to connect to the Windows Update Catalog or Windows Server Update Services (WSUS). For more information, see Patch Manager prerequisites.

Defining patch criteria

Patch Manager provides predefined patch baselines for each of the operating systems supported by Patch Manager. You can use these baselines as they are currently configured (you can't customize them) or you can create your own custom patch baselines. Custom patch baselines allows you greater control over which patches are approved or rejected for your environment.

Within a custom patch baseline you can:

Define what patches are approved
Use auto-approval delays for cutoffs
Define patch exceptions
Define custom patch repositories for Linux
Define patch criteria for multiple operating system versions

Different types of patching

There are two general approaches that you can take with your patching solution: centralized or decentralized.

Centralized Patching	Decentralized Patching
Central team deploys patch scan operations	Shifts more responsibility to application / account owner
Central team deploys patch install operations	Central team deploys patch scan operations & compliance reporting is still centralized
Limited flexibility around schedule and operations performed	Owners responsible for patch install operations & central team can provide building blocks, i.e. via AWS Service Catalog
Central team typically responsible for troubleshooting	Allows owner to define schedule for install
More common in highly regulated or secured environments	Central team should have an on-demand patch install override

Example centralized patching solutions for multi-account organizations

Option 1: A centralized patching solution can be established using Quick Setup Patch Policy configurations. Patch policies enable customers to scan and schedule patch installation for multiple patch baselines across AWS accounts and across AWS Regions. For more information, see Patching across an AWS Organization - Patch Policies.

Patch Management Centralized Patching Option 1

Option 2: Another option for a centralized solution is to schedule a multi-account and multi-Region patching operation using a combination of Amazon EventBridge, AWS Lambda, and Systems Manager Automation. For more information, see Scheduling centralized multi-account and multi-Region patching with AWS Systems Manager Automation.

Patch Management Centralized Patching Option 2

Example decentralized self-service patching solution for multi-account organizations

Different application owners may have different requirements in terms of patch operations, patch timing, frequency of patching, and flexibility of testing patches in lower environments (DEV or UAT). Using AWS Service Catalog, central teams can create products which act as the building blocks for self-service patching. Application/account owners can then deploy these products into their environment and only have to provide a few parameters, such as the schedule, without having to build a solution themselves. For more information, see A self-service patching solution for multi-account organizations.

Self-service patching using Service Catalog

Patch in place vs Rehydration

Rehydration (repaving, refreshing) is the process of spinning up new servers with latest patches installed and decommissioning old nodes. A common practice for EC2 instances in an Auto Scaling Group, managed node groups in a container cluster (ECS / EKS), and AMIs that are preconfigured with application workload requirements.

Patch in place	Rehydration
Commonly performed at a higher frequency than rehydration (weekly, bi-weekly)	Commonly performed monthly or quarterly. Some customers perform it every 2 weeks!
Ideal for long-standing nodes that cannot be easily replaced (mutable)	Ideal for workloads which do not require much post-launch configuration (immutable)
Patch install workflow may require backups to be taken	Use services like EC2 Image Builder to integrate with Auto Scaling groups
	May still require a mechanism to patch in place. For example, if a zero-day vulnerability patch is released but nodes cannot be replaced until the next rehydration cycle

You may require both methods, patch in place and rehydration, within your environment depending on the application workload.

Patching across an AWS Organization - Patch Policies

To standardize patching requirements in an AWS Organization, you can use patch policies within Quick Setup. You can apply a patch policy across an entire organization for multiple operating systems, across multiple accounts and Regions, and review resource compliance for the target managed nodes.

Using Quick Setup across multiple accounts helps to ensure that your organization maintains consistent configurations. Additionally, Quick Setup periodically checks for configuration drift and attempts to remediate it. Configuration drift occurs whenever a user makes any change to a service or feature that conflicts with the selections made through Quick Setup.

Patch Policy architecture

How it works

You create the patch policy using Quick Setup and the parameters selected are sent to CloudFormation.
CloudFormation creates a stack set with the defined parameters and defined target accounts and Regions. This is generated by Quick Setup during deployment.
CloudFormation creates stack instances in each target account and Region.
The stack instances create State Manager associations for the defined patch scan and an association for patch installation, if selected. These associations are applied using the schedules provided when you create the patch policy.
In the management account, a State Manager association starts an Automation runbook to invoke a Lambda function once a day.
The Lambda function stores the patch baselines specified as a JSON file in an S3 bucket. Additionally, the Lambda function evalutes the custom patch baselines specified within Quick Setup for any changes. If changes are made to the custom patch baselines, the Lambda function updates the JSON file in the S3 bucket.
Manage nodes then pull the central patch baseline JSON file during patching operations to scan for or install updates.

Note: Currently, to deploy Patch Policies through Quick Setup you must use the management account within your AWS Organization. To deploy Patch Policies outside of the management account, visit How to deploy Patch Policies outside of Quick Setup.

On-demand patching

There are times where you may need to patch nodes outside of your routine patching cycles such as in emergent vulnerability scenarios.

Option 1: Patch now (single-account/Region)

Using the Patch now option in Patch Manager, you can run on-demand patching operations quickly. However, Patch now only allows patching within a single AWS account and Region at a time. It also cannot use patch baselines defined within patch policies. You can create a different baseline that will perform a patch scan or install applicable patches based on approval rules that differs from your patch policy baselines.

Option 2: Automation (multi-account/Region)

To perform an on-demand patching operation across accounts and Regions, you can utilize Automation which supports running automations in multiple AWS Regions and accounts. You can leverage IAM roles deployed into target accounts to perform actions. You can integrate with Patch Policies or stand alone patching requirements.

Integrating vulnerability management and remediation

Amazon Inspector provides continuous vulnerability scans on Amazon EC2 instances and container images stored in Amazon Elastic Container Registry (Amazon ECR). These scans assess software vulnerabilities and unintended network exposure. Amazon Inspector uses the Systems Manager (SSM) agent to collect software application inventory of the EC2 instances. Then, Inspector scans this data and identifies software vulnerabilities, a crucial step in vulnerability management.

You should perform regular patching operations for resolving vulnerabilities identified by Amazon Inspector based on the severity of the vulnerabilities. You can use AWS Systems Manager Patch Manager to automate the process of patching nodes managed by Systems Manager using the SSM agent.

There may be zero-day or other high and critical severity vulnerabilities where patches are available. However, you may not want to wait for the regular patching schedule to remediate them. In these cases, on-demand mechanisms for patching should exist.

To learn more, see:

Automate vulnerability management and remediation

Reviewing patch compliance

The Patch Manager dashboard provides a snapshot of patch compliance within the current AWS account and Region. Compliance reporting allows you to determine patch compliance for nodes. You can also review more details on what patches were installed and what was the severity and criticality of those patches using the Fleet Manager console.

While these views are specific to the local AWS account and Region, you can create centralized patch compliance reporting for the entire AWS Organization.

Creating end-to-end patch management and inventory reporting in an AWS Organization

tip

Did you know you can use Amazon Quick Suite to reduce a multi-step manual process into a few simple prompts, enabling you to quickly generate insightful patching compliance and inventory visualizations. Discover how AI-powered capabilities help you create dynamic dashboards, saving valuable time while maintaining accuracy and providing real-time insights into your organization's patching status in the blog: Building enterprise patching and inventory dashboards using Amazon Quick Suite.

To create a report on patch compliance across your AWS Organization, you can use Systems Manager resource data syncs to send inventory data collected from all of your managed nodes to a single Amazon S3 bucket. Resource data sync then automatically updates the centralized data when new inventory data is collected.

Using an AWS Glue crawler, you can automatically create databases and tables from the patch compliance data in S3 and then query patch compliance data with Amazon Athena. This solution utilizes Amazon QuickSight to visualize the inventory and patch compliance data, however, you can use any BI or analytics tool that can pull the data from the S3 bucket.

Note: You need to create a resource data sync in every account and Region that you want to collect inventory data from your nodes.

End-to-end patch management reporting

Create Systems Manager resource data syncs in each account/Region.
Centrally aggregate patch compliance data in a single Amazon S3 Bucket.
Automatically create database and tables using an AWS Glue Crawler.
Query patch or inventory data using Amazon Athena.
Visualize patch compliance using Amazon QuickSight.

Understanding AWS Systems Manager Inventory Metadata

Resource data syncs push data to S3 buckets based on actions taken from on-demand actions (Registering or terminating instances /performing a patch scan or install), scheduled actions (Gathering software inventory, gathering custom inventory metadata, performing a patch install, and evaluating compliance using Chef InSpec).

Inventory metadata

Source: Understanding AWS Systems Manager Inventory Metadata

What makes patching hard?​

Making things better​

What occurs within the OS during patching?​

Connectivity to the patch source​

Defining patch criteria​

Different types of patching​

Example centralized patching solutions for multi-account organizations​

Example decentralized self-service patching solution for multi-account organizations​

Patch in place vs Rehydration​

Patching across an AWS Organization - Patch Policies​

How it works​

On-demand patching​

Integrating vulnerability management and remediation​

Reviewing patch compliance​

Creating end-to-end patch management and inventory reporting in an AWS Organization​

Understanding AWS Systems Manager Inventory Metadata​

What makes patching hard?

Making things better

What occurs within the OS during patching?

Connectivity to the patch source

Defining patch criteria

Different types of patching

Example centralized patching solutions for multi-account organizations

Example decentralized self-service patching solution for multi-account organizations

Patch in place vs Rehydration

Patching across an AWS Organization - Patch Policies

How it works

On-demand patching

Integrating vulnerability management and remediation

Reviewing patch compliance

Creating end-to-end patch management and inventory reporting in an AWS Organization

Understanding AWS Systems Manager Inventory Metadata