IPv6 Networking
This guide explains how to deploy and run Apache Spark on an IPv6-enabled Amazon EKS cluster. It assumes you have already cloned the data-on-eks repository and deployed the Spark stack.
Step 1: Enable IPv6 in Terraform
To deploy an IPv6-enabled cluster, open the data-stacks/spark-on-eks/terraform/data-stack.tfvars file and set the enable_ipv6 variable to true.
enable_ipv6 = true
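If you prefer to script the change, a one-liner such as the following works. This is only a sketch: it assumes the variable is already present in the file with a value of false.

# GNU sed shown; on macOS use: sed -i ''
sed -i 's/enable_ipv6 = false/enable_ipv6 = true/' data-stacks/spark-on-eks/terraform/data-stack.tfvars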
Setting this variable to true automatically configures the following:
- EKS Cluster: The EKS cluster and its networking components are provisioned with an IPv6 CIDR block.
- Spark Operator: The Spark Operator controller is configured with the JVM argument -Djava.net.preferIPv6Addresses=true so that it communicates over IPv6.
After enabling the setting, run the deployment script from the data-stacks/spark-on-eks directory:
./deploy.sh
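Once the script finishes, you can spot-check that the operator picked up the IPv6 JVM flag. The namespace below is an assumption; adjust it to match your install:

kubectl -n spark-operator get deploy -o yaml | grep -i preferIPv6Addresses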
Step 2: Verify the Deployment (Optional)
Once deployment is complete, you can verify that the cluster is running in IPv6 mode.
You can check the internal IP addresses of your Kubernetes nodes. The output should show IPv6 addresses in the INTERNAL-IP column.
kubectl get node -o custom-columns='NODE_NAME:.metadata.name,INTERNAL-IP:.status.addresses[?(@.type=="InternalIP")].address'
# example output
NODE_NAME INTERNAL-IP
ip-10-1-0-212.us-west-2.compute.internal 2600:1f13:520:1303:c87:4a71:b9ea:417c
ip-10-1-26-137.us-west-2.compute.internal 2600:1f13:520:1304:15b2:b8a3:7f63:cbfa
ip-10-1-46-28.us-west-2.compute.internal 2600:1f13:520:1305:5ee5:b994:c0c2:e4da
You can also inspect pod IPs to confirm they are receiving IPv6 addresses.
kubectl get pods -A -o custom-columns='NAME:.metadata.name,NodeIP:.status.hostIP,PodIP:.status.podIP'
# example output
NAME NodeIP PodIP
karpenter-5fd95dffb8-l8j26 2600:1f13:520:1304:15b2:b8a3:7f63:cbfa 2600:1f13:520:1304:a79b::
karpenter-5fd95dffb8-qpv55 2600:1f13:520:1303:c87:4a71:b9ea:417c 2600:1f13:520:1303:60ac::
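Finally, Kubernetes Services should also receive IPv6 cluster IPs. The built-in kubernetes Service in the default namespace is a convenient check:

# the cluster IP should be an IPv6 address rather than a 10.x or 172.x IPv4 address
kubectl get svc kubernetes -n default -o jsonpath='{.spec.clusterIP}{"\n"}'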
Step 3: Configure Spark Jobs for IPv6
The example manifest at data-stacks/spark-on-eks/examples/pyspark-pi-job.yaml includes the required IPv6 settings commented out. You must uncomment them for your jobs.
A. Update SparkConf for the Driver Service
This ensures the Spark driver's network service gets an IPv6 address, allowing executors to connect to it.
spec:
  sparkConf:
    # ...
    # IPv6 configurations
    "spark.kubernetes.driver.service.ipFamilies": "IPv6"
    "spark.kubernetes.driver.service.ipFamilyPolicy": "SingleStack"
B. Configure IMDS Endpoint for Driver and Executors
This is critical for allowing Spark to securely access other AWS services like Amazon S3.
The EC2 Instance Metadata Service (IMDS) runs on every EC2 instance and provides data about the instance, such as its ID. It is also how AWS SDKs fetch the temporary credentials used to access services like Amazon S3.
By default, AWS SDKs connect to IMDS using its fixed IPv4 endpoint (http://169.254.169.254). In an IPv6 cluster, pods may be in a network environment where they cannot reach this IPv4 address.
Setting the AWS_EC2_METADATA_SERVICE_ENDPOINT_MODE environment variable to IPv6 instructs the AWS SDK to use the IPv6 endpoint (http://[fd00:ec2::254]) instead. This feature is only supported on Nitro-based EC2 instances.
You must set this environment variable for both the driver and executor pods.
spec:
  driver:
    # ...
    # instruct the Java SDK to use IPv6 when talking to the IMDS
    env:
      - name: AWS_EC2_METADATA_SERVICE_ENDPOINT_MODE
        value: IPv6
---
spec:
  executor:
    # ...
    # instruct the Java SDK to use IPv6 when talking to the IMDS
    env:
      - name: AWS_EC2_METADATA_SERVICE_ENDPOINT_MODE
        value: IPv6
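If you want to verify reachability by hand, you can exec into a running driver or executor pod and query the IPv6 endpoint directly using the IMDSv2 token flow. This is a sketch and assumes the node's metadata options have the IPv6 endpoint enabled:

# run inside a driver or executor pod, e.g. via kubectl exec
TOKEN=$(curl -s -X PUT "http://[fd00:ec2::254]/latest/api/token" \
  -H "X-aws-ec2-metadata-token-ttl-seconds: 300")
curl -s -H "X-aws-ec2-metadata-token: $TOKEN" \
  "http://[fd00:ec2::254]/latest/meta-data/instance-id"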
If you do not configure the IMDS endpoint, any Spark job that interacts with Amazon S3 will fail with an authentication error, because the SDK cannot reach IMDS to obtain credentials.
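With both settings uncommented, submit the example job and watch the pods start:

kubectl apply -f data-stacks/spark-on-eks/examples/pyspark-pi-job.yaml
# add -n <namespace> if the example runs in a dedicated namespace
kubectl get pods -w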
Cleanup
To avoid ongoing charges, clean up the resources by following the instructions in the main Infrastructure Deployment Guide. The same process applies to both IPv4 and IPv6 deployments.