KServe

Model serving using KServe with Kubeflow on AWS

Configure inference services to access AWS services, such as pulling images from a private ECR repository and downloading models from an S3 bucket.
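As a rough illustration of the S3 case, the minimal sketch below builds a KServe `InferenceService` manifest that loads a model from an S3 URI under a dedicated Kubernetes service account. The helper `build_inference_service`, the resource names, the bucket path, and the `sklearn` model format are all hypothetical placeholders. The sketch also assumes the service account has already been granted S3 access (for example through an IAM role or a credentials secret), which is not shown here.

```python
import json


def build_inference_service(name, namespace, storage_uri, service_account,
                            model_format="sklearn"):
    """Return a KServe v1beta1 InferenceService manifest as a plain dict.

    All argument values are illustrative; nothing is sent to a cluster.
    """
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "predictor": {
                # Service account assumed to carry the AWS permissions
                # needed to read the model artifacts from S3.
                "serviceAccountName": service_account,
                "model": {
                    "modelFormat": {"name": model_format},
                    # Location of the model artifacts in S3.
                    "storageUri": storage_uri,
                },
            }
        },
    }


if __name__ == "__main__":
    manifest = build_inference_service(
        name="example-model",                       # hypothetical name
        namespace="example-namespace",              # hypothetical namespace
        storage_uri="s3://example-bucket/models/example",  # hypothetical path
        service_account="example-s3-reader",        # hypothetical account
    )
    # JSON is also valid YAML, so the output can be saved to a file
    # and applied with kubectl.
    print(json.dumps(manifest, indent=2))
```

Keeping the manifest as a plain dictionary avoids any dependency on a Kubernetes client library; the same structure could equally be written by hand as YAML.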

Serve prediction requests using Knative Serving and AWS Load Balancer

Run inference on Kubeflow on AWS with AWS Deep Learning Containers