<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Posts on Awsome-Distributed-AI Deep Dives by the AWS Frameworks Team</title><link>https://awslabs.github.io/awsome-distributed-ai/posts/</link><description>Recent content in Posts on Awsome-Distributed-AI Deep Dives by the AWS Frameworks Team</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 17 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://awslabs.github.io/awsome-distributed-ai/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>Multi-Tenant Slurm on AWS ParallelCluster, Part 2: QoS Deep Dive</title><link>https://awslabs.github.io/awsome-distributed-ai/posts/slurm-qos-on-parallelcluster/</link><pubDate>Sun, 17 May 2026 00:00:00 +0000</pubDate><guid>https://awslabs.github.io/awsome-distributed-ai/posts/slurm-qos-on-parallelcluster/</guid><description>Use Slurm QoS — associations, limits, fairshare, preemption, partition QoS — to govern shared GPU capacity across teams on top of the multi-user ParallelCluster from Part 1.</description></item><item><title>Multi-Tenant Slurm on AWS ParallelCluster, Part 1: Accounting Database + Multi-User Setup</title><link>https://awslabs.github.io/awsome-distributed-ai/posts/slurm-accounting-multi-user-on-parallelcluster/</link><pubDate>Sat, 16 May 2026 00:00:00 +0000</pubDate><guid>https://awslabs.github.io/awsome-distributed-ai/posts/slurm-accounting-multi-user-on-parallelcluster/</guid><description>Stand up a Slurm accounting database on Aurora Serverless v2, wire it into AWS ParallelCluster, and onboard multiple users without LDAP. Part 1 of a two-post series; Part 2 adds QoS on top.</description></item><item><title>Building Blocks for Foundation Model Training and Inference on AWS</title><link>https://awslabs.github.io/awsome-distributed-ai/posts/building-blocks-fm-training-aws/</link><pubDate>Thu, 07 May 2026 00:00:00 +0000</pubDate><guid>https://awslabs.github.io/awsome-distributed-ai/posts/building-blocks-fm-training-aws/</guid><description>A four-layer view of the foundation-model lifecycle on AWS — accelerator compute and EFA networking, resource orchestration with Slurm and Kubernetes, the ML software stack, and observability.</description></item></channel></rss>