AWS HealthOmics MCP Server

A Model Context Protocol (MCP) server that provides AI assistants with comprehensive access to AWS HealthOmics services for genomic workflow management, execution, and analysis.

Overview

AWS HealthOmics is a purpose-built service for storing, querying, and analyzing genomic, transcriptomic, and other omics data. This MCP server enables AI assistants to interact with HealthOmics workflows through natural language, making genomic data analysis more accessible and efficient.

Key Capabilities

This MCP server provides tools for:

🧬 Workflow Management

Create and validate workflows: Support for WDL, CWL, and Nextflow workflow languages
Version management: Create and manage workflow versions with different configurations
Package workflows: Bundle workflow definitions into deployable packages

🚀 Workflow Execution

Start and monitor runs: Execute workflows with custom parameters and monitor progress
Task management: Track individual workflow tasks and their execution status
Resource configuration: Configure compute resources, storage, and caching options

📊 Analysis and Troubleshooting

Performance analysis: Analyze workflow execution performance and resource utilization
Failure diagnosis: Comprehensive troubleshooting tools for failed workflow runs
Log access: Retrieve detailed logs from runs, engines, tasks, and manifests

🌍 Region Management

Multi-region support: Get information about AWS regions where HealthOmics is available

Available Tools

Workflow Management Tools

ListAHOWorkflows - List available HealthOmics workflows with pagination support
CreateAHOWorkflow - Create new workflows with WDL, CWL, or Nextflow definitions
GetAHOWorkflow - Retrieve detailed workflow information and export definitions
CreateAHOWorkflowVersion - Create new versions of existing workflows
ListAHOWorkflowVersions - List all versions of a specific workflow
PackageAHOWorkflow - Package workflow files into base64-encoded ZIP format

Workflow Execution Tools

StartAHORun - Start workflow runs with custom parameters and resource configuration
ListAHORuns - List workflow runs with filtering by status and date ranges
GetAHORun - Retrieve detailed run information including status and metadata
ListAHORunTasks - List tasks for specific runs with status filtering
GetAHORunTask - Get detailed information about specific workflow tasks

Analysis and Troubleshooting Tools

AnalyzeAHORunPerformance - Analyze workflow run performance and resource utilization
DiagnoseAHORunFailure - Comprehensive diagnosis of failed workflow runs with remediation suggestions
GetAHORunLogs - Access high-level workflow execution logs and events
GetAHORunEngineLogs - Retrieve workflow engine logs (STDOUT/STDERR) for debugging
GetAHORunManifestLogs - Access run manifest logs with runtime information and metrics
GetAHOTaskLogs - Get task-specific logs for debugging individual workflow steps

Region Management Tools

GetAHOSupportedRegions - List AWS regions where HealthOmics is available

Instructions for AI Assistants

This MCP server enables AI assistants to help users with AWS HealthOmics genomic workflow management. Here's how to effectively use these tools:

Understanding AWS HealthOmics

AWS HealthOmics is designed for genomic data analysis workflows. Key concepts:

Workflows: Computational pipelines written in WDL, CWL, or Nextflow that process genomic data
Runs: Executions of workflows with specific input parameters and data
Tasks: Individual steps within a workflow run
Storage Types: STATIC (fixed storage) or DYNAMIC (auto-scaling storage)

Workflow Management Best Practices

Creating Workflows:
- Use PackageAHOWorkflow to bundle workflow files before creating
- Validate workflows with appropriate language syntax (WDL, CWL, Nextflow)
- Include parameter templates to guide users on required inputs
Version Management:
- Create new versions for workflow updates rather than modifying existing ones
- Use descriptive version names that indicate changes or improvements
- List versions to help users choose the appropriate one

Workflow Execution Guidance

Starting Runs:
- Always specify required parameters: workflow_id, role_arn, name, output_uri
- Choose appropriate storage type (DYNAMIC recommended for most cases)
- Use meaningful run names for easy identification
- Configure caching when appropriate to save costs and time
Monitoring Runs:
- Use ListAHORuns with status filters to track active workflows
- Check individual run details with GetAHORun for comprehensive status
- Monitor tasks with ListAHORunTasks to identify bottlenecks

Troubleshooting Failed Runs

When workflows fail, follow this diagnostic approach:

Start with DiagnoseAHORunFailure: This comprehensive tool provides:
- Failure reasons and error analysis
- Failed task identification
- Log summaries and recommendations
- Actionable troubleshooting steps
Access Specific Logs:
- Run Logs: High-level workflow events and status changes
- Engine Logs: Workflow engine STDOUT/STDERR for system-level issues
- Task Logs: Individual task execution details for specific failures
- Manifest Logs: Resource utilization and workflow summary information
Performance Analysis:
- Use AnalyzeAHORunPerformance to identify resource bottlenecks
- Review task resource utilization patterns
- Optimize workflow parameters based on analysis results

Common Use Cases

Workflow Development:

User: "Help me create a new genomic variant calling workflow"
→ Use CreateAHOWorkflow with WDL/CWL/Nextflow definition
→ Package workflow files appropriately
→ Validate syntax and parameters

Production Execution:

User: "Run my alignment workflow on these FASTQ files"
→ Use StartAHORun with appropriate parameters
→ Monitor with ListAHORuns and GetAHORun
→ Track task progress with ListAHORunTasks

Troubleshooting:

User: "My workflow failed, what went wrong?"
→ Use DiagnoseAHORunFailure for comprehensive analysis
→ Access specific logs based on failure type
→ Provide actionable remediation steps

Performance Optimization:

User: "How can I make my workflow run faster?"
→ Use AnalyzeAHORunPerformance to identify bottlenecks
→ Review resource utilization patterns
→ Suggest optimization strategies

Important Considerations

IAM Permissions: Ensure proper IAM roles with HealthOmics permissions
Regional Availability: Use GetAHOSupportedRegions to verify service availability
Cost Management: Monitor storage and compute costs, especially with STATIC storage
Data Security: Follow genomic data handling best practices and compliance requirements
Resource Limits: Be aware of service quotas and limits for concurrent runs

Error Handling

When tools return errors:

Check AWS credentials and permissions
Verify resource IDs (workflow_id, run_id, task_id) are valid
Ensure proper parameter formatting and required fields
Use diagnostic tools to understand failure root causes
Provide clear, actionable error messages to users

Installation

Cursor	VS Code

Install using uvx:

uvx awslabs.aws-healthomics-mcp-server

Or install from source:

git clone <repository-url>
cd mcp/src/aws-healthomics-mcp-server
uv sync
uv run -m awslabs.aws_healthomics_mcp_server.server

Configuration

Environment Variables

AWS_REGION - AWS region for HealthOmics operations (default: us-east-1)
AWS_PROFILE - AWS profile for authentication
FASTMCP_LOG_LEVEL - Server logging level (default: WARNING)

AWS Credentials

This server requires AWS credentials with appropriate permissions for HealthOmics operations. Configure using:

AWS CLI: aws configure
Environment variables: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY
IAM roles (recommended for EC2/Lambda)
AWS profiles: Set AWS_PROFILE environment variable

Required IAM Permissions

The following IAM permissions are required:

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "omics:ListWorkflows",
                "omics:CreateWorkflow",
                "omics:GetWorkflow",
                "omics:CreateWorkflowVersion",
                "omics:ListWorkflowVersions",
                "omics:StartRun",
                "omics:ListRuns",
                "omics:GetRun",
                "omics:ListRunTasks",
                "omics:GetRunTask",
                "logs:DescribeLogGroups",
                "logs:DescribeLogStreams",
                "logs:GetLogEvents"
            ],
            "Resource": "*"
        },
        {
            "Effect": "Allow",
            "Action": [
                "iam:PassRole"
            ],
            "Resource": "arn:aws:iam::*:role/HealthOmicsExecutionRole*"
        }
    ]
}

Usage with MCP Clients

Claude Desktop

Add to your Claude Desktop configuration:

{
  "mcpServers": {
    "aws-healthomics": {
      "command": "uvx",
      "args": ["awslabs.aws-healthomics-mcp-server"],
      "env": {
        "AWS_REGION": "us-east-1",
        "AWS_PROFILE": "your-profile"
      }
    }
  }
}

Other MCP Clients

Configure according to your client's documentation, using:

Command: uvx
Args: ["awslabs.aws-healthomics-mcp-server"]
Environment variables as needed

Development

Setup

git clone <repository-url>
cd aws-healthomics-mcp-server
uv sync

Testing

# Run tests with coverage
uv run pytest --cov --cov-branch --cov-report=term-missing

# Run specific test file
uv run pytest tests/test_server.py -v

Code Quality

# Format code
uv run ruff format

# Lint code
uv run ruff check

# Type checking
uv run pyright

Contributing

Contributions are welcome! Please see the contributing guidelines for more information.

License

This project is licensed under the Apache-2.0 License. See the LICENSE file for details.

Overview​

Key Capabilities​

🧬 Workflow Management​

🚀 Workflow Execution​

📊 Analysis and Troubleshooting​

🌍 Region Management​

Available Tools​

Workflow Management Tools​

Workflow Execution Tools​

Analysis and Troubleshooting Tools​

Region Management Tools​

Instructions for AI Assistants​

Understanding AWS HealthOmics​

Workflow Management Best Practices​

Workflow Execution Guidance​

Troubleshooting Failed Runs​

Common Use Cases​

Important Considerations​

Error Handling​

Installation​

Configuration​

Environment Variables​

AWS Credentials​

Required IAM Permissions​

Usage with MCP Clients​

Claude Desktop​

Other MCP Clients​

Development​

Setup​

Testing​

Code Quality​

Contributing​

License​