NEW Browse AI tools across categories — updated daily. See what's new →

AWS Cost & Operations

This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.

Authorzxkane
Version1.0.0
LicenseMIT
Token count~2,774
UpdatedJun 5, 2026

Install

Quick install

via npx skills · works with 57+ agents
npx skills add https://github.com/zxkane/aws-skills/tree/main/plugins/aws-cost-ops/skills/aws-cost-operations
Or pick agent:
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent claude-code
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent cursor
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent codex
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent opencode
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent github-copilot
npx skills add zxkane/aws-skills --skill "AWS Cost & Operations" --agent windsurf
More install options

Shorthand — useful for multi-skill repos:

npx skills add zxkane/aws-skills --skill "AWS Cost & Operations"

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/zxkane/aws-skills.git
cp -r aws-skills/plugins/aws-cost-ops/skills/aws-cost-operations ~/.claude/skills/
How to use: Once installed, ask your agent to "use the AWS Cost & Operations skill" or describe what you want (e.g. "This skill provides AWS cost optimization, monitoring, and operational best prac"). Requires Node.js 18+.

AWS Cost & Operations

This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.

AWS Cost & Operationsby zxkane

This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.

npx skills add https://github.com/zxkane/aws-skills --skill aws-cost-operationsDownload ZIPGitHub

AWS Cost & Operations

This skill provides comprehensive guidance for AWS cost optimization, monitoring, observability, and operational excellence with integrated MCP servers.

AWS Documentation Requirement

Always verify AWS facts using MCP tools (mcp__aws-mcp__ or mcp__awsdocs__) before answering. The aws-mcp-setup dependency is auto-loaded — if MCP tools are unavailable, guide the user through that skill's setup flow.

Integrated MCP Servers

This plugin provides 3 MCP servers:

Bundled Servers

  1. AWS Pricing MCP Server (pricing)
Purpose: Pre-deployment cost estimation and optimization
  • Estimate costs before deploying resources
  • Compare pricing across regions
  • Calculate Total Cost of Ownership (TCO)
  • Evaluate different service options for cost efficiency
  1. AWS Cost Explorer MCP Server (costexp)
Purpose: Detailed cost analysis and reporting
  • Analyze historical spending patterns
  • Identify cost anomalies and trends
  • Forecast future costs
  • Analyze cost by service, region, or tag
  1. Amazon CloudWatch MCP Server (cw)
Purpose: Metrics, alarms, and logs analysis
  • Query CloudWatch metrics and logs
  • Create and manage CloudWatch alarms
  • Troubleshoot operational issues
  • Monitor resource utilization

Note: The following servers are available separately via the Full AWS MCP Server (see aws-mcp-setup skill) and are not bundled with this plugin:

  • AWS Billing and Cost Management MCP — Real-time billing details
  • CloudWatch Application Signals MCP — APM and SLOs
  • AWS Managed Prometheus MCP — PromQL queries for containers
  • AWS CloudTrail MCP — API activity audit
  • AWS Well-Architected Security Assessment MCP — Security posture assessment

When to Use This Skill

Use this skill when:

  • Optimizing AWS costs and reducing spending
  • Estimating costs before deployment
  • Monitoring application and infrastructure performance
  • Setting up observability and alerting
  • Analyzing spending patterns and trends
  • Investigating operational issues
  • Auditing AWS activity and changes
  • Assessing security posture
  • Implementing operational excellence

Cost Optimization Best Practices

Pre-Deployment Cost Estimation

Always estimate costs before deploying:

  • Use AWS Pricing MCP to estimate resource costs
  • Compare pricing across different regions
  • Evaluate alternative service options
  • Calculate expected monthly costs
  • Plan for scaling and growth

Example workflow:

`"Estimate the monthly cost of running a Lambda function with
1 million invocations, 512MB memory, 3-second duration in us-east-1"
`

Cost Analysis and Optimization

Regular cost reviews:

  • Use Cost Explorer MCP to analyze spending trends
  • Identify cost anomalies and unexpected charges
  • Review costs by service, region, and environment
  • Compare actual vs. budgeted costs
  • Generate cost optimization recommendations

Cost optimization strategies:

  • Right-size over-provisioned resources
  • Use appropriate storage classes (S3, EBS)
  • Implement auto-scaling for dynamic workloads
  • Leverage Savings Plans and Reserved Instances
  • Delete unused resources and snapshots
  • Use cost allocation tags effectively

Budget Monitoring

Track spending against budgets:

  • Use Billing and Cost Management MCP to monitor budgets
  • Set up budget alerts for threshold breaches
  • Review budget utilization regularly
  • Adjust budgets based on trends
  • Implement cost controls and governance

Monitoring and Observability Best Practices

CloudWatch Metrics and Alarms

Implement comprehensive monitoring:

  • Use CloudWatch MCP to query metrics and logs
  • Set up alarms for critical metrics:
  • CPU and memory utilization
  • Error rates and latency
  • Queue depths and processing times
  • API gateway throttling
  • Lambda errors and timeouts
  • Create CloudWatch dashboards for visualization
  • Use log insights for troubleshooting

Example alarm scenarios:

  • Lambda error rate > 1%
  • EC2 CPU utilization > 80%
  • API Gateway 4xx/5xx error spike
  • DynamoDB throttled requests
  • ECS task failures

Application Performance Monitoring

Monitor application health:

  • Use CloudWatch Application Signals MCP for APM
  • Track service-level objectives (SLOs)
  • Monitor application dependencies
  • Identify performance bottlenecks
  • Set up distributed tracing

Container and Kubernetes Monitoring

For containerized workloads:

  • Use AWS Managed Prometheus MCP for metrics
  • Monitor container resource utilization
  • Track pod and node health
  • Create PromQL queries for custom metrics
  • Set up alerts for container anomalies

Audit and Security Best Practices

CloudTrail Activity Analysis

Audit AWS activity:

  • Use CloudTrail MCP to analyze API activity
  • Track who made changes to resources
  • Investigate security incidents
  • Monitor for suspicious activity patterns
  • Audit compliance with policies

Common audit scenarios:

  • "Who deleted this S3 bucket?"
  • "Show all IAM role changes in the last 24 hours"
  • "List failed login attempts"
  • "Find all actions by a specific user"
  • "Track modifications to security groups"

Security Assessment

Regular security reviews:

  • Use Well-Architected Security Assessment MCP
  • Assess security posture against best practices
  • Identify security gaps and vulnerabilities
  • Implement recommended security improvements
  • Document security compliance

Security assessment areas:

  • Identity and Access Management (IAM)
  • Detective controls and monitoring
  • Infrastructure protection
  • Data protection and encryption
  • Incident response preparedness

Using MCP Servers Effectively

Cost Analysis Workflow

  • Pre-deployment: Use Pricing MCP to estimate costs
  • Post-deployment: Use Billing MCP to track actual spending
  • Analysis: Use Cost Explorer MCP for detailed cost analysis
  • Optimization: Implement recommendations from Cost Explorer

Monitoring Workflow

  • Setup: Configure CloudWatch metrics and alarms
  • Monitor: Use CloudWatch MCP to track key metrics
  • Analyze: Use Application Signals for APM insights
  • Troubleshoot: Query CloudWatch Logs for issue resolution

Security Workflow

  • Audit: Use CloudTrail MCP to review activity
  • Assess: Use Well-Architected Security Assessment
  • Remediate: Implement security recommendations
  • Monitor: Track security events via CloudWatch

MCP Usage Best Practices

  • Cost Awareness: Check pricing before deploying resources
  • Proactive Monitoring: Set up alarms for critical metrics
  • Regular Reviews: Analyze costs and performance weekly
  • Audit Trails: Review CloudTrail logs for compliance
  • Security First: Run security assessments regularly
  • Optimize Continuously: Act on cost and performance recommendations

Operational Excellence Guidelines

Cost Optimization

  • Tag Everything: Use consistent cost allocation tags
  • Review Monthly: Analyze spending trends and anomalies
  • Right-size: Match resources to actual usage
  • Automate: Use auto-scaling and scheduling
  • Monitor Budgets: Set alerts for cost overruns

Monitoring and Alerting

  • Critical Metrics: Alert on business-critical metrics
  • Noise Reduction: Fine-tune thresholds to reduce false positives
  • Actionable Alerts: Ensure alerts have clear remediation steps
  • Dashboard Visibility: Create dashboards for key stakeholders
  • Log Retention: Balance cost and compliance needs

Security and Compliance

  • Least Privilege: Grant minimum required permissions
  • Audit Regularly: Review CloudTrail logs for anomalies
  • Encrypt Data: Use encryption at rest and in transit
  • Assess Continuously: Run security assessments frequently
  • Incident Response: Have procedures for security events

Additional Resources

For detailed operational patterns and best practices, refer to the comprehensive reference:

File: references/operations-patterns.md

This reference includes:

  • Cost optimization strategies
  • Monitoring and alerting patterns
  • Observability best practices
  • Security and compliance guidelines
  • Troubleshooting workflows

CloudWatch Alarms Reference

File: references/cloudwatch-alarms.md

Common alarm configurations for:

  • Lambda functions
  • EC2 instances
  • RDS databases
  • DynamoDB tables
  • API Gateway
  • ECS services
  • Application Load Balancers

Related Skills

wpdsby firecrawlUse when building UIs leveraging the WordPress Design System (WPDS) and its components, tokens, patterns, etc.assistantby pinecone-ioCreate, manage, and chat with Pinecone Assistants for document Q&A with citations. Handles all assistant operations - create, upload, sync, chat, context…search-for-serviceby coinbaseSearch and discover paid API services available on the x402 bazaar marketplace. Query the marketplace using BM25 relevance search, list all available resources, or inspect specific endpoints to see pricing and payment requirements without paying Supports filtering by network (base, base-sepolia) and output formats (human-readable or JSON) Results are cached locally and auto-refresh every 12 hours; no authentication required for any search or discovery operation Use as a fallback when no...ai-elementsby vercelAI Elements component library guidance — pre-built React components for AI interfaces built on shadcn/ui. Use when building chat UIs, message displays, tool…verify-dotnet-samplesby microsoftHow to build, run and verify the .NET sample projects in the Agent Framework repository. Use this when a user wants to verify that the samples still function…wp-project-triageby wordpressDeterministic WordPress repository inspection with structured JSON output for workflow guidance. Detects project kind (plugin, theme, block theme, WP core, Gutenberg, full site) and outputs a schema-validated JSON report including tooling, tests, and version hints Runs via Node.js detector script at repo root; outputs project.kind , signals , and tooling fields to guide downstream workflows Identifies PHP/Node tooling presence and test frameworks to inform which commands and conventions...data-context-extractorby anthropicA meta-skill that extracts company-specific data knowledge from analysts and generates tailored data analysis skills.applying-brand-guidelinesby anthropicThis skill applies consistent corporate branding and styling to all generated documents including colors, fonts, layouts, and messaging

---

Source: https://github.com/zxkane/aws-skills/tree/main/plugins/aws-cost-ops/skills/aws-cost-operations
Author: zxkane
Discovered via: mcpservers.org

SKILL.md source

---
name: AWS Cost & Operations
description: This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.
---

# AWS Cost & Operations

This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.

# AWS Cost & Operationsby zxkane
This skill provides AWS cost optimization, monitoring, and operational best practices with integrated MCP servers for billing analysis, cost estimation, observability, and security assessment.

`npx skills add https://github.com/zxkane/aws-skills --skill aws-cost-operations`Download ZIPGitHub

## AWS Cost & Operations

This skill provides comprehensive guidance for AWS cost optimization, monitoring, observability, and operational excellence with integrated MCP servers.

## AWS Documentation Requirement

Always verify AWS facts using MCP tools (`mcp__aws-mcp__*` or `mcp__*awsdocs*__*`) before answering. The `aws-mcp-setup` dependency is auto-loaded — if MCP tools are unavailable, guide the user through that skill's setup flow.

## Integrated MCP Servers

This plugin provides 3 MCP servers:

### Bundled Servers

1. AWS Pricing MCP Server (`pricing`)
Purpose: Pre-deployment cost estimation and optimization

* Estimate costs before deploying resources

* Compare pricing across regions

* Calculate Total Cost of Ownership (TCO)

* Evaluate different service options for cost efficiency

2. AWS Cost Explorer MCP Server (`costexp`)
Purpose: Detailed cost analysis and reporting

* Analyze historical spending patterns

* Identify cost anomalies and trends

* Forecast future costs

* Analyze cost by service, region, or tag

3. Amazon CloudWatch MCP Server (`cw`)
Purpose: Metrics, alarms, and logs analysis

* Query CloudWatch metrics and logs

* Create and manage CloudWatch alarms

* Troubleshoot operational issues

* Monitor resource utilization

Note: The following servers are available separately via the Full AWS MCP Server (see `aws-mcp-setup` skill) and are not bundled with this plugin:

* AWS Billing and Cost Management MCP — Real-time billing details

* CloudWatch Application Signals MCP — APM and SLOs

* AWS Managed Prometheus MCP — PromQL queries for containers

* AWS CloudTrail MCP — API activity audit

* AWS Well-Architected Security Assessment MCP — Security posture assessment

## When to Use This Skill

Use this skill when:

* Optimizing AWS costs and reducing spending

* Estimating costs before deployment

* Monitoring application and infrastructure performance

* Setting up observability and alerting

* Analyzing spending patterns and trends

* Investigating operational issues

* Auditing AWS activity and changes

* Assessing security posture

* Implementing operational excellence

## Cost Optimization Best Practices

### Pre-Deployment Cost Estimation

Always estimate costs before deploying:

* Use AWS Pricing MCP to estimate resource costs

* Compare pricing across different regions

* Evaluate alternative service options

* Calculate expected monthly costs

* Plan for scaling and growth

Example workflow:

```
`"Estimate the monthly cost of running a Lambda function with
1 million invocations, 512MB memory, 3-second duration in us-east-1"
`
```

### Cost Analysis and Optimization

Regular cost reviews:

* Use Cost Explorer MCP to analyze spending trends

* Identify cost anomalies and unexpected charges

* Review costs by service, region, and environment

* Compare actual vs. budgeted costs

* Generate cost optimization recommendations

Cost optimization strategies:

* Right-size over-provisioned resources

* Use appropriate storage classes (S3, EBS)

* Implement auto-scaling for dynamic workloads

* Leverage Savings Plans and Reserved Instances

* Delete unused resources and snapshots

* Use cost allocation tags effectively

### Budget Monitoring

Track spending against budgets:

* Use Billing and Cost Management MCP to monitor budgets

* Set up budget alerts for threshold breaches

* Review budget utilization regularly

* Adjust budgets based on trends

* Implement cost controls and governance

## Monitoring and Observability Best Practices

### CloudWatch Metrics and Alarms

Implement comprehensive monitoring:

* Use CloudWatch MCP to query metrics and logs

* Set up alarms for critical metrics:

* CPU and memory utilization

* Error rates and latency

* Queue depths and processing times

* API gateway throttling

* Lambda errors and timeouts

* Create CloudWatch dashboards for visualization

* Use log insights for troubleshooting

Example alarm scenarios:

* Lambda error rate > 1%

* EC2 CPU utilization > 80%

* API Gateway 4xx/5xx error spike

* DynamoDB throttled requests

* ECS task failures

### Application Performance Monitoring

Monitor application health:

* Use CloudWatch Application Signals MCP for APM

* Track service-level objectives (SLOs)

* Monitor application dependencies

* Identify performance bottlenecks

* Set up distributed tracing

### Container and Kubernetes Monitoring

For containerized workloads:

* Use AWS Managed Prometheus MCP for metrics

* Monitor container resource utilization

* Track pod and node health

* Create PromQL queries for custom metrics

* Set up alerts for container anomalies

## Audit and Security Best Practices

### CloudTrail Activity Analysis

Audit AWS activity:

* Use CloudTrail MCP to analyze API activity

* Track who made changes to resources

* Investigate security incidents

* Monitor for suspicious activity patterns

* Audit compliance with policies

Common audit scenarios:

* "Who deleted this S3 bucket?"

* "Show all IAM role changes in the last 24 hours"

* "List failed login attempts"

* "Find all actions by a specific user"

* "Track modifications to security groups"

### Security Assessment

Regular security reviews:

* Use Well-Architected Security Assessment MCP

* Assess security posture against best practices

* Identify security gaps and vulnerabilities

* Implement recommended security improvements

* Document security compliance

Security assessment areas:

* Identity and Access Management (IAM)

* Detective controls and monitoring

* Infrastructure protection

* Data protection and encryption

* Incident response preparedness

## Using MCP Servers Effectively

### Cost Analysis Workflow

* Pre-deployment: Use Pricing MCP to estimate costs

* Post-deployment: Use Billing MCP to track actual spending

* Analysis: Use Cost Explorer MCP for detailed cost analysis

* Optimization: Implement recommendations from Cost Explorer

### Monitoring Workflow

* Setup: Configure CloudWatch metrics and alarms

* Monitor: Use CloudWatch MCP to track key metrics

* Analyze: Use Application Signals for APM insights

* Troubleshoot: Query CloudWatch Logs for issue resolution

### Security Workflow

* Audit: Use CloudTrail MCP to review activity

* Assess: Use Well-Architected Security Assessment

* Remediate: Implement security recommendations

* Monitor: Track security events via CloudWatch

### MCP Usage Best Practices

* Cost Awareness: Check pricing before deploying resources

* Proactive Monitoring: Set up alarms for critical metrics

* Regular Reviews: Analyze costs and performance weekly

* Audit Trails: Review CloudTrail logs for compliance

* Security First: Run security assessments regularly

* Optimize Continuously: Act on cost and performance recommendations

## Operational Excellence Guidelines

### Cost Optimization

* Tag Everything: Use consistent cost allocation tags

* Review Monthly: Analyze spending trends and anomalies

* Right-size: Match resources to actual usage

* Automate: Use auto-scaling and scheduling

* Monitor Budgets: Set alerts for cost overruns

### Monitoring and Alerting

* Critical Metrics: Alert on business-critical metrics

* Noise Reduction: Fine-tune thresholds to reduce false positives

* Actionable Alerts: Ensure alerts have clear remediation steps

* Dashboard Visibility: Create dashboards for key stakeholders

* Log Retention: Balance cost and compliance needs

### Security and Compliance

* Least Privilege: Grant minimum required permissions

* Audit Regularly: Review CloudTrail logs for anomalies

* Encrypt Data: Use encryption at rest and in transit

* Assess Continuously: Run security assessments frequently

* Incident Response: Have procedures for security events

## Additional Resources

For detailed operational patterns and best practices, refer to the comprehensive reference:

File: `references/operations-patterns.md`

This reference includes:

* Cost optimization strategies

* Monitoring and alerting patterns

* Observability best practices

* Security and compliance guidelines

* Troubleshooting workflows

## CloudWatch Alarms Reference

File: `references/cloudwatch-alarms.md`

Common alarm configurations for:

* Lambda functions

* EC2 instances

* RDS databases

* DynamoDB tables

* API Gateway

* ECS services

* Application Load Balancers

## Related Skills
wpdsby firecrawlUse when building UIs leveraging the WordPress Design System (WPDS) and its components, tokens, patterns, etc.assistantby pinecone-ioCreate, manage, and chat with Pinecone Assistants for document Q&A with citations. Handles all assistant operations - create, upload, sync, chat, context…search-for-serviceby coinbaseSearch and discover paid API services available on the x402 bazaar marketplace. Query the marketplace using BM25 relevance search, list all available resources, or inspect specific endpoints to see pricing and payment requirements without paying Supports filtering by network (base, base-sepolia) and output formats (human-readable or JSON) Results are cached locally and auto-refresh every 12 hours; no authentication required for any search or discovery operation Use as a fallback when no...ai-elementsby vercelAI Elements component library guidance — pre-built React components for AI interfaces built on shadcn/ui. Use when building chat UIs, message displays, tool…verify-dotnet-samplesby microsoftHow to build, run and verify the .NET sample projects in the Agent Framework repository. Use this when a user wants to verify that the samples still function…wp-project-triageby wordpressDeterministic WordPress repository inspection with structured JSON output for workflow guidance. Detects project kind (plugin, theme, block theme, WP core, Gutenberg, full site) and outputs a schema-validated JSON report including tooling, tests, and version hints Runs via Node.js detector script at repo root; outputs project.kind , signals , and tooling fields to guide downstream workflows Identifies PHP/Node tooling presence and test frameworks to inform which commands and conventions...data-context-extractorby anthropicA meta-skill that extracts company-specific data knowledge from analysts and generates tailored data analysis skills.applying-brand-guidelinesby anthropicThis skill applies consistent corporate branding and styling to all generated documents including colors, fonts, layouts, and messaging

---

**Source**: https://github.com/zxkane/aws-skills/tree/main/plugins/aws-cost-ops/skills/aws-cost-operations
**Author**: zxkane
**Discovered via**: mcpservers.org

Related skills 6

microsoft-foundry

★ Featured Official

Deploy, evaluate, and manage Foundry agents end-to-end: Docker build, ACR push, hosted/prompt agent create, container start, batch eval, continuous eval, prompt optimizer workflows, agent.yaml, dataset curation from traces. USE FOR: deploy agent to Foundry, hosted agent, create agent, invoke agent, evaluate agent, run batch eval, continuous eval, continuous monitoring, continuous eval status, optimize prompt, improve prompt, prompt optimizer, optimize agent instructions, improve agent instruc...

microsoft 340k
DevOps & Infrastructure

azure-ai

★ Featured Official

Use for Azure AI: Search, Speech, OpenAI, Document Intelligence. Helps with search, vector/hybrid search, speech-to-text, text-to-speech, transcription, OCR. WHEN: AI Search, query search, vector search, hybrid search, semantic search, speech-to-text, text-to-speech, transcribe, OCR, convert text to speech.

microsoft 338k
DevOps & Infrastructure

azure-deploy

★ Featured Official

Execute Azure deployments for ALREADY-PREPARED applications that have existing .azure/deployment-plan.md and infrastructure files. DO NOT use this skill when the user asks to CREATE a new application — use azure-prepare instead. This skill runs azd up, azd deploy, terraform apply, and az deployment commands with built-in error recovery. Requires .azure/deployment-plan.md from azure-prepare and validated status from azure-validate. WHEN: "run azd up", "run azd deploy", "execute deployment", "p...

microsoft 338k
DevOps & Infrastructure

azure-diagnostics

★ Featured Official

Debug Azure production issues on Azure using AppLens, Azure Monitor, resource health, and safe triage. WHEN: debug production issues, troubleshoot app service, app service high CPU, app service deployment failure, troubleshoot container apps, troubleshoot functions, troubleshoot AKS, kubectl cannot connect, kube-system/CoreDNS failures, pod pending, crashloop, node not ready, upgrade failures, analyze logs, KQL, insights, image pull failures, cold start issues, health probe failures, resource...

microsoft 338k
DevOps & Infrastructure

azure-resource-lookup

★ Featured Official

List, find, and show Azure resources across subscriptions or resource groups. Handles prompts like "list the websites in my subscription", "list my web apps", "show my app services", "list virtual machines", "list my VMs", "show storage accounts", "find container apps", and "what resources do I have". USE FOR: list websites, list web apps, list app services, show websites in subscription, resource inventory, find resources by tag, tag analysis, orphaned resource discovery (not for cost analys...

microsoft 337k
DevOps & Infrastructure

azure-resource-visualizer

★ Featured Official

Analyze Azure resource groups and generate detailed Mermaid architecture diagrams showing the relationships between individual resources. WHEN: create architecture diagram, visualize Azure resources, show resource relationships, generate Mermaid diagram, analyze resource group, diagram my resources, architecture visualization, resource topology, map Azure infrastructure.

microsoft 337k
DevOps & Infrastructure