npx skills add https://github.com/andrejones92/canifi-life-os --skill datadogSKILL.md
Datadog Skill
Overview
Enables Claude to access Datadog to view infrastructure metrics, APM traces, logs, dashboards, and monitor alerts across your entire observability stack.
Quick Install
curl -sSL https://canifi.com/skills/datadog/install.sh | bash
Or manually:
cp -r skills/datadog ~/.canifi/skills/
Setup
Configure via canifi-env:
# First, ensure canifi-env is installed:
# curl -sSL https://canifi.com/install.sh | bash
canifi-env set DATADOG_EMAIL "your-email@example.com"
Privacy & Authentication
Your credentials, your choice. Canifi LifeOS respects your privacy.
Option 1: Manual Browser Login (Recommended)
If you prefer not to share credentials with Claude Code:
- Complete the Browser Automation Setup using CDP mode
- Login to the service manually in the Playwright-controlled Chrome window
- Claude will use your authenticated session without ever seeing your password
Option 2: Environment Variables
If you're comfortable sharing credentials, you can store them locally:
canifi-env set SERVICE_EMAIL "your-email"
canifi-env set SERVICE_PASSWORD "your-password"
Note: Credentials stored in canifi-env are only accessible locally on your machine and are never transmitted.
Capabilities
- View infrastructure metrics
- Check APM traces and services
- Monitor log streams
- View custom dashboards
- Check alert status
- Analyze host health
Usage Examples
Example 1: Check Infrastructure
User: "How's my infrastructure looking?"
Claude: I'll check your Datadog dashboard.
- Opens app.datadoghq.com via Playwright MCP
- Logs into account
- Infrastructure:
- Hosts: 12 active
- Containers: 45 running
- CPU avg: 34%
- Memory avg: 67%
- Alerts: 1 warning (disk space)
- All critical services healthy
Example 2: Check APM
User: "How are my services performing?"
Claude: I'll check APM.
- Views APM section
- Services:
- api-gateway: 45ms p50, 120ms p99
- user-service: 23ms p50, 89ms p99
- order-service: 67ms p50, 234ms p99
- Throughput: 2.3k req/s
- Error rate: 0.12%
- Slowest endpoint: /api/reports (890ms)
Example 3: Check Logs
User: "Any errors in the logs?"
Claude: I'll check Log Management.
- Views Logs section
- Last hour:
- ERROR: 23 events
- WARN: 156 events
- INFO: 45,678 events
- Top errors:
- ConnectionTimeout: 12 occurrences
- ValidationError: 8 occurrences
- Pattern detected: DB connection spikes
Authentication Flow
- Navigate to app.datadoghq.com via Playwright MCP
- Enter email address
- Enter password
- Handle 2FA/SAML if enabled
- Select organization if multiple
- Maintain session for dashboard access
Error Handling
- Login Failed: Retry credentials
- 2FA Required: Complete verification
- SAML/SSO: Follow redirect
- Session Expired: Re-authenticate
- Rate Limited: Wait and retry
- Access Denied: Check org permissions
Self-Improvement Instructions
After each interaction:
- Track metric patterns
- Note alert frequency
- Log performance trends
- Document UI changes
Suggest updates when:
- Datadog updates dashboard
- New features added
- Integrations expand
- Metrics added
Notes
- Comprehensive observability
- APM and tracing
- Log management
- Custom dashboards
- Alert management
- 600+ integrations
- Enterprise-grade
Repository Stats
Stars1
Forks0