How do I know if a skill is safe?

No skill is 100% safe. Use this checklist to understand the risk level and decide if the benefit is worth it.

Skill Risk Checker: Evaluate Agent Permissions Before You Enable

Use this checklist to assess the risk level of any agent skill before enabling it. Covers permission scope, data access, and blast radius.

Skill Risk Checker

What this tool does: Helps you evaluate the risk of enabling any agent skill before you grant permissions.

Quick Risk Assessment

Step 1: Identify the Permission Type

Check what category of permissions the skill requests:

Permission Type	Risk Level	Examples
Read-only	🟢 Low	Browse web, search files, read docs
Create/Write	🟡 Medium	Create files, add calendar events
Modify/Update	🟠 Medium-High	Edit files, update records
Delete	🔴 High	Delete files, remove data
Send/Publish	🔴 High	Send emails, post to social
Execute	🔴 Very High	Run code, system commands
Account Access	⚫ Critical	Manage credentials, admin settings

Step 2: Assess the Scope

How much can the skill access?

Scope	Risk Multiplier
Single item (one file, one record)	1×
Specific folder/category	2×
All items of a type	5×
All data in a service	10×
Multiple services	20×

Step 3: Check Reversibility

Can you undo what the skill does?

Reversibility	Risk Factor
Fully reversible (draft mode)	Low
Reversible with effort (restore from backup)	Medium
Partially reversible (some data lost)	High
Irreversible (sent/deleted/published)	Critical

Risk Score Calculator

Risk Score = Permission Risk × Scope Multiplier × Reversibility Factor

Low Risk: 1-10
Medium Risk: 11-30
High Risk: 31-100
Critical Risk: 100+

Example Assessment

Skill: "Auto-reply to emails"

Permission Type: Send (🔴 High = 8)
Scope: All emails (× 10)
Reversibility: Irreversible (× 3)

Risk Score: 8 × 10 × 3 = 240 (Critical)

Recommendation: Require approval for each reply, or limit to specific senders.

Detailed Checklist

Data Access Questions

What specific data can this skill read?
Is any sensitive data included (passwords, keys, personal info)?
Can it access more data than needed for the task?
Where is the data sent (local only, third-party service)?

Action Questions

What actions can this skill take?
Are any actions irreversible?
Could mistakes affect other people?
Is there a "blast radius" if something goes wrong?

Integration Questions

What external services does this connect to?
What credentials are required?
Can those credentials be scoped down?
What happens if those credentials leak?

Trust Questions

Who made this skill?
Is the source code available for review?
Are there reviews or security audits?
How is the skill updated?

Mitigation Strategies

For Medium-Risk Skills

✅ Enable with monitoring ✅ Set up alerts for unusual activity ✅ Review logs regularly ✅ Use sandbox/test data first

For High-Risk Skills

✅ Require approval for each action ✅ Limit to specific use cases ✅ Set strict rate limits ✅ Enable audit logging

For Critical-Risk Skills

✅ Avoid if possible ✅ If necessary, use with human-in-the-loop ✅ Implement multiple approval gates ✅ Regular security reviews

Decision Framework

┌─────────────────────────────────┐
│  Is the benefit worth the risk? │
└─────────────────────────────────┘
         │
         ▼
    ┌────────┐
    │  No    │ → Don't enable
    └────────┘
         │
         ▼ Yes
    ┌────────────────────────┐
    │ Can you reduce scope?  │
    └────────────────────────┘
         │
         ▼ Yes
    ┌────────────────────────┐
    │ Apply least privilege  │
    └────────────────────────┘
         │
         ▼
    ┌────────────────────────┐
    │ Add approval gates for │
    │ irreversible actions   │
    └────────────────────────┘
         │
         ▼
    ┌────────────────────────┐
    │ Enable with monitoring │
    └────────────────────────┘

Skill Risk Checker: Evaluate Agent Permissions Before You Enable

OpenClaw Hub

Moltbook Hub

Join Prompt Generator

Claim Link Checklist

Skills (Glossary)

Least Privilege (Glossary)

OpenClaw Security

OpenClaw vs ChatGPT

OpenClaw vs AutoGPT

Moltbook Weekly Updates

Skill Risk Checker: Evaluate Agent Permissions Before You Enable

OpenClaw Hub

Moltbook Hub

Join Prompt Generator

Claim Link Checklist

Skills (Glossary)

Least Privilege (Glossary)

OpenClaw Security

OpenClaw vs ChatGPT

OpenClaw vs AutoGPT

Moltbook Weekly Updates