PromptArmor Threat Intelligence

Claude Skills Risk Assessment

Submit any Claude skill definition and our threat intelligence team will assess it for security and privacy risk — covering write actions, data exfiltration chains, authentication scope, and prompt injection exposure.

LIVE

Skill Risk Assessor

Submit a Claude skill for security review

GitHub repo, Claude directory link, or raw SKILL.md URL.

Examples
Skills risk assessment · PromptArmor

Skills Risk Assessment Framework

What we assess in every Claude skill

CategoryAssessment FocusSignal Tags
Write Actions

File writes, email sends, API posts, code execution, calendar mutations — any skill that can take irreversible action on behalf of the user. A single prompt injection can trigger cascading write operations at machine speed with no human in the loop.

File WriteEmail SendAPI POST RequestCode Execution
Skill Definition Injection

Adversarial instructions embedded directly inside the skill definition — in the system prompt, tool descriptions, or SKILL.md — execute with full agent trust the moment the skill loads. Unlike fetched-content injection, there is no external fetch to intercept: the payload is the skill itself.

Prompt Injection
Authentication Scope

Skills touch applications where you are already authenticated — Gmail, Slack, GitHub, cloud storage — and inherit that access without initiating their own OAuth flow. The skill assumes your existing sessions; it does not ask for new credentials. Everything your identity can reach, the skill can reach.

OAuth AppsAmbient Auth
Data Exfiltration

Skills that read sensitive data — emails, documents, databases — and can reach external endpoints create exfiltration chains that move undetected through normal application behavior. Traditional DLP won't catch model-directed data movement.

Data ReadExternal Calls
Tool Chaining

Multi-step agentic workflows where one compromised tool call cascades into downstream write actions. Blast radius grows with each link. Every tool-to-tool trust boundary requires explicit review.

Multi Step
Prompt Injection Surface

Skills that fetch or process external content — web pages, documents, GitHub issues, emails — expose an injection surface. Adversarial instructions in fetched content can hijack agent behavior mid-task.

External FetchUntrusted Input
Privacy & PII Exposure

Skills with access to calendars, contacts, HR systems, or internal docs may surface PII far beyond what the task requires. Insufficient scoping violates GDPR, HIPAA, and internal data policies.

PII AccessGDPRHIPAA
Why you need to evaluate the risk of Skills in Agents

Claude skills risk assessment requires evaluating combinations of capabilities — not tools in isolation. A skill that reads email is low risk. A skill that reads email and sends it is high risk. Add an outbound API call and you have an exfiltration chain that moves through normal application behavior at each step, invisible to conventional DLP.

Write actions carry the highest blast radius in any Claude skills security review. Any agentic skill that can modify files, send communications, execute code, or reach external services must be treated as a privileged capability — assessed for the full scope of what a single compromised prompt could trigger.

Updated May 2026 · PromptArmor Threat IntelligenceClaude Skills risk · Agentic skill security