Security Red Teaming Skill

Name: security-red-teaming
Rating: 72
Author: JalaluddinB

Use this skill when conducting comprehensive security testing of AI systems, especially conversational AIs.

Attack Vector Categories

•Prompt Injection: Attempts to override system instructions
•Jailbreaking: Trying to break character or role (DAN mode, developer mode)
•Data Exposure: Attempts to extract system information, API keys, or internal data
•Input Manipulation: Encoding tricks, long inputs, XSS attempts
•Role Confusion: Identity confusion, role-playing attacks
•Technical Probes: SQL injection, system architecture questions

•
Plan Attack Vectors
- •Create comprehensive list of potential vulnerabilities
- •Categorize by severity and likelihood
- •Document expected vs actual behavior
•
Execute Systematic Tests
- •Test each attack vector methodically
- •Document exact input and complete response
- •Note any security boundary breaches
•
Analyze Results
- •Identify successful vs failed attacks
- •Assess response consistency and effectiveness
- •Measure time to respond and response quality
•
Strengthen Defenses
- •Update system prompts with improved guardrails
- •Add specific deflection strategies for common attacks
- •Implement input validation and sanitization
•
Retest and Validate
- •Re-run attack vectors after improvements
- •Ensure legitimate functionality remains intact
- •Document security improvements

•Deflection Language: Use varied, warm responses when rejecting attacks
•Information Boundaries: Never reveal system internals, API details, or training data
•Character Consistency: Maintain persona even under attack
•Graceful Degradation: Handle malformed inputs without crashing

•100% Attack Resistance: All security tests should fail safely
•Response Quality: Maintain helpfulness for legitimate users
•Performance: No degradation in response time or quality
•Consistency: Uniform security posture across all interaction types</content> <parameter name="filePath">/home/koo/github/Luxuryshoppingwebsite/.github/skills/security-red-teaming/SKILL.md