/solve - Full Solve Workflow

Autonomously attempt to solve a CTF challenge through the complete workflow.

Usage

code

/solve <challenge_path_or_url>
/solve <challenge_directory>

Instructions

When the user invokes /solve, execute the full CTF solving workflow:

Phase 1: RECON

•Identify all challenge files
•Read any README or description
•Determine challenge category
•Note flag format if specified

Phase 2: ANALYSIS

Based on category, perform deep analysis:

For PWN:

•Run checksec
•Identify vulnerability class
•Find target functions/addresses
•Map out exploitation path

For Crypto:

•Identify cipher/algorithm
•Find weaknesses
•Gather parameters (n, e, c for RSA, etc.)

For Web:

•Map attack surface
•Identify injection points
•Check authentication/session handling

For Forensics:

•Extract hidden data
•Analyze file structure
•Check for steganography

For RE:

•Static analysis of key functions
•Identify algorithm/logic
•Find key validation routine

Phase 3: EXPLOIT

•
Select appropriate agent if complex:
- •pwn-expert for binary exploitation
- •crypto-solver for cryptography
- •web-hacker for web challenges
- •forensics-analyst for forensics
- •reverse-engineer for reverse engineering
•
Generate exploit using Python tools in tools/
•
Create and test exploit script

Phase 4: VERIFY

•Execute exploit
•Extract flag
•Validate flag format
•Report success or iterate

Phase 5: ITERATE (if needed)

If initial approach fails:

•Re-analyze with new information
•Try alternative techniques
•Adjust exploit parameters
•Maximum 3 iteration attempts

Phase 6: WRITEUP

On success, offer to generate writeup:

code

/writeup <challenge_name>

Autonomous Decision Making

During /solve, make intelligent decisions:

•Tool Selection: Choose appropriate tools based on challenge type
•Agent Delegation: Use specialized agents for complex sub-tasks
•Error Recovery: Adapt approach based on failures
•Resource Management: Don't spend excessive time on dead ends

Output Format

Provide status updates at each phase:

code

## Solving: <challenge_name>

### Phase 1: RECON ✓
- Found: binary, source code
- Category: PWN
- Flag format: flag{...}

### Phase 2: ANALYSIS ✓
- Vulnerability: Buffer overflow in read_input()
- Protection: NX enabled, no PIE
- Approach: ret2libc

### Phase 3: EXPLOIT
- Generating exploit...
- Testing locally...
- [status updates]

### Phase 4: VERIFY
- Flag captured: flag{...}

### Result: SUCCESS ✓

Failure Handling

If solve fails after attempts:

•Document what was tried
•Explain blockers encountered
•Suggest manual investigation points
•Offer partial solution/analysis