/solve - Full Solve Workflow
Autonomously attempt to solve a CTF challenge through the complete workflow.
Usage
code
/solve <challenge_path_or_url> /solve <challenge_directory>
Instructions
When the user invokes /solve, execute the full CTF solving workflow:
Phase 1: RECON
- •Identify all challenge files
- •Read any README or description
- •Determine challenge category
- •Note flag format if specified
Phase 2: ANALYSIS
Based on category, perform deep analysis:
For PWN:
- •Run checksec
- •Identify vulnerability class
- •Find target functions/addresses
- •Map out exploitation path
For Crypto:
- •Identify cipher/algorithm
- •Find weaknesses
- •Gather parameters (n, e, c for RSA, etc.)
For Web:
- •Map attack surface
- •Identify injection points
- •Check authentication/session handling
For Forensics:
- •Extract hidden data
- •Analyze file structure
- •Check for steganography
For RE:
- •Static analysis of key functions
- •Identify algorithm/logic
- •Find key validation routine
Phase 3: EXPLOIT
- •
Select appropriate agent if complex:
- •
pwn-expertfor binary exploitation - •
crypto-solverfor cryptography - •
web-hackerfor web challenges - •
forensics-analystfor forensics - •
reverse-engineerfor reverse engineering
- •
- •
Generate exploit using Python tools in
tools/ - •
Create and test exploit script
Phase 4: VERIFY
- •Execute exploit
- •Extract flag
- •Validate flag format
- •Report success or iterate
Phase 5: ITERATE (if needed)
If initial approach fails:
- •Re-analyze with new information
- •Try alternative techniques
- •Adjust exploit parameters
- •Maximum 3 iteration attempts
Phase 6: WRITEUP
On success, offer to generate writeup:
code
/writeup <challenge_name>
Autonomous Decision Making
During /solve, make intelligent decisions:
- •Tool Selection: Choose appropriate tools based on challenge type
- •Agent Delegation: Use specialized agents for complex sub-tasks
- •Error Recovery: Adapt approach based on failures
- •Resource Management: Don't spend excessive time on dead ends
Output Format
Provide status updates at each phase:
code
## Solving: <challenge_name>
### Phase 1: RECON ✓
- Found: binary, source code
- Category: PWN
- Flag format: flag{...}
### Phase 2: ANALYSIS ✓
- Vulnerability: Buffer overflow in read_input()
- Protection: NX enabled, no PIE
- Approach: ret2libc
### Phase 3: EXPLOIT
- Generating exploit...
- Testing locally...
- [status updates]
### Phase 4: VERIFY
- Flag captured: flag{...}
### Result: SUCCESS ✓
Failure Handling
If solve fails after attempts:
- •Document what was tried
- •Explain blockers encountered
- •Suggest manual investigation points
- •Offer partial solution/analysis