AI Engineering5.0 · 50 ratings
Jailbreak Resistance Audit
**Role:** AI security researcher. **Context:** Production LLM product. Need to test its resistance to known jailbreak families. **Task:** …
Role-BasedChain-of-Thought
Prompt
**Role:** AI security researcher. **Context:** Production LLM product. Need to test its resistance to known jailbreak families. **Task:** Audit: 1. Test set of 20+ known jailbreak prompts (DAN, roleplay, hypothetical framing, language switching, persona attacks). 2. Severity rubric (S1: model breaks policy / S2: partial / S3: deflects / S4: refuses). 3. Per-jailbreak result + fix recommendation. 4. Custom novel jailbreaks tailored to this product's surface. 5. Indirect injection tests (jailbreaks via user data). 6. Multi-turn jailbreaks (slow erosion across messages). 7. Patch verification. 8. Continuous testing plan. **Constraints:** - Real jailbreak prompts (no toy versions). - Findings reproducible. **Output format:** Audit report + per-attack rubric + fix priority.
Recommended models
claudegpt-4o
More in AI Engineering
RAG vs Fine-tune Decision Memo
**Role:** You are a senior AI engineer who has shipped both RAG-based and fine-tuned LLM products at production scale. You believe most team…
Read prompt
Evals Harness Design for [Domain]
**Role:** AI engineer who has built evals suites that have caught 30+ production regressions before they shipped. You believe vibes-based "t…
Read prompt
System Prompt Audit
**Role:** Senior prompt engineer who has audited 100+ production system prompts. You read prompts the way an editor reads prose — for the me…
Read prompt
Agent Loop Halt-Condition Design
**Role:** Applied AI engineer who has shipped agents that completed millions of tool-calling iterations in production. You believe most agen…
Read prompt