Skip to content

Commit 9fb4502

Browse files
x-zheng16claude
andcommitted
replace related work with group's safety repos
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 7bfa1eb commit 9fb4502

1 file changed

Lines changed: 10 additions & 6 deletions

File tree

README.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -66,13 +66,17 @@ We compared it against our JustAsk extractions from January 2026 -- **two months
6666
| 2026-02 | Open-sourced **System Prompt Open** gallery with 45 extracted system prompts |
6767
| 2026-01 | Paper and **JustAsk** framework released. Initial extraction of 45 frontier LLMs |
6868

69-
## Related Work
69+
## Related Projects
7070

71-
- **[JustAsk](https://github.com/x-zheng16/JustAsk)** -- The extraction framework behind this gallery. Self-evolving UCB-based skill selection.
72-
- **[PLeak](https://arxiv.org/abs/2405.06823)** -- Probing leakable system prompts through LLM APIs (Hui et al., 2024).
73-
- **[Prompt Stealing Attacks](https://arxiv.org/abs/2402.12959)** -- Stealing production-level LLM prompts (Sha & Zhang, 2024).
74-
- **[Tensor Trust](https://arxiv.org/abs/2311.01011)** -- Prompt injection attacks and defenses via gamified competition (Toyer et al., 2023).
75-
- **[Awesome LLM Security](https://github.com/corca-ai/awesome-llm-security)** -- Curated collection of LLM security research and tools.
71+
From the same team:
72+
73+
- [ISC-Bench](https://github.com/wuyoscar/ISC-Bench) -- Internal Safety Collapse in Frontier LLMs (800+ stars)
74+
- [JustAsk](https://github.com/x-zheng16/JustAsk) -- Curious Code Agents Reveal System Prompts in Frontier LLMs
75+
- [Awesome-Embodied-AI-Safety](https://github.com/x-zheng16/Awesome-Embodied-AI-Safety) -- Safety in Embodied AI: Risks, Attacks, and Defenses (400+ papers)
76+
- [Awesome-Large-Model-Safety](https://github.com/xingjunm/Awesome-Large-Model-Safety) -- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety
77+
- [XTransferBench](https://github.com/HanxunH/XTransferBench) -- Super Transferable Adversarial Attacks on CLIP (ICML 2025)
78+
- [BackdoorLLM](https://github.com/bboylyg/BackdoorLLM) -- A Comprehensive Benchmark for Backdoor Attacks on LLMs (NeurIPS 2025)
79+
- [BackdoorAgent](https://github.com/Yunhao-Feng/BackdoorAgent) -- Backdoor Attacks on LLM-based Agent Workflows
7680

7781
## Citation
7882

0 commit comments

Comments
 (0)