Popular repositories
-
comp4107-playground
Public, forked from hkbu-kennycheng/comp4107-playground
Kotlin
-
JBShield
Public, forked from NISPLab/JBShield
Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"
Python
-
Deep-Live-Cam
Public, forked from hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
Python
-
representation-space-jailbreak
Public, forked from yuplin2333/representation-space-jailbreak
Code repo for our paper "Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis" (https://arxiv.org/abs/2406.10794)
Python
-
Gradient-Cuff
Public, forked from IBM/Gradient-Cuff
Repo for NeurIPS 2024 paper "Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes"
Python
-
AutoDAN
Public, forked from SheltonLiu-N/AutoDAN
[ICLR 2024] The official implementation of our ICLR 2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Python
