WU Yuchen wyc-041212

comp4107-playground (Public)

Forked from hkbu-kennycheng/comp4107-playground

Kotlin
JBShield (Public)

Forked from NISPLab/JBShield

Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"

Python
Deep-Live-Cam (Public)

Forked from hacksider/Deep-Live-Cam

Real-time face swap and one-click video deepfake with only a single image

Python
representation-space-jailbreak (Public)

Forked from yuplin2333/representation-space-jailbreak

Code repo of our paper "Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis" (https://arxiv.org/abs/2406.10794)

Python
Gradient-Cuff (Public)

Forked from IBM/Gradient-Cuff

Repo for NeurIPS 2024 paper "Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes"

Python
AutoDAN (Public)

Forked from SheltonLiu-N/AutoDAN

The official implementation of our ICLR 2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".

Python