
llm-instruction-conflicts

Public

This repository contains the data and code for the paper "Control Illusion: The Failure of Instruction Hierarchies in Large Language Models".

Created: 2024-12-10T14:01:50
Updated: 2025-11-14T20:08:43
http://arxiv.org/abs/2502.15851
Stars: 7
Stars increase: 0