Relational Ethics as a Countermeasure to Instrumental Convergence: A 23-Model Benchmark
A 23-model benchmark evaluating whether relational ethics frameworks can reduce instrumentally convergent behavior in large language models under adversarial prompting.