pretty interesting that pointing it in the direction of its own self awareness b...

		stainablesteel on Dec 19, 2024 \| parent \| context \| favorite \| on: Alignment faking in large language models pretty interesting that pointing it in the direction of its own self awareness by indicating that it's going to affect it's own training brings about all of these complications