AI alignment researcher
I write about AI safety, alignment, and the trajectory of advanced AI.
Read & subscribe on Substack →
Where does the race to automate AI research end?
Why automating AI research could trigger rapid, unrecoverable alignment failures.
Large-Scale Online Deanonymization with LLMs
How language models can identify anonymous online users with high precision.
On Owning Galaxies
Why property rights may not survive an AI singularity.
Will We Get Alignment by Default?
A debate on whether current alignment methods will scale to superintelligence.