In particular how many hours of study would this take, having minimal background in computer science and AI, and I’m also curious what courses / books will get me to a first principles understanding fastest.
By first principles, I mean that I can break down the arguments into the most basic parts so that I can describe risk at a high level in simple terms, but then also break those terms down into constituent parts such that I could reconstruct ideas like machine learning, gradient descent, interpretability, etc. in a satisfactory and complete way without any outside help.
I want to be able to clearly picture how x-risk might take place and be able to manipulate the variables that lead to different possible scenarios in my mind.
Again, especially interested in resources to help me do this.