Great post Justin 👍 Fyi your second link is linking back to this post.
Great post Justin 👍 Fyi your second link is linking back to this post.
Thanks Yanni, just fixed the link :)
tl;dr: structural risks are hard, but they might end up being the most important.
Preamble:
Apart from raising an issue of interest, the purpose of posting this is to help test my aptitude for AI governance, which is one of about ~3 options I’m considering at the moment.
Mostly it's a distillation of recent thoughts and longer pieces I’ve previously written on similar topics. I’m also more than happy to provide citations for any statements of interest - my main priority is to just get this posted for feedback.
Background:
Structural risks are those where negative outcomes arise from an accumulation of events, such that no individual actor or action can be specifically blamed. A classic example is the set of structural risks associated with crude oil (petroleum), which has caused wide-ranging impacts, including sea-level rise, belligerent petrostates, microplastics, and urban air pollution.
It’s arguable that the benefits of petroleum will end up being an overall net gain for humanity’s progress. However, this has been a story of dumb-luck and difficult trade-offs, with geopolitical competition and gold-rush dynamics driving most of the narrative.
Why are structural risks important for AI safety?
Regardless of whether advanced AI is designed safely or used responsibly, there’s every indication that its emergence will catapult humanity into an unprecedented period of uncertainty and upheaval. It seems prudent to assume this will be associated with unforeseen structural risks, and that these may hold catastrophic consequences.
It’s also important to consider two relevant trends:
Note: above examples weren't cherry-picked, they're the first four that I tried. The smallest increase was Microsoft at 60%, followed by Google (), Baidu (), and Tencent (), from 2010 to 2022. Data source: macrotrends.net
Why is this neglected / important right now?
There have been some promising recent developments in AI Governance for addressing misalignment/misuse; it’s fair to say that many (including myself) underestimated how much progress is possible with a certain degree of accordance between national security and private sector interests.
As others have argued, this may have changed the scope of what "neglected" work looks like in this area. This post in particular caught my attention, arguing for a more nuanced approach to AI safety, with an increased focus on the incentives of different actors, and consideration for long-term issues such as value drift and evolutionary pressures.
Below I've presented some arguments as to why structural risks fit into this narrative as something we should now be focusing on more:
Argument 1: Earlier the better
Unlike risks with more proximal causes, the only way to mitigate structural risks is to understand them well in advance and take preemptive steps to steer key actors towards less hazardous paths.
Argument 2: Consistently receives less attention
The dominance of misalignment/misuse in AI safety/governance discussions has been noted as a prevailing issue since at least 2019. There are a few reasons to think this will persist:
Argument 3: Potentially more representative of long-term risks
There's still a chance that misalignment and misuse are superficially attractive problems which fail to capture the true risks humanity is facing. It should be of concern to us that they’re among the first risks that anyone would be expected to think of, and it seems naive to assume that transformative AI won’t also entail vastly more complicated risks.
Leading thinkers also appear to have a poor track record when grappling with systemic risks. Going back to the example of fossil fuels, the only two prescient predictions I’ve come across with regard to its impacts still seem to have been quite misguided:
Personally, I’m always wary about drawing strong conclusions from past case-studies of poor predictions. However, these helped me frame how we might be wrong about predicting risks from advanced AI; both Jevons and Arrhenius had superficial, short-sighted views of this pivotal trend, and there’s a sense in which they almost couldn’t have predicted the magnitude of the topic they were approaching.
With respect to AI safety, I worry that Jevons’ concerns over Britain running out of coal are similar to our concerns over misalignment/misuse. Given the data ["Humans are using large quantities of a finite resource" or "AI is becoming more capable than humans"], the natural next step is to think ["Humans will run out of that resource" or “AI will take over the humans”] . . . (I have more to say about this in the footnotes*)
As we now know, the reality of the situation has been far more complicated than the scenario forecasted in The Coal Question. Because Jevons assumed that we’d stumble at the first hurdle by simply running out of coal, he wasn’t able to extrapolate his insights to explore scenarios where humanity continued its carbon-fueled growth trajectory. It’s possible that this expanded analysis would have uncovered indications of structural issues like climate change or petrostates.
Today’s focus on misuse/misalignment could be making a similar mistake - assuming that humanity will fall victim to the intuitively conceived "first hurdles", while lacking the foresight to conceive of risks beyond that point.
What can we do?
A promising approach is to emulate how the environmental science & climate adaptation community currently conducts itself - using scientific data to develop projections of causal factors, and then investigating potential vulnerabilities, impacts and interventions on this basis.
The first step towards this would be to develop a series of metrics which may be predictive of structural risks from AI; as a starting point, I’d propose to use capabilities that are likely to be both transformative and problematic. The following are three capabilities that I identified:
These three capabilities are associated with significant economic and strategic incentives, and due to their general-purpose nature, lack any obvious governance mechanisms. Monitoring the development and proliferation of these capabilities would support ongoing research into understanding their near and long-term impacts.
Below are some hypothetical examples of what this work might look like:
These examples are purely for illustrative purposes; economics and international relations is not my area of expertise, and I’m sure legitimate experts can propose far better examples.
Potential Limitations / Failure-modes:
Conclusion: the argument in favour of considering structural risks isn’t new, but I think it’s a crucially important one now that the field of AI safety and governance is gaining more traction.
As a specific call to action: the Future of Life Foundation is aiming to launch multiple organisations in the near future - I could see benefits from one of these organisations focusing of structural risks, and perhaps following the approach outlined above.
*Not only are these concerns (coal shortage, misaligned AI) conceptually simple, they also play into our instincts regarding scarcity and security - similar to Jevons projecting the insecurities of the British empire onto the problem of fossil fuel reliance, there's a chance we're simply projecting our insecurities onto the field of AI safety.
On a related note, I also find it unnerving that we have so many mythological tales containing genies, Gods and other all-powerful figures which have eery similarities to the alignment problem. Many proponents seem to view this as an inherent strength of the argument, but I treat it as a distinct weakness given that the alignment problem is still a hypothesis contained in our collective imagination.
Thanks for sharing your thoughts! I think you are onto an interesting angle here that could be worthwhile exploring if you are so inclined.
One interesting line of work that you do not seem to be considering at the moment but could be interesting is the work done in the "metacrisis" (or polycrisis) space. See this presentation for an overview but I recommend diving deeper than this to get a better sense of the space. What this perspective is interested in is trying to understand and address the underlying patterns, which create the wicked situation we find ourselves in. They work a lot with concepts like "Moloch" (i.e., multi-polar traps in coordination games), the risk accelerating role of AI or different types of civilizational failure modes (e.g., dystopia vs. catastrophe) we should guard against.
Interesting for you may also be a working paper that I am working on with ALLFED, where we are looking at the digital transformation as a driver of systemic catastrophic risks. We do this based on a simulation model in specific scenarios but then generalize a framework where we suggest that the key features that make digital systems valuable also make them an inherent driver of what we called "the risk of digital fragility". Our work does not yet elaborate on the role of AI but only the pervasive use of digital systems and services in general. My next steps are to work out the role of AI more clearly and see if/how our digital fragility framework can be put to use to better understand how AI could be contributing to systemic catastrophic risks. You can reach out via PM if you are interested to have a chat about this.