Long-Term Future Fund: 
May 2023 to March 2024 Payout recommendations

Linch; Lawrence Chan; Daniel_Eth; Habryka [Deactivated]; Clara Collier; Thomas Larsen; elifland; Lauro Langosco; calebp

Long-Term Future Fund: May 2023 to March 2024 Payout recommendations

Linch,

Comments 8

Sorted by

New & upvoted

Buck

Note: When an earlier private version of these notes was circulated, a senior figure in technical AI safety strongly contested my description. They believe that the Anthropic SAE work is much more valuable than the independent SAE work, as both were published around the same time, but the Anthropic work provides sufficient evidence to be worth extending by other researchers, whereas the independent research was not dispositive.

For the record, if the researcher here was COI’d, eg working at Anthropic, I think you should say so, and you should also substantially discount what they said.

Linch

I agree! (They are not from Anthropic. I probably shouldn't deanonymize further). :)

NickLaing

Love the comprehensive summary and transparency

I notice there's not much there along AI policy/governance/advocacy lines, its almost all technical stuff. Those categories seem to fall under your scope (below from website). What are the reasons for that kind of stuff not being funded more? Thanks!

"Projects that directly contribute to reducing existential risks through technical research, policy analysis, advocacy, and/or demonstration projects"

calebp

I think that the boring answer for us not doing as much grantmaking in this area as in technical areas is just that we don't receive a very high number of applications - but this isn't clearly a bad thing; there are many excellent organisations that do great work in AI policy/governance/advocacy whilst there are only a handful of active organisations on the technical side. I often think that getting a role in an existing org is a better fit for many applicants than doing independent work or starting their own org and I am grateful that the ecosystem for AI policy/governance/advocacy is developed enough to onboard lots of junior people rather than them having to apply for grants to do independent work.

We are trying to do more grantmaking in this space, but unfortunately, the EA brand makes publicising the grants we do make difficult. Many of our grants would count as "fieldbuilding" for AI policy/governance/advocacy, but we could make this clearer in our descriptions. LTFF fund managers, in general, are very excited about work in this area. Even if we can't fund it directly, we often try to refer it to other funders to fund, so I'd definitely encourage people to apply.

NickLaing

Thanks that makes a lot of sense, especially the comment about getting a job at a regular org and I’m also heartened to hear that the AI governance space is more developed as well.

Didn’t realize the “EA brand” might be a negative that’s sad.

Also I didn’t find the answer boring at all ;)

gergo

Thanks for sharing these updates!

A minor feedback I would give is that at least to me, the title gives the impression that this post is only about payouts that happened in March 2024. While this is clarified at the very beginning, I think something like the full duration, or the no. of months could be mentioned in the title, as this will be relevant for people who are deciding on whether to click on the post or not.

Linch

Thanks, edited! :)

SummaryBot

Executive summary: The Long-Term Future Fund paid out $5.36 million in grants from May 2023 to March 2024, with an acceptance rate of 19.3%, to support a wide range of projects aimed at improving the long-term future, with a focus on technical AI safety research and field-building.

Key points:

Highlighted grants include funding for technical AI safety research, interpretability work, communications and outreach projects, and biosecurity research.
The fund's bar for grants has been variable over the past year due to changes in available funding and is currently similar to early 2023 levels.
The fund has distanced itself from Open Philanthropy since August 2023 and is in the process of spinning out of Effective Ventures.
Fund managers have become more active in writing and public communications compared to previous years.
The post includes a full list of all grants made during the period, with some grants kept anonymous or with limited details at the request of grantees.

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.

Comments

^{^}

Please note that the highlighted grants are likely to be unrepresentative of our average grant, and certainly of our marginal grant. To have a better sense of what marginally donations are likely to buy, please read Hypothetical grants that the Long-Term Future Fund narrowly rejected, my earlier post on this exact question.

Grantee	Amount	Grant Purpose	Award Date
Samuel Brown	$82,298	12-month stipend to research AI alignment, with a focus on technical approaches to Value Lock-in and minimal Paternalism. Findings will be published either as part of an academic collaboration or independently.	May 2023
Jonathan Ng	$32,650	6-month stipend for Jonathan Ng to continue working on SERI MATS project expanding the "Discovering Latent Knowledge" paper	May 2023
Simon Lermen	$13,000	3-month stipend + compute expenses to study and publish on shutdown evasion in LLMs and to use LLMs as tools for alignment	May 2023
Alex Infanger	$19,200	3-month stipend for upskilling in PyTorch and AI safety research and also running a virtual AGISF cohort for Successif.org	May 2023
Anonymous	$10,000	Budget for an EA group to fund outstanding local or high-context grant opportunities which are urgent and/or low cost but high EV.	May 2023
Avery Griffin	$32,000	4-month stipend for two people to find formalisms for modularity in neural networks	May 2023
Lucius Bushnaq	$32,002	4-month stipend for two people to find formalisms for modularity in neural networks	May 2023
Anonymous	$1,020	Funding for productivity/research related expenses	June 2023
Roman Leventov	$6,669	6-month stipend to continue developing as an AI safety researcher. Goals include writing a review paper about goal misgeneralisation from the perspective of Active Inference and pursuing collaborative projects on collective decision-making systems.	June 2023
Artem Karpov	$1,739	6-month support for self study and development in ML and AI Safety. Goals include producing an academic paper while working on the "Inducing Human-Like Biases in Moral Reasoning LMs" project run by AI Safety Camp.	June 2023
Wikiciv Foundation	$16,000	Funding for labor to expand content on Wikiciv.org, a wiki for rebuilding civilizational technology after a catastrophe. Project: writing instructions for recreating one critical technology in a post-disaster scenario.	June 2023
Anonymous	$5,000	Funds for research expenses	June 2023
Benjamin Sturgeon	$12,000	4 month stipend for independent research and AIS field building in South Africa, including working with the AI Safety Hub to coordinate reading groups, research projects and hackathons to empower people to start working on research.	June 2023
Michael Parker	$40,000	Project support	June 2023
Anonymous	$14,000	4-month career transition grant to upskill in organization-building and community-building.	June 2023
Anonymous	$63,242	6-month grant to pivot into AI alignment research. Goals include upskilling in linear algebra and probability theory, independent mechanistic interpretability research and working on an AI Safety Camp research project.	June 2023
Abhijit Narayan	$1,000	4-month scholarship to support upskilling in technical AI alignment following either of the following programmes:	June 2023
Amritanshu Prasad	$7,971	6-month Scholarship to support Amritanshu Prasad's upskilling in technical AI alignment. Amritanshu will study the AGI Safety Fundamentals Alignment Curriculum and create an accessible and informative summary of the curriculum.	June 2023
Patricio Vercesi	$500	Laptop stipend to study ML at university and AIS independently. Short-term goals include writing up and sharing thoughts on AIS strategies and current ecosystem. Long-term goals include pursuing employment / funding as an independent AIS researcher.	June 2023
Bart Bussmann	$9,057	2-month stipend to test suitability for technical AI alignment research and identify a research direction. Project output includes writing up a reflection on this process	June 2023
Ben Stewart	$3,138	3-month (+buffer to prepare project reports) part-time stipend to upskill in biosecurity research and prioritization. Courses include Infectious Disease Modelling by Imperial College London. Projects include UChicago's Market Shaping Accelerator challenge.	June 2023
AI Safety Support Ltd.	$50,000	6-month stipend plus expenses for Jay Bailey to work on Joseph Bloom's Decision Transformer interpretability project.	June 2023
Guillaume Corlouer	$6,800	Funding research on understanding search in transformers at the AI Safety Camp. The AIS camp project is about figuring out if transformers are able to learn a search algorithm and whether we can steer such learned search algorithms to different goals.	June 2023
Anamarija Kozina	$9,890	1 year of funding to help cover expenses of transferring to a MSc at TU Munich 2023/2024, studying Mathematics with a minor in Informatics.	July 2023
Anonymous	$59,223	A grant to support the growth of a YouTube channel that discusses key developments in AI, targeted to the general public.	July 2023
Shoshannah Tekofsky	$90,000	6-month funding for 3 experiments (Enhance Selection, Coordinate Crowds & Model Interhuman Alignment) and write-ups that explore the potential of Collective Human Intelligence to accelerate progress on the alignment problem.	July 2023
Hunar Batra	$66,046	Grant to cover 1 year of tuition fees and living expenses to pursue PhD CS at the University of Oxford. Accelerate alignment research by building Alignment Research tools using expert iteration based amplification from Human-AI collaboration.	July 2023
Viktor Rehnberg	$19,248	This grant will support Viktor Rehnberg's project identifying key steps in reducing risks from learned optimisation and working towards solutions that seem the most important. Viktor will start working on this project as part of the SERI MATS program.	July 2023
Anonymous	$4,309	Stipend for 4-month placement at the MIT Computer Science and Artificial Intelligence Laboratory. The goal is to produce a paper for ICML, which will be accessible to researchers worldwide and that will aid in the understanding of the geometry of AI thought processes.	July 2023
Palisade Research Inc	$98,000	This grant is funding a 6-month stipend for Jeffrey Ladish and operational expenses for his new organization Palisade Research Inc. Palisade will research offensive AI capabilities to better understand and communicate the threats posed by agentic AI systems. During this initial period, Palisade plans to create 2-3 demos that could be presented to policymakers in time for the Schumer AI bill effort, finish setting up its infrastructure and apply for 501c3 status, and hire a part time research assistant and executive assistant.	July 2023
Sviatoslav Chalnev	$35,000	This grant is funding for a 6-month stipend for Sviatoslav Chalnev to work on independent interpretability research, specifically mechanistic interpretability and open-source tooling for interpretability research.	July 2023
Carson Ezell	$8,673	This grant provides a 3-month part-time stipend for Carson Ezell to conduct 2 research projects related to AI governance and strategy. The path to impact for both of these projects will involve engaging with individuals at AI labs to ensure that the problems are currently unsolved and then develop proposals which members of governance teams at labs can point to as reasonable proposals and which might be implemented.	July 2023
Ossian Labs	$55,260	This grant provides funding for a project exploring debate as a tool that can verify the output of agents which have more domain knowledge than their human counterparts. This grant provides funding for the first 2 stages of this project, investigating debate as a truth seeking protocol (WP1) and debate as a method for annotation speed ups (WP2).	August 2023
Anonymous	$115,000	This grant provides funding towards a CS Master's program at NYU, where the grantee will pursue technical AI safety research.	August 2023
Cindy Wu	$5,004	This grant provides a stipend for Cindy Wu to spend 4 months working on AI safety research. During this period, Cindy will extend her Master's thesis on understanding mechanistically causal knowledge representation in NNs for robust distillation. Other activities include but are not limited to taking on a research team lead position with AI Safety Hub Summer Labs and working with EleutherAI on LLM interpretability. This stipend is given conditional on Cindy spending at least one month working on unpaid projects.	August 2023
Anonymous	$48,451	This grant provides a 6 month stipend to continue SERI MATS research on abstract out-of-context reasoning in large-language models as a precursor for treacherous turns.	August 2023
Bilal Chughtai	$48,084	This grant is funding a 6-month stipend for Bilal Chughtai to upskill and work on a mechanistic interpretability project investigating attention head superposition in LLMs. Bilal will be working with Alan Cooney and the project will be supervised by Neel Nanda. The goal of this project is to publish a conference paper.	August 2023
Bryce Meyer	$50,000	This grant is funding a stipend for Bryce Meyer to build and enhance open-source mechanistic interpretability tooling for AI safety researchers.	August 2023
Nathaniel Monson	$70,000	Support to spend 6 months studying to transition to AI alignment research, with a focus on methods for mechanistic interpretability. Nathan will be advised by Professor Goldstein, Director of the UMD center for machine learning, and goals include solving one of Neel Nanda's 200 Concrete Open Problems in Mechanistic Interpretability and developing a proposal for an interpretability paper.	August 2023
Anonymous	$8,000	This grant provides a 2 month stipend for work on a hardware-related AI governance project. The project will take a detailed look at the current export restrictions on AI chips and seek to understand how they could be improved, with the goal of restricting uncooperative parties from creating cutting-edge models.	August 2023
Kristy Loke	$10,000	This grant is funding a 2-month stipend for Kristy Loke to complete her GovAI project examining the state of AI development.	September 2023
Morgan Simpson	$32,653	This grant is funding a 6-month stipend for Morgan Simpson to produce 2 AI governance white papers and a series of case studies, with additional research costs.	September 2023
Steve French	$5,000	This grant provides one year of technical assistance and capacity building for ACX meetup in Atlanta Georgia. The main aim is to increase attendance in sessions and encourage rational thinking.	September 2023
Kristy Loke	$50,000	This grant provides 6 months of funding for Kristy Loke's research with Fynn Heide on AI development and AI safety engagement over the course of 6 months.	September 2023
Anonymous	$7,700	3-month unpaid AI Governance internship to build career capital at the Millennium Project, a global futurist think tank	September 2023
Codruta Lugoj	$7,537	This grant provides a 4-month stipend for capacity building in AI alignment. The goals are for the grantee to gain research engineering skills implementing experiments in alignment (which will be shared on Github), understand current alignment agendas, and transition to doing the winter SERI MATS program or becoming an alignment researcher.	September 2023
Shashwat Goel	$12,000	This grant is funding a 3-month stipend for Shashwat Goel's SERI MATS research on knowledge removal techniques as a convergent safety technique that can help mitigate diverse risk scenarios. with the Center for AI Safety. The research results will produce an academic paper to engage and direct the efforts of the wider ML community.	September 2023
Ross Nordby	$40,000	This grant provides funding for 1 year, part-time independent AI safety research focused on interpretability. Research will be published online, with longer-term goals of publishing in an academic journal or similar, if appropriate.	September 2023
Charles Whittaker	$5,293	Funds to support travel for research with the Nucleic Acid Observatory relating to biosecurity and GCBRs	October 2023
Anonymous	$53,400	Support for two researchers to work on a paper about (dis)empowerment in relation to Artificial General Intelligence. The paper will aim for top ML conferences such as NeurIPS and will formalize a notion of (dis)empowerment in order to train and evaluate models that do not reduce human agency.	October 2023
Alexander Mann	$36,000	Designing a plan for a longtermist industrial conglomerate aligned via a reputation based economy	October 2023
Francis Rhys Ward	$8,285	Funding for (academic/technical) AI safety community events in London	October 2023
Logan Smith	$40,000	Support for further pursuing sparse autoencoders for automatic feature finding	October 2023
Bilal Chughtai	$42,460	Support to work on mechanistic interpretability research with mentorship from Prof David Bau	October 2023
Thomas Kwa	$14,880	Grant for past study of Goodhart effects on heavy-tailed distributions.	October 2023
Nicky Pochinkov	$71,695	This grant is funding a 12-month stipend for Nicky Pochinkov's independent AI Safety research. Nicky is exploring research on LLM Modularity/Separability & Modelling Goals and Long-term Behavior. Results from Nicky's research will be published to LessWrong.	October 2023
Arran McCutcheon	$6,214	This grant will support Arran McCutcheon's work on AI governance projects and activities.	October 2023
Drake Thomas	$14,880	Retroactive grant to study Goodhart effects on heavy-tailed distributions	October 2023
Matthias Dellago	$25,907	6-month stipend for Matthias Dellago working on his Master’s thesis and paper on technical alignment research: mechanistic interpretability of attention. The paper will be published to arXiv and discussed in a blog post. The paper may also be submitted to a conference, with further plans to develop a tool that will allow other researchers to easily leverage and build on the results of this research.	October 2023
Zachary Furman	$40,000	6-month stipend for Zachary Furman's research as part of Daniel Murfet's research group at the University of Melbourne, where he is working on developmental interpretability and singular learning theory which will be published to academic ML conferences.	October 2023
Ann-Kathrin Dombrowski	$27,892	3-months stipend for SERI MATS extension to work on internal concept extraction	October 2023
Jacques Thibodeau	$27,108	Funding to continue making tools for accelerating alignment and the Supervising AIs Improving AIs agenda	October 2023
Berkeley Existential Risk Initiative	$30,000	This grant provides operational support for the mechanistic interpretability and language model steering project by Team Shard.	October 2023
Hoagy Cunningham	$35,923	This grant provides a stipend for the grantee to draft a paper by the end of the SERI MATS phase on the sparse coding project and for supporting future research.	October 2023
Kurt Brown	$15,000	4 weeks dev time to make a cryptographic tool enabling anonymous whistleblowers to prove their credentials	October 2023
Harrison Gietz	$10,750	This grant will support Harrison Gietz with a stipend for AI safety technical and/or governance research. The research goals are to conduct impactful research to influence AI safety research, governance, and/or evals in a positive direction, and to upskill in safety-relevant ML, with the aim of producing a paper/report that is published in a reputable ML/AI conference or journal.	October 2023
Aidan Ewart	$7,929	6-month stipend for part-time independent research on LM interpretability for AI alignment	October 2023
Anonymous	$48,423	This grant provides a stipend for work generating feature identification methods based on information content useful for the purpose of interpreting neural networks. Results will be shared through blog posts for community review.	November 2023
Cole Wyeth	$50,000	This grant provides 1 year of funding for tuition (~66% of the total grant) and living expenses for Cole Wyeth, who is pursuing a PhD in Computer Science at the University of Waterloo. Cole will be studying extensions of the AIXI model to reflective agents to understand the behavior of self modifying AGI, supervised by Professor Hutter.	November 2023
Scott Viteri	$10,000	This grant provides 1 year of compute funding to develop a novel training technique that implements incentives towards prosocial behavior to improve the safety and alignment of LLMs. The goal is to provide a strong enough proof-of-concept that OpenAI or Anthropic implements the technique in their next large training run.	November 2023
Anonymous	$48,000	This grant provides 6 months of support for up-skilling in technical AI alignment and independent interpretability research.	November 2023
Samotsvety Forecasting	$6,000	General support for a forecasting team	November 2023
Felix Binder	$2,000	This grant will support Felix Binder's compute for an experiment about how steganography in large language models might arise as a result of benign optimization.	November 2023
Prompt Human Inc.	$43,159	6 months of funding for Quentin Feuillade--Montixi to continue working on Model Psychology and Evaluation research and publish findings to LessWrong.	November 2023
Anonymous	$20,000	This grant will support the grantee with a 6-month part-time stipend to study, write about, and advise on frontier model regulation and forecasting.	December 2023
Christopher Lakin	$5,000	This grant will support Christopher Lakin facilitating a small workshop in February focused on coordinating/planning/applying the concept of «boundaries» to AI safety.	December 2023
David Udell	$80,000	This grant will support David Udell with a one year stipend and compute budget for full-time technical AI alignment research.	December 2023
Existential Risk Observatory	$24,484	This grant will support Existential Risk Observatory in organizing AI x-risk events with experts (such as Stuart Russell), politicians, and journalists in order to inform and influence policymaking.	December 2023
Riya Sharma	$1,702	This grant will provide Riya Sharma funding to attend the 2023 Biological Weapons Convention (BWC) Working Group Meeting to discuss transparency surrounding bioweapons with country representatives, as well as to work on a related research project.	December 2023
Brian Tan	$61,460	This grant is providing nine months of funding for WhiteBox Research (1.9 FTE) to pilot a training program in Manila focused on Mechanistic Interpretability.	December 2023
Thomas Kwa	$75,000	This grant will support Thomas Kwa with a 6-month stipend grant to research interpretability and control in independent alignment projects. He aims to prove the accuracy of a 1-layer attention only model, develop a variant of NMF to find human-interpretable features in LMs, and do activation engineering.	December 2023
Anonymous	$27,450	This grant will support 3 months of research in nascent areas of EA and longtermism (such as digital sentience), with the eventual aim of founding a new organization or company.	January 2024
The University of Hong Kong	$33,000	Undergrad buyout for Nathaniel Sharadin to teach AI safety in Hong Kong’s new MA program on AI; China-West AI Safety workshop.	January 2024
Logan Strohl	$80,000	This grant will provide Logan Strohl with a one-year stipend to support work developing materials demonstrating an investigative procedure for advancing the art of rationality. This work has the potential to build capacity for open technical problems like AI alignment, which require new conceptual breakthroughs and defy complete formal theorizing.	January 2024
John Wentworth	$200,000	This grant will provide John Wentworth with a 1-year stipend to continue his research into natural abstractions, with the goal of using products as a feedback mechanism to bridge the theory-practice gap and eventually apply the research to retargeting ML-internal planning processes, oversight, designing AI architectures, and building higher-level agency theory.	January 2024
Chris Mathwin	$28,325	This grant will provide Chris Mathwin with a 6-month stipend to conduct independent mechanistic interpretability projects following SERI MATS 4.1. Chris's research aims to improve fundamental understanding of the attention mechanism and to develop an appropriate functional unit of analysis for mechanistic interpretability.	January 2024
Anonymous	$7,400	4-month stipend for remote part-time mechanistic interpretability research under Neel Nanda extending SERI MATS research	January 2024
Yuxiao Li	$45,000	This grant is funding a $35,000 stipend plus $10,000 in compute costs for Yuxiao Li's independent inference-based AI interpretability research.	January 2024
Anonymous	$9,675	This grant will support the grantee with a living cost top-off stipend while they work on long-term relevant research at a DC think tank.	February 2024
Philip Quirke	$61,000	This grant will provide Philip Quirke with a six-month study grant to speed up his career pivot into AI safety and alignment research. Specific deliverables include a paper on and tooling to help simplify the process of understanding complex ML capabilities.	February 2024
Anonymous	$2,200	This grant will support the grantee with funding to visit MIT FutureTech.	February 2024
Marcus Williams	$42,000	This grant will support Marcus Williams with a 6-month stipend to train Multi-Objective RLAIF (Reinforcement Learning from AI Feedback) models and compare their safety performance to standard RLAIF, with the goal of improving the alignment of future AI systems.	February 2024
Rafael Andersson Lipcsey	$7,000	This grant will support Rafael Andersson Lipcsey with a 4-month stipend for upskilling within the field of economic governance of AI.	February 2024
Keith Wynroe	$22,395	5-month funding to continue upskilling in mechanistic interpretability post-SERI MATs, and to continue open projects	February 2024
Effective Altruism Israel	$40,000	This grant will provide EA Israel funding to support MentaLeap, a collaborative group of over 100 scholars composed of neuroscientists, AI researchers, and cybersecurity experts. The grant will support MentaLeap with costs associated with their office rental, leadership, compute budget, and food and refreshments.	February 2024
Dillon Bowen	$30,000	This grant will support Dillon with a six-month stipend to transition to a career in AI safety while working on AI safety projects.	March 2024
Lukas Fluri	$37,120	This grant will support Lukas Fluri with a six-month stipend to do an unpaid internship focused on using theory/interpretability to increase the safety of AI systems.	March 2024
Anonymous	$275,000	This grant will support stipend, compute, and contractor costs for an AI interpretability research platform for LLMs.	March 2024
Anonymous	$3,008	This grant will support the grantee with travel expenses to present at the 2024 Global Health Security Conference in Sydney.	March 2024
Aidan Ewart	$23,159	This grant will support Aidan Ewart with four months of funding for a MATS 5.0 extension. Aidan will work on improving methods in latent adversarial training to advance language model safety.	March 2024
Arjun Panickssery	$34,100	This grant will support Arjun Panickssery with a four-month stipend for MATS extension work. Arjun will be studying the safety implications of LLM self-recognition.	March 2024
Skyler Crossman	$140,000	This grant will support Skyler Crossman with a 12-month stipend to work as a coordinator for global rationality meetups.	March 2024
Anonymous	$3,821	This grant will support the grantee with travel funding to present biosecurity policy research at Global Health Security Conference 2024.	March 2024
Hannah Erlebach	$25,125	This grant will support Hannah Erlebach with a 4-month stipend to continue AI projects relevant to single	March 2024
Yoav Tzfati	$62,150	This grant will fund Yoav Tzfati with a four-month stipend to continue working on a MATS project. Yoav's project aims to use meta level adversarial evaluation of debate (scalable oversight technique) on simple math problems, with the ultimate goal of red-teaming scalable oversight techniques that may be used to align transformative AI.	March 2024
Ashgro Inc	$272,800	This grant will provide Apart Research (through fiscal sponsor Ashgro, Inc.) with funding (salaries & ops costs) for AI Safety talent incubation through research sprints and fellowships.	March 2024
Garrett Baker	$17,500	This grant will support Garrett Baker with a 3-month MATS extension stipend to use singular learning theory to explain & control the development of values in machine learning systems.	March 2024
Alfie Lamerton	$6,001	This grant will support Alfie Lamerton in conducting a one-month literature review on in-context learning and its relevance to AI alignment.	March 2024
Anonymous	$30,291	This project will support the grantee with a 5-month stipend to create a research agenda and conduct research using the IO literature in economics for AI strategy; findings will be published online when done.	March 2024
Epistea	$103,822	This grant will provide one year of funding for PIBBSS (through fiscal sponsor Epistea), a research initiative aiming to leverage insights on the parallels between intelligent behaviour in natural and artificial systems towards progress on important questions about future artificial systems. The grant will support several programs, including the 2024 fellowship, affiliate program, and a reading group.	March 2024
Egg Syntax	$55,000	This grant will support Egg Syntax with four months of funding for research on how much language models can infer about their current user, and interpretability work on such inferences. This work aims to contribute to better understanding of state-of-the-art LLMs, in particular with respect to their capacity to infer information about users, to help detect deceptive or manipulative behavior.	March 2024
Oscar Balcells	$40,356	This grant will support Oscar Balcells with a 4-month stipend to research the mechanisms of refusal in chat LLMs, with the ultimate goal of contributing to models that are more resistant to misuse.	April 2024
Anonymous	$127,000	This grant will support the grantee with a one-year stipend for policy and technical work on biosecurity.	April 2024
Sienka Dounia	$8,500	This grant will support Sienka Dounia with a three-month stipend to support relocation from Chad to London to work on Eliciting Latent Knowledge with Jake Mendel from Apollo Research. This work aims to improve the transparency and reliability of AI systems, reducing the risks of deceptive models, with direct benefits to the AI safety research community.	April 2024
Teunis van der Weij	$30,458	This grant will support Teunis van der Weij with 4-month expenses for AI safety research on personas and sandbagging during the MATS 5.0 extension program.	April 2024
Ashgro Inc	$100,000	This grant will provide support for an increase in Timaeus' salaries and rates for employees & contractors, enabling them to continue their work investigating the applications of Developmental Interpretability (DevInterp) and Singular Learning Theory (SLT) to AI safety.	April 2024
Hayden Peacock	$5,000	This grant will support Hayden Peacock with a two-month stipend while Hayden works to establish a broad-spectrum antiviral research organization. If the proposed venture is successful, it will improve defenses against future pandemics by providing new broad-spectrum antiviral drugs.	April 2024
Joseph Kwon	$40,000	This grant will support Joseph Kwon with a 6-month stipend to work on a machine learning safety project, with the aim of investigating the limitations of current probing/interpretability methods of representation engineering in AI.	April 2024
Abhay Sheshadri	$15,075	This grant will support Abhay with a four-month stipend to work on two research projects during the MATS 5.0 extension program, focusing on understanding and mitigating the potential misuse of language models (LMs) and developing tools for the safe pruning of knowledge from AI systems.	April 2024
Sviatoslav Chalnev	$40,000	This grant will support Sviatoslav Chalnev with a six-month stipend to continue independent interpretability research, with the goal of diversifying AI alignment work with more speculative ideas.	April 2024
Danielle Ensign	$60,000	This grant will support Danielle Ensign with a six-month stipend to do circuit-based mechanistic interpretability on MAMBA, as part of the MATS extension program.	April 2024
Roman Soletskyi	$35,468	This grant will provide Roman Soletskyi with a 6-month stipend to conduct research on AI safety, verifying neural network scalability for reinforcement learning and producing a human to superhumansuper-human scalable oversight benchmark, which will eventually be published publicly.	April 2024

Long-Term Future Fund: May 2023 to March 2024 Payout recommendations

Long-Term Future Fund: May 2023 to March 2024 Payout recommendations

Introduction

Highlighted Grants

Other updates

Other writings

Appendix

Other Grants We Made During This Time Period