Enabling the AI security experts of 2025 and beyond

Impact Academy’s Global AI Safety Fellowship is a fully funded research program for up to 6 months for exceptional STEM talent to work with the world’s leading AI safety organisations to advance the safe and beneficial development of AI.

The Application Deadline has Passed.

You can use the form below to express your interest. We will reach out to you once we launch applications for our second iteration.

Global AI Safety Fellowship 2025

We will select Fellows from around the world to pursue AI safety research for up to 6 months (or longer). After an initial application, candidates will be invited to participate in a rigorous and purpose-driven selection process that lasts 4 to 6 weeks. We aim to select a cohort of 10-20 technically skilled candidates with a knack for working on complex problems. This number may increase or decrease depending on the strength of our applicant pool.

Our Fellows will work with collaborators from partner organisations like the Center for Human Compatible AI (CHAI) at UC Berkeley, Conjecture, FAR.AI, and the UK AI Safety Institute (AISI). We are excited to support Fellows who will transition to full-time AI safety research after the program. 

Fellows will receive comprehensive financial support that covers their living expenses and research costs, along with dedicated resources for building foundational knowledge in AI safety, regular mentorship, 1:1 coaching calls with our team, and facilitation for in-person work with our partner orgs.

Work With Researchers From

Application Process Overview

Phase 1: Applications & Technical Assessment

Phase 1.5: Optional BootCamp for Candidates New to AI Safety

Phase 2: Assessment with Placement Labs and Organisations

Expected Timeline

Why AI Safety

AI might be the most transformative technology of all time. To make it go well for humanity, we must seriously consider the types of risks advanced AI systems of the future might pose. Fortunately, there is a growing ecosystem of professionals and institutions dedicated to researching and solving these problems- AI safety.

AI safety focuses on developing technology and governance interventions to prevent both short-term and long-term harm caused by advanced AI systems. To learn more, check out this list of resources.

We believe we can support global talent, who might otherwise not have had the opportunity, to play an important role in advancing research and other work in the field through their careers.

Research Directions

Depending on their interests and the placement orgs’ projects, Fellows may get to work on a range of research directions in AI safety and alignment. Below, we have indicatively outlined some of these research agendas.

Adversarial Robustness

Adversarial Robustness is aimed at developing machine learning models that can maintain their performance and reliability even when faced with intentionally misleading or manipulated inputs. See FAR.AI’s work on adversarial training of Go AIs.

Cognitive Emulation

Cognitive Emulation (CogEm) is primarily the research agenda of Conjecture, wherein the goal is to build AI systems that emulate human reasoning and are scalable, auditable and controllable. Through this approach, the systems could be sufficiently understood and bounded to ensure they do not suddenly dramatically shift their behaviour.

Model Evaluations

Model Evaluations are about producing empirical evidence on a model's capabilities and behavioural tendencies, which allows stakeholders to make important decisions about training or deploying the model. For examples, see DeepMind’s evaluations for dangerous capabilities, or Sam Brown’s AutoEnhance evaluation proposal.

Scalable Oversight

Scalable Oversight refers to a set of approaches to help humans effectively monitor, evaluate, and control such complex AI systems. Approaches include constitutional AI, AI safety via debate, iterated distillation and amplification and reward modelling. To learn more, check out Anthropic’s Constitutional AI, or OpenAI’s AI Safety via Debate.

Mechanistic Interpretability

Mechanistic Interpretability is an area of interpretability concerned with reverse-engineering the trained models into human-understandable algorithms. See, for example, recent work from FAR.AI on investigating planning in RNN model playing Sokoban.

Value Learning

Value learning (also called preference inference) is a proposed method for incorporating human values in an AI system. Human values are difficult to specify. Current approaches work on learning human values through human feedback and interaction. Read MIRI’s Learning What to Value and CHAI’s work on Cooperative Inverse Reinforcement Learning.

FAQs

Can I participate in the Fellowship part-time?

We are only looking for candidates who can commit to full-time participation for this iteration of the Fellowship.

I know someone who would be a good fit. Can I refer them?

Yes! We're offering $2,000 to anyone who refers a successful candidate not already in our database who we end up selecting for the Fellowship. This applies to external referrers who we are not already collaborating with. Please use this form to refer potential candidates. 

What if I am new to AI safety?

We’re looking for candidates with strong technical qualifications and research aptitude who are interested in working on ways to reduce the risks of advanced AI. For candidates who excel in our Phase 1 assessment, we will offer a paid opportunity to upskill in the foundations of AI safety and alignment research through a guided curriculum. Even if you have less prior knowledge of this, we encourage you to apply. We are happy to work with you to improve your understanding of the field.

What will I work on if selected?

Our partner organisations work on a portfolio of several experimental and established research agendas for safer and more aligned advanced AI systems. The exact research agendas will depend on the selection and matching process, but Fellows would broadly be working in areas like adversarial robustness, mechanistic interpretability, scalable oversight, cognitive emulation, and control problems, to name some.

What can I gain from this program?

You will work on impactful research projects with some of the most talented researchers in the field, at top AI safety labs and research institutions. Fellows who perform well would have reliable opportunities to continue working full-time on their projects. As a Fellow, you would:

  • Publish papers in top ML conferences like ICML, ICLR, NeurIPS, CVPR, etc.

  • Develop empirical ML research experience on Large Language Models.

  • Participate in a niche research community to collaborate on challenging problems.

  • Advance existing AI safety strategies and research agendas or develop new ones.

Want to apply but have more questions? Write to us!

If you think you may be a good fit for the program but would like to clarify some doubts, please email us at aisafety@impactacademy.org. We are happy to answer queries and potentially get on a quick call to understand you better.

About Us

Impact Academy is a startup that runs cutting-edge fellowships to enable global talent to use their careers to contribute to the safe and beneficial development of AI. Our focus is to provide opportunities for students and professionals to explore challenging ideas in the field and spearhead new strategies for AI safety (AIS). We also offer mentorship for career development and support for job placements.

Since 2023, we have launched, incubated or co-organised several programs in AI safety, technical and alignment research, and governance. These include:

Learn more about us here.

Get in touch

Excited to learn more or looking for a way to get involved? Use the form below to express your interest.