Predictors of success in hiring CEA’s Full-Stack Engineer

AK.; Ben_West🔸

Comments 19

Charles He

Assets aren't showing up:

AK.

Images should be fixed now, thanks for pointing this out.

Habryka [Deactivated]

Yep, images are broken. My guess is the document was copy-pasted from a Google Doc, with the images hosted in a way that isn't publicly accessible.

Linch

Thanks for this post! I'll be interested in data from CEA hiring overall, even with the obvious caveat that hiring across different roles will require different skillsets and experiences.

Ben_West🔸

Thanks! In case you haven't already seen that: this post is part of a sequence about EA hiring; other posts have information about hiring different roles.

Cassidy

This was a great write up, interesting topic, informational and easy to follow.

One question I had is whether the terms below were the only words you looked for in a CV, and if so, why. For example, you did not list "Lead", which I'd think is frequently used for engineering roles.
I'm assuming either these were just examples (so not a complete list), or applicants only used these two terms?

Did any previous role include the word “senior” in its title?
Did any previous role include the word “manager” in its title?

Ben_West🔸

Thanks! Yeah, maybe we should also have looked for "lead", but we didn't. There's no strong reason why it would be a bad idea; I just didn't think of it.

Charles He

So uh you guys/girls have n=7 samples of people in this FAANG group, and you're using this to get coefficients for one of the regressions. Then for the next regression, for the FAANG people making it a cut further, you probably only have 3 observations for that regression?

So I think the norm here is to show "summary stats" style data, e.g. a table that says "For the FAANG applicants, X of these 7 made it". I think this table would be better.

Basically, a regression model doesn't add a lot, with this level of data.

Also, with this little data, I'm unsure, but there might be weird "degrees of freedom" sort of things, where due to an interaction, the signs/magnitudes explode/implode.

Can you share your code for the regressions that made this table?

Ben_West🔸

"Basically, a regression model doesn't add a lot, with this level of data"

Yes, I agree that this is the conclusion of the piece, but I feel like you are implying that this means the methodology was flawed?

We aren't trying to do some broad scientific analysis; we are just trying, practically, to identify ways to speed up our hiring process. And given that we do, in practice, have a relatively small number of people applying to each round, we are (apparently) not able to use automated methods to identify the most promising candidates with high accuracy.

Charles He

(Maybe my stats/prob/econometrics is rusty, feel free to stomp this comment)

Yeah, you guys have a 94% pass rate for one dataset you use in one regression.

So you could only be getting any inference from literally the 3 people who failed the screening interview.

So, like, in a logical, "Shannon information" sense, that is all the info you have to go on to get magnitudes and statistical power for that particular regression. Right?

So how are you getting a whole column of coefficients for it?

Ben_West🔸

No, "This model predicts whether all invited applicants (N=85) would pass the screening interview." So it's 45/85.
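A minimal sketch of the arithmetic behind this distinction, using the counts from the post's stage table (85 applicants, 48 taking the screening interview, 45 passing). It assumes, as the reply above suggests, that the model treats all 85 applicants as observations; the variable names are illustrative and not taken from the original analysis.

```python
# Counts taken from the stage table in the post.
applicants = 85          # everyone who applied
interviewed = 48         # passed the initial sift and took the screening interview
passed_screening = 45    # passed the screening interview

# Pass rate among only those who actually took the interview (the "94%" figure).
conditional_rate = passed_screening / interviewed

# Outcome rate when every applicant is a row in the model (N=85):
# anyone who did not pass the interview, including those never invited, counts as 0.
unconditional_rate = passed_screening / applicants

# Number of negative outcomes the N=85 model can learn from.
negatives_in_model = applicants - passed_screening

print(f"Among interviewed candidates: {conditional_rate:.1%}")
print(f"Among all applicants:         {unconditional_rate:.1%}")
print(f"Negative outcomes in the N=85 model: {negatives_in_model}")
```

Under that assumption the regression has 40 negative outcomes to work with rather than 3, which is why it can produce a full column of estimates.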

Charles He

Yes, understood, thanks, I was just confused.

Larks

"94% pass rate"

Also, it does seem that, at least ex post, they might benefit from raising the bar a bit on this round.

Ben_West🔸

Yeah, the point of the screening interview is mostly for the candidate to ask questions. I endorse the belief that we should be measuring programmers through programming tests instead of interviews (i.e. the pass rate of the screening interview should be very high), but I go back and forth on whether the screening interview should come first or second.

Charles He

Yes, raising the bar would make the interviews more useful. This is a good thought that makes a lot of sense to me.

I think what you said makes sense and is logical.

Since I'm far away and uninformed, I think I'm more reluctant to say anything about the process and there could be other explanations.

For example, maybe Ben or his team wanted to meet with many applicants because he/they viewed them highly and cared about their EA activities beyond CEA, and this interview had a lot of value, like a sort of general 1on1.

The "vision" for the hiring process might be different. For example, maybe Ben's view was to pass anyone who met resume screening. For the interview, maybe he just wanted to use it to make candidates feel there was appropriate interest from CEA, before asking them to invest in a vigorous trial exercise.

Ben seems to think hard about issues of recruiting and exclusivity, and has used these two posts to express and show a lot of investment in making things fair.

rotatingpaguro

- Whether candidates had worked in a company with more than 1000 employees - we excluded this in favour of looking at whether candidates had worked at a FAANG company; it was not possible to include both since the variables are not independent.

I'm confused: why can't you include two predictors if they are not independent? I'm assuming that by "independent" you mean correlation 0. If you instead mean no collinearity, i.e., linearly independent vectors of predictors, then feel free to ignore my comment.
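To illustrate the distinction being drawn here, a minimal sketch with made-up data (the indicator values below are hypothetical, not CEA's): two binary predictors can be clearly correlated without being perfectly collinear, and only perfect collinearity makes it impossible to estimate both coefficients. The `pearson` helper is written out by hand for illustration.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical data for 10 candidates: every FAANG alum has worked at a
# large company, but most large-company candidates are not FAANG alumni.
large_company = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
faang         = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

r = pearson(large_company, faang)

# For a single pair of predictors (plus an intercept), perfect collinearity
# means |r| == 1. With more predictors, one would check the rank of the
# design matrix instead.
perfectly_collinear = abs(abs(r) - 1.0) < 1e-9

print(f"correlation = {r:.2f}, perfectly collinear = {perfectly_collinear}")
```

In this toy case the predictors are correlated but a regression could still include both; the cost would be wider intervals on each coefficient, which matters a lot at small sample sizes.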

Lorenzo Buonanno🔸

Am I reading correctly that you made an offer to 8 developers and had 85 applicants?

So a 9% offer rate? That seems very high, am I missing something?

Ben_West🔸

There is an additional on-site after this, and some people withdrew. We ended up making three offers from this round.

T_W

To highlight (from this comment and reply): the hire rate for this position was 3.5%.
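The two rates discussed in this exchange can be reproduced from figures stated in the thread. This is a small sketch; the variable names are illustrative, and the 3.5% figure is computed here as offers made divided by applicants, since the thread gives the number of offers rather than the number of hires.

```python
applicants = 85
passed_trial_task = 8   # the figure the question above reads as offers
offers_made = 3         # offers after the on-site stage and withdrawals

# Share of applicants who passed the trial task.
trial_pass_share = passed_trial_task / applicants

# Share of applicants who received an offer.
offer_share = offers_made / applicants

print(f"Passed trial task: {trial_pass_share:.1%}")
print(f"Received an offer: {offer_share:.1%}")
```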

They seem common in the authors’ experience; we would appreciate feedback in the comments from other hiring managers about their own experience.

We collected other factors but ultimately chose to exclude them from the analysis:
- University rankings - we could not obtain these for enough candidates, which reduced the sample size considered and affected their accuracy.
- Likely salaries in the candidate’s previous position - we used online sources to estimate the typical salary for the candidate’s most recent position and company, but again could not obtain this for enough candidates.
- Whether candidates had worked in a company with more than 1000 employees - we excluded this in favour of looking at whether candidates had worked at a FAANG company; it was not possible to include both since the variables are not independent.

Stage	Number participating	Number passing	Success rate
Initial application sift	85	48	57%
Screening interview	48	45	94%
Trial task	35	8	23%

	Passing initial sift (N=85)		Passing screening interview (N=85)		Passing trial task (N=75)
Predictor	Odds ratio (CI)	p	Odds ratio (CI)	p	Odds ratio (CI)	p
(Intercept)	1.06 (0.43 – 2.63)	0.893	1.10 (0.45 – 2.74)	0.829	0.36 (0.08 – 1.29)	0.135
Years of experience (mean = 9.4)	1.01 (0.95 – 1.08)	0.781	1.01 (0.95 – 1.08)	0.795	0.90 (0.74 – 1.03)	0.181
Has "senior" in title (n = 17)	2.23 (0.70 – 7.99)	0.189	2.63 (0.82 – 9.53)	0.116	0.81 (0.04 – 7.19)	0.860
Has "manager" in title (n = 14)	0.50 (0.14 – 1.67)	0.263	0.44 (0.12 – 1.48)	0.197	0.90 (0.04 – 7.09)	0.925
Number of typos (mean = 0.9)	0.93 (0.66 – 1.31)	0.648	0.99 (0.70 – 1.43)	0.968	0.90 (0.41 – 1.50)	0.738
Has FAANG company (n = 7)	1.92 (0.37 – 14.55)	0.464	2.44 (0.46 – 18.88)	0.325	2.04 (0.09 – 20.83)	0.575
Highest degree (mean = 1.0)	1.10 (0.52 – 2.36)	0.807	0.85 (0.39 – 1.80)	0.667	0.82 (0.19 – 2.85)	0.765
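As a reading aid for the table above, here is a rough sketch of how an odds ratio translates into a probability, using the intercept and the "senior" row from the initial-sift column. It assumes the underlying model is a logistic regression (the table reports odds ratios, but the post's code is not shown) and that every other predictor is held at zero, which is what the intercept represents and is an extrapolation for years of experience. The function names are illustrative.

```python
def odds_to_probability(odds):
    """Convert odds (p / (1 - p)) back to a probability p."""
    return odds / (1.0 + odds)

def interval_contains_one(low, high):
    """An odds-ratio interval that spans 1 is compatible with no effect."""
    return low <= 1.0 <= high

# Values copied from the "Passing initial sift" column of the table.
intercept_odds = 1.06            # baseline odds with all predictors at zero
senior_odds_ratio = 2.23         # multiplier on the odds for "senior" in a title
senior_interval = (0.70, 7.99)   # reported interval for that odds ratio

baseline_p = odds_to_probability(intercept_odds)
senior_p = odds_to_probability(intercept_odds * senior_odds_ratio)

print(f"Baseline probability of passing the sift: {baseline_p:.2f}")
print(f"With 'senior' in a previous title:        {senior_p:.2f}")
print(f"Interval spans 1 (no clear effect): {interval_contains_one(*senior_interval)}")
```

The point estimate looks large, but because the interval spans 1 the data do not distinguish it from no effect, which matches the post's conclusion that these predictors are weak at this sample size.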
