Parts A and B looked at 10.0 alone. Part C uses the multi-cohort dataset (6.0 through 10.0, plus alumni outcome data) to validate findings, test for trends, and ask discovery questions we can't answer from 10.0 alone. 8 analyses across cohorts.
| Analysis | Question | n |
|---|---|---|
| C1 — CodeSignal paradox across cohorts | Does the 8.0 paradox replicate? | 7.0/8.0/9.0 (CS + mentor evals) |
| C2 — What predicts mentor evals? | Application features → mentor evals, per cohort | 7.0: n=37, 8.0: n=58, 9.0: n=55 |
| C3 — What predicts publications? | Application features → post-program pubs | 6.0–9.0 alumni-pub joins (n ~50–80 each) |
| C4 — Quality trend across cohorts | Did selection quality improve over 6.0 → 9.0? | 76–106 mentor evals per cohort |
| C5 — 10.0 features retrospective | Would 10.0 features have predicted 8.0/9.0 performance? | 8.0/9.0 joins, n ~60 each |
| C6 — Demographics across cohorts | Did 10.0's centralized process change demographic outcomes? | 7.0–10.0, varies by group |
| C7 — Pool composition shifts | How has the applicant pool changed? | 6.0–10.0, full pools |
| C8 — Alumni survey outcomes | Where do alumni go? What mechanism do they attribute MATS impact to? | 123 survey respondents |
None unrecovered. One in-flight fix: C3's initial join was too strict (listwise drop on all features dropped most rows for 6.0/7.0/8.0). Switched to mean-imputation with cohort-specific CodeSignal column lookup; this brought all four cohorts into the model.