Tyler Tracy
-
Member of Technical Staff at Redwood Research, working on AI control. Former software engineer.
-
High-level overview of the types of projects I'll likely conduct:
Build new control settings to perform research in
Design new control protocols and play red team / blue team games with them
Training attack policies or monitors for existing control settings
Explore incrimination strategies to catch if a model is scheming -
Ranked in order of importance:
Technical skills in evaluating models; should be able to execute independently without much guidance. If you have used Inspect before, that is a big plus. ML knowledge is nice, but only relevant if we do a project requiring training
Knowledge of the AI control field
Good takes on AI safety in general.
Member of Technical Staff, Redwood Research