Where multiple locations are listed for this role, the position may be based in any of those locations, with priority determined according to the order of listing. What you’ll do As a PhD intern, you will: - Collaborate with research scientists to advance methods in: - Planning and RL for computer use (e.g. behavioral cloning, RL on model weights, RAG-based domain knowledge) - Multimodal grounding (e.g. vision-only models, tree search, hybrid methods with large models) - Reward/judge modeling (e.g. error analysis, human evaluation, training judge models) - U
Sign in to apply — one profile, every role on PreferHired.
Sign in to apply