Cambridge, Massachusetts

Marvin von Hagen

Co-founder of Poke and a visiting graduate researcher at MIT, working on AI alignment, foundation models, and the science of auditing the systems we are starting to trust.

01

About

I am an entrepreneur and AI researcher based in Cambridge, Massachusetts. I co-founded Poke, an AI personal assistant built by The Interaction Company of California, and I am a visiting graduate researcher at MIT focused on artificial intelligence and foundation models.

Before MIT, I studied Management & Computer Science at the Technical University of Munich, co-founded TUM Boring, a student tunnel-boring team, and worked at Tesla and other organizations. My early work exposing misaligned behavior in large language models, including extracting the Bing / Sydney system prompt, was covered in Time and the Washington Post.

02

Research

Citations: 321
h-index: 4
Latest at FAccT: 2024
  1. Black-box access is insufficient for rigorous AI audits

    FAccT 2024 / Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency / Cited 265 times
  2. Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT / RLHF Datasets and Refusal Behavior of Black-Box LLMs

    NeurIPS 2024 / NeurIPS 2024 SFLLM Workshop / Cited 13 times

Selected work with co-authors including Max Tegmark and Dylan Hadfield-Menell at MIT. Full list on Google Scholar.

03

Experience

04

Elsewhere

Reach out