Multitask Preplay in Humans and Machines

We hypothesize that humans leverage experience on some tasks to preemptively learn solutions for other tasks that were accessible but unpursued. By doing so, they obtain reactive behavior that adapts to novel, unpursued tasks, something typically associated with deliberate planning. We operationalize this as Multitask Preplay and present behavioral evidence from a gridworld domain and a 2D Minecraft domain. We conclude with AI simulations in the 2D Minecraft domain showing that Multitask Preplay improves generalization of complex tasks to new environments that share task co-occurrence structure.

Multitask Preplay

Consider someone who has moved to a new neighborhood and visited two coffee shops. On the way, they noticed a grocery store along their route. When they later want to go to the grocery store from their home, what behavior do you think they will exhibit?

While people certainly exhibit options 1 and 2 sometimes, we argue that people exhibit option 3 more often than we realize: they somehow have access to fast, reactive behavior that can accomplish a novel goal they have previously been exposed to. We hypothesize that this is supported by Multitask Preplay.
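To make the idea concrete, here is a minimal sketch of the coffee shop example, not the authors' implementation. It assumes a hypothetical one-dimensional corridor, a known deterministic model (`step`), and tabular goal-conditioned Q-values. It also swaps in deterministic Dyna-style backup sweeps for sampled counterfactual rollouts. All names, the layout, and the hyperparameters are illustrative assumptions. After walking to the coffee shop, the agent replays the states it visited while pretending the grocery store was the goal, so that a cheap greedy policy can later reach the grocery store without planning.

```python
from collections import defaultdict

# Hypothetical corridor world: cells 0..7, home at 0, coffee shop at 6,
# and a grocery store at 3 that is passed on the way to the coffee shop.
N_STATES = 8
ACTIONS = (-1, +1)  # step left, step right
HOME, COFFEE, GROCERY = 0, 6, 3
GAMMA = 0.9


def step(state, action):
    """Assumed-known deterministic model: move one cell, clipped to the corridor."""
    return min(max(state + action, 0), N_STATES - 1)


def preplay(q, visited, goal, sweeps=50, alpha=0.5):
    """Offline Q-learning backups for an unpursued `goal` over visited states."""
    for _ in range(sweeps):
        for s in visited:
            if s == goal:
                continue  # the goal is terminal, nothing to back up
            for a in ACTIONS:
                s2 = step(s, a)
                reward = 1.0 if s2 == goal else 0.0
                future = 0.0 if s2 == goal else max(q[goal][(s2, b)] for b in ACTIONS)
                target = reward + GAMMA * future
                q[goal][(s, a)] += alpha * (target - q[goal][(s, a)])


def greedy_path(q, goal, start, max_steps=20):
    """Reactive behavior: follow the greedy action for `goal`, with no planning."""
    path, s = [start], start
    while s != goal and len(path) <= max_steps:
        a = max(ACTIONS, key=lambda act: q[goal][(s, act)])
        s = step(s, a)
        path.append(s)
    return path


# Goal-conditioned Q-table: q[goal][(state, action)], all values start at 0.
q = defaultdict(lambda: defaultdict(float))

experience = list(range(HOME, COFFEE + 1))  # states walked en route to coffee
unpursued = [GROCERY]                       # goal seen on the way, never pursued

path_before = greedy_path(q, GROCERY, HOME)  # no preplay: never leaves home
for goal in unpursued:
    preplay(q, experience, goal)
path_after = greedy_path(q, GROCERY, HOME)

print(path_after)  # [0, 1, 2, 3]
```

Before preplay the Q-values for the grocery goal are all zero, so the greedy policy never reaches the store. After preplay the same policy walks straight there. The learning happened offline, using only states from the trip to the coffee shop.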

Results

Video of a person completing the generalization task. Left: global view. Right: what the participant sees.

BibTeX

@article{carvalho2025preemptive,
  author    = {Carvalho, Wilka and Hall-McMaster, Sam and Lee, Honglak and Gershman, Samuel J.},
  title     = {Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines},
  journal   = {arXiv preprint arXiv:2507.05561},
  year      = {2025},
}
