We hypothesize that humans leverage experience on some tasks to preemptively learn solutions for other tasks that were accessible but unpursued. By doing so, they obtain reactive behavior that is adaptive to novel, unpursued tasks---something typically associated with deliberate planning. We operationalize this as Multitask Preplay and present behavioral evidence with a gridworld domain and a 2D minecraft domain. We conclude with AI simulations in the 2D minecraft domain showing that Multitask Preplay improves generalization of complex tasks to new environments that share task co-occurrence structure.
Consider someone that has moved to a new neighborhood and visited two coffee shops. Along the way, they observed a grocery store along their route. When they later want to go to the grocery store from their home, what behavior do you think they will exhibit?
While people certainly exhibit options 1 and 2 sometimes, we argue that people exhibit option 3 more often than we realize---i.e. that people somehow have access to fast, reactive behavior that can accomplish a novel goal they've previously been exposed to. We hypothesize that this is supported by Multitask Preplay.
NiceWebRL is a Python library for quickly making human subject experiments that leverage machine RL environments. It supports making online web experiments with 50+ machine RL environments and was used for all experiments in this paper.
Naturalistic Computational Cognitive Science: Towards generalizable models and theories that capture the full range of natural behavior. This is a position paper on why and how we can build generalizable models of human behavior that can scale up to increasingly naturalistic experimental paradigms. This project is an example of doing that: we first predict human behavior in a fully-observable grid-world, and then in a partially-observable 2D minecraft environment.
Predictive representations: building blocks of intelligence. This is a review of predictive representations in machine learning, cognitive science, and neuroscience. We focus on the family of predictive representations defined by the successor representation, but the arguments equally apply to the family defined by generalized value functions (which Multitask Preplay is an example of).
@article{carvalho2025preemptive,
author = {Carvalho, Wilka and Hall-McMaster, Sam and Lee, Honglak and Gershman, Samuel J.},
title = {Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines},
journal = {arXiv preprint arXiv:2507.05561},
year = {2025},
}