publications
publications in reverse chronological order.
* = equal first author
2025
- Forking paths in neural text generation. International Conference on Learning Representations (ICLR), 2025
- Language models assign responsibility based on actual rather than counterfactual contributions. In Proceedings of the Annual Meeting of the Cognitive Science Society, 2025
- Evaluating Self-Orienting in Language and Reasoning Models. ICML Workshop on Assessing World Models: Methods and Metrics for Evaluating Understanding, 2025
- Let’s Simulate Frame-by-Frame: In-Context Physical Simulations with Vision-Language Models. ICML Workshop on Assessing World Models: Methods and Metrics for Evaluating Understanding, 2025
- Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics. ICML Workshop on Actionable Interpretability, 2025
- People evaluate the algorithms that drive agents’ behavior. Open Mind (under revision), 2025
- On (Not) Seeing Your Self: Some People May Lack Third-Person Imagery. In Society for Philosophy and Psychology, 2025
2024
- In-Context Learning Dynamics with Random Binary Sequences. International Conference on Learning Representations (ICLR), 2024
- Foundational challenges in assuring alignment and safety of large language models. Transactions on Machine Learning Research (TMLR), 2024
2023
- Mechanistic mode connectivity. International Conference on Machine Learning (ICML), 2023
- Subjective Randomness and In-Context Learning. In NeurIPS Workshop on UniReps: Unifying Representations in Neural Models, 2023
- Non-commitment in mental imagery. Cognition, 2023
2022
- People’s evaluation of programs that drive agents’ behavior. In Proceedings of the Annual Meeting of the Cognitive Science Society, 2022
- Mechanistic Lens on Mode Connectivity. NeurIPS Workshop on Distribution Shifts: Connecting Methods and Applications, 2022
- Opening the black box: People evaluate agents based on the algorithms that drive their behavior. In Society for Philosophy and Psychology, 2022
2016
- Inferring priors in compositional cognitive models. In Proceedings of the Annual Meeting of the Cognitive Science Society, 2016
- A large dataset of generalization patterns in the number game. Journal of Open Psychology Data, 2016
- Tales of two cities: Using social media to understand idiosyncratic lifestyles in distinctive metropolitan areas. IEEE Transactions on Big Data, 2016
2015
- On the need for imagistic modeling in story understanding. Biologically Inspired Cognitive Architectures, 2015