Publications
You can also find my articles on my Google Scholar profile.
Journal Papers
Published in Computational Psychiatry, 2024
This paper extends the ICML paper from 2023.
Recommended citation: Alon et al. (2024). "(Mal)adaptive Mentalizing in the Cognitive Hierarchy, and Its Link to Paranoia" Computational Psychiatry 8-1 https://cpsyjournal.org/articles/10.5334/cpsy.117
Workshop Papers
Published in First Workshop on Theory of Mind in Communicating Agents, 2023
Recommended citation: Alon et al. (2023). "Between prudence and paranoia: Theory of Mind gone right, and wrong." First Workshop on Theory of Mind in Communicating Agents. https://openreview.net/pdf?id=gB9zrEjhZD
Published in NeurIPS 2022 workshop on information-theoretic principles in cognitive systems, 2022
Recommended citation: Alon et al. (2022). "A (dis-) information theory of revealed and unrevealed preferences." NeurIPS 2022 workshop on information-theoretic principles in cognitive systems. https://openreview.net/pdf?id=vcpQW_fGaj5
Preprints
Here you can find my current work:
Published in arXiv, 2024
This paper discusses the pitfalls of evaluating ToM in LLMs.
Recommended citation: Wagner, Alon, Barnby and Abend. (2024). "Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning" arXiv https://arxiv.org/pdf/2412.13631
Published in arXiv, 2024
This paper presents the aleph-IPOMDP model.
Recommended citation: Alon et al. (2024). "Detecting and Deterring Manipulation in a Cognitive Hierarchy" arXiv https://arxiv.org/pdf/2405.01870