Diego Calanzone

me.jpg

I am a Ph.D. student at Mila Quebec AI Institute under the supervision of Prof. Pierre-Luc Bacon. Currently, my research consists of Reinforcement Learning with Large Language Models for applications (drug discovery, material discovery) and theory (hierarchical teinforcement Learning, generative rewards).

With my studies in CS/AI, I developed a background in software engineering, logic, computer vision and language. Deep research questions lie at the intersection: what priors are necessary and sufficient to develop intelligent behavior? How to learn efficiently? What are the benefits for humanity in scaling artificial intelligence?

In this web space I aim to share my research in the broader sense, that is the pursuit of understanding of life, intelligence and creativity. My interests are highly heterogenous, from social psychology to environmental activism and analog life. In my spare time, I play music, enjoy board sports and read mostly about research in other fields.

news

Jan 11, 2025 Our work Discovery of Sustainable Refrigerants through Physics-Informed RL Fine-Tuning of Sequence Models has been accepted to the SIMBIOCHEM Workshop @ EurIPS 2025!
Jan 10, 2025 We presented Logically Consistent Language Models via Neuro-Symbolic Integration as a full paper @ ICLR 2025!
Jan 10, 2025 Our work Mol-MoE: Training Preference-Guided Routers for Molecule Generation has been admitted to the GEM Bio Workshop @ ICLR 2025!
Oct 16, 2024 Our paper Logically Consistent Language Models via Neuro-Symbolic Integration has been accepted at the System 2 Reasoning At Scale Workshop @ NeurIPS 2024! 🄳
Aug 1, 2024 I will serve as tutor and creator for the RL Lab @ M2L Summer School 2024 in Milan! Learning contents to be released soon.

Publications

  1. Reasoning
    Logically Consistent Language Models via Neuro-Symbolic Integration
    Diego Calanzone,Ā Stefano Teso,Ā andĀ Antonio Vergari
    NeurIPS 2024 Workshop on System 2 Reasoning at Scale
  2. AI4Science
    Discovery of Sustainable Refrigerants through Physics-Informed RL Fine-Tuning of Sequence Models
    Adrien Goldszal,Ā Diego Calanzone,Ā Vincent Taboga, and 1 more author
    arXiv preprint arXiv:2509.19588
  3. AI4Science
    Mol-MoE: Training Preference-Guided Routers for Molecule Generation
    Diego Calanzone,Ā Pierluca D’Oro,Ā andĀ Pierre-Luc Bacon
    arXiv preprint arXiv:2502.05633
Ā© Copyright 2025 Diego Calanzone. Powered by Jekyll. Last updated: November 10, 2025.