Diego Calanzone
I am a Ph.D. student at Mila Quebec AI Institute under the supervision of Prof. Pierre-Luc Bacon. My current research focuses on Reinforcement Learning with Large Language Models, covering both applications (drug discovery, material discovery) and theory (hierarchical reinforcement learning, generative rewards).
Through my studies in CS/AI, I developed a background in software engineering, logic, computer vision and language. Deep research questions lie at their intersection: what priors are necessary and sufficient to develop intelligent behavior? How can we learn efficiently? What are the benefits for humanity in scaling artificial intelligence?
In this web space I aim to share my research in the broader sense, that is, the pursuit of understanding life, intelligence and creativity. My interests are highly heterogeneous, from social psychology to environmental activism and analog life. In my spare time, I play music, enjoy board sports and read, mostly about research in other fields.
news
Publications
- Reasoning: Logically Consistent Language Models via Neuro-Symbolic Integration. NeurIPS 2024 Workshop on System 2 Reasoning at Scale
- AI4Science: Discovery of Sustainable Refrigerants through Physics-Informed RL Fine-Tuning of Sequence Models. arXiv preprint arXiv:2509.19588
- AI4Science: Mol-MoE: Training Preference-Guided Routers for Molecule Generation. arXiv preprint arXiv:2502.05633