Diego Calanzone

I am a Research Intern in Deep RL + Biology at MILA Quebec AI Institute under the supervision of Prof. Pierre-Luc Bacon and Pierluca D’Oro. I am a graduate student in Artificial Intelligence Systems from the University of Trento.
My research interests include deep learning reasoning, agency and intersections in applied sciences. I believe in (1) the gains from multi-modality, (2) logic can be structure in a conceptual space, (3) machines shall have a human nature as base. I’m particularly keen on probability theory.

In this web space I share my research in a broad sense, that is the pursuit of understanding about life, society and intelligence (my main field of studies) artificially and naturally. I love blogs seeking for rationality and wisdom [1][2], as well as learning about different ethnicities and cultures (big fan of National Geographic).

news

Oct 16, 2024	Our paper Logically Consistent Language Models via Neuro-Symbolic Integration has been accepted at the System 2 Reasoning At Scale Workshop @ NeurIPS 2024! 🥳
Aug 1, 2024	I will serve as tutor and creator for the RL Lab @ M2L Summer School 2024 in Milan! Learning contents to be released soon.
Mar 5, 2024	Our paper Towards Logically Consistent Language Models via Probabilistic Reasoning has been accepted at the ICLR 2024 Workshop on Reliable and Responsible Foundation Models!
Feb 5, 2024	I’m going to talk about “Acquiring Complex Concepts with Comparative Learning” in a poster session at the 2024 CORE Project Workshop, Universitat Pompeu Fabra, Barcelona!
Jun 8, 2023	Our paper “An open source perspective on AI and alignment with the EU AI Act” has been accepted to the AI4Safety workshop @ IJCAI 2023!

Publications

Reasoning

Logically Consistent Language Models via Neuro-Symbolic Integration

Diego Calanzone, Stefano Teso, and Antonio Vergari

NeurIPS 2024 Workshop on System 2 Reasoning at Scale

Abs

Large language models (LLMs) are a promising venue for natural language understanding and generation. However, current LLMs are far from reliable: they are prone to generating non-factual information and, more crucially, to contradicting themselves when prompted to reason about relations between entities of the world. These problems are currently addressed with large scale fine-tuning or by delegating reasoning to external tools. In this work, we strive for a middle ground and introduce a loss based on neuro-symbolic reasoning that teaches an LLM to be logically consistent with an external set of facts and rules and improves self-consistency even when the LLM is fine-tuned on a limited set of facts. Our approach also allows to easily combine multiple logical constraints at once in a principled way, delivering LLMs that are more consistent w.r.t. all constraints and improve over several baselines w.r.t. a given constraint. Moreover, our method allows LLMs to extrapolate to unseen but semantically similar factual knowledge, represented in unseen datasets, more systematically.
Reasoning

Towards Logically Consistent Language Models via Probabilistic Reasoning

Diego Calanzone, Stefano Teso, and Antonio Vergari

ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Abs

Large language models (LLMs) are a promising venue for natural language understanding and generation tasks. However, current LLMs are far from reliable: they are prone to generate non-factual information and, more crucially, to contradict themselves when prompted to reason about beliefs of the world. These problems are currently addressed with large scale fine-tuning or by delegating consistent reasoning to external tools. In this work, we strive for a middle ground and introduce a training objective based on principled probabilistic reasoning that teaches a LLM to be consistent with external knowledge in the form of a set of facts and rules. Fine-tuning with our loss on a limited set of facts enables our LLMs to be more logically consistent than previous baselines and allows them to extrapolate to unseen but semantically similar factual knowledge more systematically.
AI Policy

An open source perspective on AI and alignment with the EU AI Act

Diego Calanzone, Andrea Coppari, Riccardo Tedoldi, and 2 more authors

AISafety/SafeRL@ IJCAI 2023

Abs

Artificial intelligence systems based on deep learning have increasingly received interest due to their success in complex human tasks. A current trend in deep learning is to study how algorithms learn multiple new abilities as their size and training data increase." General purpose AI"(GPAI), that is systems that can transfer the acquired knowledge to solve multiple tasks, are candidate to constitute the backbone of many AI algorithms applied in specific fields on industry, eg healthcare, customer support, administration. While various research laboratories express safety concerns on GPAI and do not openly share access to their algorithms, others advocate for their" democratization" and an increasing amount of open-source versions is available online. In this study we analyze this phenomenon from two perspectives and try to reconcile them. From one side, research communities support open collaborations, free access to knowledge and resources; on the other, political institutions, involved in the orchestration between the support for innovation and the control of societal impact, aim at preventing violations of fundamental human rights. We particularly focus on the European approach for risk assessment of AI systems. In our opinion, it greatly overlaps with work in ethics and law conducted by AI researchers (eg the Stanford Centre for Research on Foundation Models). Specifically we identify some necessary modifications to improve coordination between the two sides, while also discussing viable implementations in the technical field.