Nicolò De Sabbata

Hello there! Welcome to my corner of the web. I'm an engineer and researcher passionate about deep learning, reverse-engineering human intelligence, and exploring practical applications of AI in the real world. Currently, I'm doing an internship as a Machine Learning Engineer at Apple in Barcelona.

My research interests lie in improving our understanding of the architecture and learning dynamics of contemporary AI systems and leveraging these insights to develop the next generation of intelligent agents. Specifically, my previous work has focused on two primary goals: (i) understanding the capabilities and limitations of LLMs through comparisons with the human mind, and (ii) addressing these limitations to build agents that can learn, reason, and communicate with efficiency and flexibility.

Previously, I was a research intern in the NLP group, working under the supervision of Prof. Antoine Bosselut, at EPFL, where I earned my Master's degree in Computer Science. Before that, I was a Visiting Student Researcher at the Computational Cognitive Science Lab at Princeton University, collaborating with Prof. Tom Griffiths. My industry experience includes internships at Amazon in Luxembourg, where I automated ETL pipelines monitoring for ATS, and at Axa in Lausanne, where I designed and implemented LLM-based agents to automate the processing of insurance claims.

Email / CV / GitHub / Linkedin / Google Scholar / Twitter

Publications

	Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization Badr AlKhamissi, Nicolò De Sabbata, Zeming Chen, Martin Schrimpf, Antoine Bosselut ArXiv preprint Inspired by neuroscience, we propose Mixture of Cognitive Reasoners (MICRO), a modular transformer model divided into four specialized expert modules, each corresponding to a distinct cognitive brain network. MICRO achieves enhanced interpretability, superior performance across seven reasoning benchmarks, and fine-grained controllability by emphasizing specific modules during inference.
	Rational Metareasoning for Large Language Models Nicolò De Sabbata, Ted Sumers, Thomas L. Griffiths. Published at S2RAS & BML, NeurIPS 2024 We develop a novel method to optimize reasoning in large language models using a novel reward function based on the Value of Computation, enabling models to selectively use intermediate reasoning steps only when necessary. We demonstrate the effectiveness of this approach across diverse datasets, reducing inference costs by up to 45% while maintaining or increasing task performance.
	Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem Declan I. Campbell, Sunayana Rane, Tyler Giallanza, Nicolò De Sabbata, Kia Ghods, Amogh Joshi, Alexander Ku, Jonathan D. Cohen, Thomas L. Griffiths, Taylor W. Webb. Published at NeurIPS 2024. We investigate vision-language models' limitations in multi-object reasoning, linking them to cognitive science's binding problem and trade-offs between flexibility and capacity. We demonstrate that VLMs face human-like limits in multi-object scene processing in tasks like counting, localization, and visual analogy.

University Projects

	AI-Powered Educational Chatbot A GPT-2 model fine-tuned on demonstrations distilled from a 100B - parameter scale LLM, with generation quality enhanced by a reward model trained via reinforcement learning with human feedback (RLHF).
	Noise2Noise Deep learning network Implemented a deep convolutional encoder-decoder neural network for image denoising in PyTorch, trained without a clean reference image.
	Robust public transport route planner A route planner that calculates the fastest route based on desired arrival times and analyzes historical delay data for accurate predictions using Apache Spark on Swiss Federal Railways (SBB) data.
	Distributed Movie Recommendation System Developed a Spark-based distributed recommendation system in Scala, processing millions of reviews. Implemented an approximate and distributed K-Nearest Neighbors algorithm to enhance recommendation precision.
	Political affiliation NLP predictor Processed millions of quotes using statistical and sentiment analysis, followed by topic clustering with a pre-trained BERT encoder to predict the speaker's political party affiliation. Created and presented comprehensive visualizations of the results through a website, data story, and plots.
	Deep Learning for Parkinson's Disease recognition Explored the feasibility of Neural Networks models for the quantitative discrimination between different conformational species of proteins.
	ML Classifier on CERN laboratory data Built a classifier using various ML models to predict Higgs boson decay signatures from background noise, achieving a categorical accuracy of 0.840 and an F1 score of 0.756, ranking 6th among nearly 300 teams.
	Insurance company & Piccardi Music Completed two data analysis projects to enhance skills in data wrangling, data visualization, regression analysis, observational studies, statistics, and supervised learning.
	Master of Renaissance Developed an online multiplayer board game in Java, playable on both JavaFX GUI and CLI, with robust server features for connection stability. Managed the full development cycle using Agile, from UML modeling to implementation, ensuring scalability, reliability, and security through effective design patterns.
	PoliMusic Developed the backend and frontend of a website that allows users to securely upload songs to a server, ensuring safe database communications.

Website template credits to Jon Barron.