Flemming Kondrup

Artificial Intelligence Researcher

Hi! I am a PhD student at Mila and McGill University, supervised by Doina Precup and Lars Grant and advised by Joelle Pineau. I previously completed a BSc at McGill University, where I received the First-Class Honours distinction and the Dean’s Multidisciplinary Undergraduate Research List distinction. During my degree, I completed five research internships, working with Doina Precup, David Juncker, Gabriel Venne and Peter Metrakos.

My research interests center on developing Safe AI Agents, with two primary aims:

Teaching Agents to adapt smoothly to human workflows, preferences, and web navigation via Reinforcement Learning.
Evaluating the risks in scaling autonomous Agents, with emphasis on identifying misalignment and hidden vulnerabilities.

I am a recipient of the:

Work Experience:

Machine Learning Intern – ServiceNow, Montreal, Canada

Developing LLM-based agents for computer use and web navigation, focusing on: (1) methods for adapting to human workflows and preferences; (2) safety, through detection pipelines for hidden backdoors that combine reasoning-trace and activation-level anomaly analysis, and through the study of misalignment and collusion in multi-agent systems.

Summer 2025 | Mentor: Gabriel Huang

Machine Learning Intern – Dialogue, Montreal, Canada

Led the successful deployment of Generative Vision-Language Models (VLMs) in production to automate patient photo verification for telemedicine, boosting classification accuracy by 17% and streamlining the intake process. Also developed a novel LLM-powered symptom intake system to reduce patient input time and improve triage.

Jan – Apr 2025 | Mentor: Alexis Smirnov

Research Intern – Royal Victoria Hospital, Montreal, Canada

Applied computational analyses of immune cells to uncover immunotherapy targets in hepatocellular carcinoma and cholangiocarcinoma, elucidating tumor environment dynamics to inform new liver cancer treatment strategies.

Sept 2020 – Aug 2021 | Mentor: Peter Metrakos

Selected Research

Improving agent decision-making in complex environments:

Cracking the Code of Action: A Generative Approach to Affordances for Reinforcement Learning

Leveraging VLMs to guide RL agents and improve decision making in high-dimensional action spaces

Paper

Forecaster: Temporally Abstract Tree-Search Planning from Pixels

Hierarchical RL with abstract world models for tree-search planning

Paper

AI for Healthcare:

Transferrable Model-Based RL for Personalized Insulin Therapy

Combining LSTM forecasting and RL for individualized treatment

Prospective Publication

Safe Mechanical Ventilation Using Deep Conservative Q-Learning

DeepVent, an offline AI agent for safely optimizing ventilator settings

Paper

Service and Leadership

In 2022, I captained McGill’s team in Project X, an AI research competition organized by the University of Toronto, with competitors from top academic institutions across North America and renowned sponsors (Google, IBM, Moderna, etc.). Our work on Deep Conservative Reinforcement Learning for Mechanical Ventilation received the highest score of all 25 submitted papers, winning the competition and its $25,000 award and leading to press interviews with The Tribune and The McGill Reporter.

In 2023–2024, I served as the Executive Director of the McGill Student Emergency Response Team (MSERT), overseeing a team of over 70 responders dedicated to providing emergency medical aid. In addition to managing a $100,000 budget and supervising a 7-member executive board, I facilitated communication between MSERT, the McGill University administration, and governmental agencies. My approach emphasized thoughtful leadership and a collaborative team dynamic, enabling MSERT to expand its services and educational outreach. Over the past five years, I have also volunteered as a responder, contributing more than 2,000 hours to the team’s efforts.