Principal Architect - AI Compiler

Microsoft

Biography

My mission is to make Generative AI more efficient: from more efficient foundational models, to more efficient agentic workflows. My efforts can largely be grouped under the theme of AI for AI Systems. I draw from my core background in Reinforcement Learning, Imitation Learning, Planning and Combinatorial Optimization to aid me in this mission. I am an architect at Microsoft working on using evolutionary search and reinforcement learning to optimize kernels for Nvidia, AMD and MAIA (Microsoft AI Accelerator) hardware.

My superpower is leading lean, agile teams of AI researchers and engineers from fundamental research to product:

I built a team of 7 researchers and engineers dedicated to Neural Architecture Search at Microsoft Research, Redmond (2020-2023). My team published multiple NeurIPS, ICLR, ICML high-impact papers. Models from this research serve as efficient, real-time, on-device, text-prediction models for Microsoft Outlook, Word, PowerPoint and Teams which serve billions of queries a month.
I co-invented AirSim (2016-2017), which has become the leading open-source robotics simulator and also spawned an enterprise-grade product at Microsoft.
I led a team at DataRobot (2024-2025) and built Syftr, which automatically searches for the Pareto-frontier of cost, latency and accuracy for agentic workflows.

I also love to build high-quality software, for example:

Archai, a PyTorch-based Neural Architecture Search framework. Models produced by Archai are used by millions worldwide every day and handle billions of queries.
AirSim, a photo-realistic simulator for robotics which is widely used by the community.
Syftr, an automatic agentic workflow optimizer which searches for the Pareto-frontier of cost vs. latency vs. accuracy for agentic tasks.

I finished my PhD at the Robotics Institute, Carnegie Mellon University. My interests include decision-making under uncertainty, reinforcement learning, artificial intelligence and machine learning. My work has been honored with Best Paper of the Year Shortlist at the International Journal of Robotics Research. I give back to the AI community by regularly serving as an Area Chair for ICML, NeurIPS and ICLR.

Interests

Generative AI Efficiency
Neural Architecture Search
AutoML
Reinforcement Learning
Robotics
Planning
Vision

Education

PhD in Robotics, 2015

Carnegie Mellon University

MS in Robotics, 2012

Carnegie Mellon University

Bachelor of Electrical Engineering, 2007

Delhi College of Engineering

Experience

Principal Architect - AI Compiler

Microsoft

Sep 2025 – Present Redmond, WA

Distinguished Deep Learning Researcher

DataRobot

Jun 2024 – Sep 2025 Boston, MA

Principal Researcher

Microsoft

Jul 2023 – Apr 2024 Redmond, Washington

Principal Researcher

Microsoft Research

Aug 2019 – Jun 2023 Redmond, Washington

Senior Researcher

Microsoft Research

Jul 2015 – Aug 2019 Redmond, Washington

PhD Student

Robotics Institute, Carnegie Mellon University

Jul 2010 – Jul 2015 Pittsburgh, Pennsylvania

News

02/2025: Area Chairing ICML 2025
06/2024: Joined DataRobot to form and grow a small research team. Stay tuned for what we are cooking!
08/2023: Started new role in Azure AI Frameworks focused on automated search for graph and kernel schedules for novel NPUs.
02/2023: What Makes Convolutional Models Great on Long Sequence Modeling accepted to ICLR 2023.
09/2022: LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models and AutoDistil accepted at NeurIPS 2022.
06/2022: Colin White and I gave a joint tutorial (slides, video) on Neural Architecture Search: Foundations and Trends at the 1st International Conference on Automated Machine Learning.
04/2022: A Deeper Look into Zero-Cost Proxies accepted to the first peer-reviewed ICLR blog post track.
03/2022: LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
03/2022: One Network Doesn’t Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning
02/2022: Senior Area Chair for the 1st AutoML-Conf 2022.
01/2022: AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
07/2021: Invited talk at AutoML virtual seminar on fast ranking of architectures via their feature extraction capabilities.
06/2021: Area chair for NeurIPS 2021.
06/2021: Preprint on fast ranking of architectures for Neural Architecture Search. Accepted to ICML 2021 workshop on AutoML.
06/2021: ICML paper on making neural networks better utilize hardware by increasing arithmetic intensity!
02/2021: Invited talk at Oregon State University on Neural Architecture Search.
10/2020: Archai is now formally out on Github! Blog
03/2020: Area chair for NeurIPS 2020.
06/2020: Invited talk on Robotics with Vision-in-the-Loop at CVPR 2020 Workshop on Fair, Data-Efficient and Trusted CV
02/2020: MSR podcast on my research journey!
11/2019: Area chair for ICML 2020.
11/2019: Using RL to optimize software pipelines accepted at AAAI 2020.
10/2019: Invited to NSF Panel on Robotics and Speech at UMD.
09/2019: Efficient Forward Architecture Search accepted to NeurIPS 2019.
09/2019: Top 50% reviewer at NeurIPS 2019.
06/2019: MSR blog post on visual navigation via language assistance.
05/2019: Efficient Forward Neural Architecture Search paper and code is public.
04/2019: Metareasoning in Modular Software Systems using RL is public, Real-World RL ICML workshop and AAAI 2020.
03/2019: Paper on visual navigation via language assistance accepted to CVPR 2019.
02/2019: Outstanding reviewer award ICLR 2019.
01/2019: Invited to CCC-NSF Robotics and Learning Workshop in San Francisco.
10/2018: Invited talk on Interactive Machine Learning at UMD.
10/2018: Two papers accepted at AAAI 2019. Anytime Neural Networks selected for oral presentation.
10/2018: Top reviewer award NeurIPS 2018.
09/2018: Invited talk on Robotics and Imitation Learning at New York University.
09/2018: Invited talk on Imitation Learning at Reinforcement Learning Day at MSR New York.
08/2018: Organizer of session on ‘AI for AI Systems’ at MSR Faculty Summit 2018.
07/2018: Invited talk at UW-MSR Summer Retreat on Social Robotics.
06/2018: Paper on Learning 3D View Utilities accepted at ECCV 2018.
06/2018: Invited talk at RSS Workshop on Resilient Robotics.
02/2018: Paper on Blind Spots in RL accepted to AAMAS 2018.
02/2018: Journal version of Learning to Gather Information accepted at IJRR.
01/2018: Invited talk at The Robotics Institute, Carnegie Mellon University.
12/2017: Visiting MSR Bangalore.
10/2017: Upcoming invited talk at ICCV 2017 Workshop on Role of Simulation in Computer Vision.
08/2017: Paper on efficient 3D scanning accepted at ICCV 2017.
07/2017: Paper describing AirSim accepted at FSR 2017.
06/2017: Invited talk at International Symposium on Aerial Vehicles at University of Pennsylvania.
05/2017: Paper on efficient route planning leveraging multi-armed bandits accepted at ICML 2017.
04/2017: Paper on adaptive information gathering accepted at RSS 2017.
03/2017: Paper on UAV tracking using flight dynamics accepted for oral presentation at CVPR 2017.
02/2017: We released open-source photo-realistic robotics simulator AirSim.
01/2017: Two papers accepted at ICRA 2017.
12/2016: Sponsorship and Publicity Chair of Conference on Robot Learning.
10/2016: Invited talk at workshop on “Vision-based High Speed Autonomous Navigation of UAVs”, IROS 2016.
08/2016: Invited to NSF-UAS Advisory Board meeting at Dayton, OH.
07/2016: Co-organized workshop on “Safe-Cyber Physical Systems” at Faculty Summit, Microsoft Research.
06/2016: Presented at RSS Workshop on Task and Motion Planning at University of Michigan, Ann Arbor.
10/2015: Trajectory optimization for Team Chambliss at Red Bull Air Race at Dallas, TX.
08/2015: Joined Microsoft Research.
07/2015: Defended PhD thesis at Carnegie Mellon University.

Interns

Aditya Modi

University of Michigan, Summer 2018

Alex LaGrassa

CMU, Summer 2020

Angela Lin

University of Texas, Summer 2019

Artem Rozantsov

EPFL, Summer 2016

Benjamin Hepp

ETH Zurich, Summer 2017

Brian Axelrod

Stanford University, Summer 2016

Dilip Arumugam

Stanford University, Summer 2019

Elizabeth Bondi

Harvard University, Fall 2017

Felix Berkenkamp

ETH Zurich, Summer 2017

Francisco Garcia

University of Massachusetts, Fall 2016

Ganesh Jawahar

UBC, Summer 2021

Hanzhang Hu

CMU, Summer 2018

Khanh Nguyen

UMD, Summer 2018

Mike Roberts

Stanford University, Summer 2016, 2017

Mojan Javaheripi

UCSD, Summer 2021

Ramya Ramakrishnan

MIT, Summer 2017, 2018

Sanjiban Choudhury

CMU, Summer 2016

Shushman Choudhury

Stanford University, Summer 2020

Simon Ramstedt

MILA, Summer 2017

Tianle Cai

Princeton, Summer 2022

Wen Sun

CMU, Summer 2016

Yuhong Li

UIUC, Summer 2022

Selected Publications

Yuhong Li, Tianle Cai, Yi Zhang, Deming Chen, Debadeepta Dey

October 2022 ArXiv

What Makes Convolutional Models Great on Long Sequence Modeling?

Convolutional models have been widely used in multiple domains. However, most existing models only use local convolution, making the model unable to handle long-range dependency efficiently. Attention overcomes this problem by aggregating global information based on the pair-wise attention score but also makes the computational complexity quadratic to the sequence length. Recently, Gu et al. [2021a] proposed a model called S4 inspired by the state space model. S4 can be efficiently implemented as a global convolutional model whose kernel size equals the input sequence length. With Fast Fourier Transform, S4 can model much longer sequences than Transformers and achieve significant gains over SoTA on several long-range tasks. Despite its empirical success, S4 is involved. It requires sophisticated parameterization and initialization schemes that combine the wisdom from several prior works. As a result, S4 is less intuitive and hard to use for researchers with limited prior knowledge. Here we aim to demystify S4 and extract basic principles that contribute to the success of S4 as a global convolutional model. We focus on the structure of the convolution kernel and identify two critical but intuitive principles enjoyed by S4 that are sufficient to make up an effective global convolutional model: 1) The parameterization of the convolutional kernel needs to be efficient in the sense that the number of parameters should scale sub-linearly with sequence length. 2) The kernel needs to satisfy a decaying structure that the weights for convolving with closer neighbors are larger than the more distant ones. Based on the two principles, we propose a simple yet effective convolutional model called Structured Global Convolution (SGConv). SGConv exhibits strong empirical performance over several tasks: 1) With faster speed, SGConv surpasses S4 on Long Range Arena and Speech Command datasets. 2) When plugging SGConv into standard language and vision models, it shows the potential to improve both efficiency and performance.
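The two principles in this abstract (sub-linear parameter count and a decaying kernel) can be illustrated with a small sketch. It is not the SGConv implementation: the function names, the nearest-neighbour upsampling, the doubling schedule and the decay factor `alpha` are all assumptions chosen for illustration. A long kernel is assembled from a few short parameter vectors, each stretched over a progressively longer span and scaled by a progressively smaller weight.

```python
def build_kernel(params, alpha=0.5):
    """Assemble a long convolution kernel from a few short parameter vectors.

    params: list of equal-length lists, one per scale (hypothetical layout).
    Scale i is stretched by 2**max(i - 1, 0) and weighted by alpha**i, so
    later (more distant) positions get smaller weights.
    """
    kernel = []
    for i, base in enumerate(params):
        repeat = 2 ** max(i - 1, 0)   # scales 0 and 1 keep their base length
        weight = alpha ** i           # decay with distance
        for w in base:
            # Nearest-neighbour upsampling: repeat each parameter `repeat` times.
            kernel.extend([weight * w] * repeat)
    return kernel


def causal_conv(x, kernel):
    """Direct causal convolution: y[t] = sum_j kernel[j] * x[t - j]."""
    out = []
    for t in range(len(x)):
        span = min(t + 1, len(kernel))
        out.append(sum(kernel[j] * x[t - j] for j in range(span)))
    return out


# 4 scales of 2 parameters each (8 parameters) yield a kernel of length 16.
params = [[1.0, 1.0] for _ in range(4)]
kernel = build_kernel(params, alpha=0.5)
response = causal_conv([1.0, 0.0, 0.0, 0.0], kernel)
```

With `d` parameters per scale and `S` scales, the kernel length is `d * 2**(S - 1)` while the parameter count is only `d * S`, so parameters grow logarithmically with the kernel length. A practical implementation would replace the direct convolution with an FFT-based one.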

PDF

Mojan Javaheripi, Gustavo H. de Rosa, Subhabrata Mukherjee, Shital Shah, Tomasz L. Religa, Caio C. T. Mendes, Sebastien Bubeck, Farinaz Koushanfar, Debadeepta Dey

March 2022 NeurIPS

LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models

The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models. However, finding architectures with the optimal trade-off between task performance (perplexity) and hardware constraints like peak memory utilization and latency is non-trivial. This is exacerbated by the proliferation of various hardware. We leverage the somewhat surprising empirical observation that the number of decoder parameters in autoregressive Transformers has a high rank correlation with task performance, irrespective of the architecture topology. This observation organically induces a simple Neural Architecture Search (NAS) algorithm that uses decoder parameters as a proxy for perplexity without need for any model training. The search phase of our training-free algorithm, dubbed Lightweight Transformer Search (LTS), can be run directly on target devices since it does not require GPUs. Using on-target-device measurements, LTS extracts the Pareto-frontier of perplexity versus any hardware performance cost. We evaluate LTS on diverse devices from ARM CPUs to NVIDIA GPUs and two popular autoregressive Transformer backbones: GPT-2 and Transformer-XL. Results show that the perplexity of 16-layer GPT-2 and Transformer-XL can be achieved with up to 1.5x, 2.5x faster runtime and 1.2x, 2.0x lower peak memory utilization. When evaluated in zero and one-shot settings, LTS Pareto-frontier models achieve higher average accuracy compared to the 350M parameter OPT across 14 tasks, with up to 1.6x lower latency. LTS extracts the Pareto-frontier in under 3 hours while running on a commodity laptop. We effectively remove the carbon footprint of hundreds of GPU hours of training during search, offering a strong simple baseline for future NAS methods in autoregressive language modeling.
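The search idea described above, using decoder parameter count as a training-free proxy for perplexity and pairing it with on-device measurements, amounts to extracting a Pareto-frontier over candidate architectures. The following is a minimal sketch of that extraction step only, not the LTS code: the `Candidate` type, its field names and the numbers in the usage example are hypothetical.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    name: str
    decoder_params: int   # proxy: more decoder parameters ~ better perplexity
    latency_ms: float     # assumed to be measured on the target device


def pareto_frontier(candidates: List[Candidate]) -> List[Candidate]:
    """Return candidates that no other candidate beats on both axes."""
    # Sort by latency (ascending); on ties, put the larger model first.
    ordered = sorted(candidates, key=lambda c: (c.latency_ms, -c.decoder_params))
    frontier = []
    best_params = -1
    for c in ordered:
        # Keep c only if it has strictly more parameters than every
        # candidate that is at least as fast.
        if c.decoder_params > best_params:
            frontier.append(c)
            best_params = c.decoder_params
    return frontier


# Illustrative values only; not measurements from the paper.
pool = [
    Candidate("A", 10_000_000, 5.0),
    Candidate("B", 20_000_000, 8.0),
    Candidate("C", 15_000_000, 9.0),   # slower and smaller than B: dominated
    Candidate("D", 30_000_000, 12.0),
]
frontier_names = [c.name for c in pareto_frontier(pool)]
```

Because the proxy needs no training, the only expensive step in a loop like this is the latency measurement, which is consistent with the abstract's claim that the search can run on the target device itself.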

PDF

Hanzhang Hu, John Langford, Rich Caruana, Saurajit Mukherjee, Eric J Horvitz, Debadeepta Dey

November 2019 NeurIPS

Efficient forward architecture search

We propose a neural architecture search (NAS) algorithm, Petridish, to iteratively add shortcut connections to existing network layers. The added shortcut connections effectively perform gradient boosting on the augmented layers. The proposed algorithm is motivated by the feature selection algorithm forward stage-wise linear regression, since we consider NAS as a generalization of feature selection for regression, where NAS selects shortcuts among layers instead of selecting features. In order to reduce the number of trials of possible connection combinations, we train jointly all possible connections at each stage of growth while leveraging feature selection techniques to choose a subset of them. We experimentally show this process to be an efficient forward architecture search algorithm that can find competitive models using few GPU days in both the search space of repeatable network modules (cell-search) and the space of general networks (macro-search). Petridish is particularly well-suited for warm-starting from existing models, which is crucial for lifelong-learning scenarios.

PDF Code

Sanjiban Choudhury, Mohak Bhardwaj, Sankalp Arora, Ashish Kapoor, Gireeja Ranade, Sebastian Scherer, Debadeepta Dey

January 2018 IJRR

Data-driven planning via imitation learning

IJRR Best Paper of the Year Shortlist

PDF

Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

January 2017 Field and Service Robotics

AirSim: High-fidelity visual and physical simulation for autonomous vehicles

Developing and testing algorithms for autonomous vehicles in real world is an expensive and time consuming process. Also, in order to utilize recent advances in machine intelligence and deep learning we need to collect a large amount of annotated training data in a variety of conditions and environments. We present a new simulator built on Unreal Engine that offers physically and visually realistic simulations for both of these goals. Our simulator includes a physics engine that can operate at a high frequency for real-time hardware-in-the-loop (HITL) simulations with support for popular protocols (e.g. MavLink). The simulator is designed from the ground up to be extensible to accommodate new types of vehicles, hardware platforms and software protocols. In addition, the modular design enables various components to be easily usable independently in other projects. We demonstrate the simulator by first implementing a quadrotor as an autonomous vehicle and then experimentally comparing the software components with real-world flights.

PDF Code Video

Debadeepta Dey

July 2015 CMU-RI-TR-15-18

Predicting Sets and Lists: Theory and Practice

Increasingly, real world problems require multiple predictions while traditional supervised learning techniques focus on making a single best prediction. For instance, in advertisement placement on the web, a list of advertisements is placed on a page with the objective of maximizing click-through rate on that list. In this work, we build an efficient framework for making sets or lists of predictions where the objective is to optimize any utility function which is (monotone) submodular over a list of predictions. Other examples of tasks where multiple predictions are important include: grasp selection in robotic manipulation, where the robot arm must evaluate a list of grasps with the aim of finding a successful grasp as early on in the list as possible, and trajectory selection for mobile ground robots, where given the computational time limits, the task is to select a list of trajectories from a much larger set of feasible trajectories for minimizing expected cost of traversal. In computer vision tasks like frame-to-frame target tracking in video, multiple hypotheses about the target location and pose must be considered by the tracking algorithm. For each of these cases, we optimize for the content and order of the list of predictions. Crucially, and in contrast with existing work on list prediction, our approach to predicting lists is based on very simple reductions of the problem of predicting lists to a series of simple classification/regression tasks. This provides powerful flexibility to use any existing prediction method while ensuring rigorous guarantees on prediction performance. We analyze these meta-algorithms for list prediction in both the online, no-regret and generalization settings. Furthermore, we extend the methods to make multiple predictions in structured output domains where even a single prediction is a combinatorial object, e.g., challenging vision tasks like semantic scene labeling and monocular pose estimation. We conclude with case studies that demonstrate the power and flexibility of these reductions in problems from document summarization, prediction of the pose of humans in images, to predicting the best set of robotic grasps and purely vision based autonomous flight in densely cluttered environments.
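For readers unfamiliar with list construction under a monotone submodular utility, the classic greedy procedure is a useful reference point: fill the list one slot at a time, each time adding the item with the largest marginal gain. The sketch below shows only that greedy baseline on a toy coverage utility. It is not the learned, reduction-based method of the thesis, and the function names and data are hypothetical.

```python
def coverage(selected, covers):
    """Toy monotone submodular utility: number of distinct elements covered."""
    covered = set()
    for item in selected:
        covered |= covers[item]
    return len(covered)


def greedy_list(covers, budget):
    """Build an ordered list by repeatedly adding the best marginal-gain item."""
    chosen = []
    for _ in range(budget):
        base = coverage(chosen, covers)
        best_item, best_gain = None, 0
        for item in covers:
            if item in chosen:
                continue
            gain = coverage(chosen + [item], covers) - base
            if gain > best_gain:      # ties keep the earliest candidate
                best_item, best_gain = item, gain
        if best_item is None:         # nothing adds value any more
            break
        chosen.append(best_item)
    return chosen


# Each candidate "covers" a set of outcomes; values are illustrative only.
covers = {"a": {1, 2, 3}, "b": {3, 4}, "c": {1, 2}, "d": {5}}
ordered = greedy_list(covers, budget=3)
```

Here item `c` is never chosen because everything it covers is already covered by `a`, which is the diminishing-returns behaviour that makes ordering within the list matter.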

PDF

Debadeepta Dey, Tian Yu Liu, Martial Hebert, J Andrew Bagnell

January 2012 RSS

Contextual Sequence Prediction with Application to Control Library Optimization

Sequence optimization, where the items in a list are ordered to maximize some reward, has many applications such as web advertisement placement, search, and control libraries in robotics. Previous work in sequence optimization produces a static ordering that does not take any features of the item or context of the problem into account. In this work, we propose a general approach to order the items within the sequence based on the context (e.g., perceptual information, environment description, and goals). We take a simple, efficient, reduction-based approach where the choice and order of the items is established by repeatedly learning simple classifiers or regressors for each “slot” in the sequence. Our approach leverages recent work on submodular function maximization to provide a formal regret reduction from submodular sequence optimization to simple cost-sensitive prediction. We apply our contextual sequence prediction algorithm to optimize control libraries and demonstrate results on two robotics problems: manipulator trajectory prediction and mobile robot path planning.

PDF

Debadeepta Dey, Christopher Geyer, Sanjiv Singh, Matthew Digioia

January 2011 IJRR

A cascaded method to detect aircraft in video imagery

PDF Extended Technical Report

All Publications

What Makes Convolutional Models Great on Long Sequence Modeling?

Convolutional models have been widely used in multiple domains. However, most existing models only use local convolution, making the …

Yuhong Li, Tianle Cai, Yi Zhang, Deming Chen, Debadeepta Dey

PDF

AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models

Knowledge distillation (KD) methods compress large models into smaller students with manually-designed student architectures given …

Dongkuan Xu, Subhabrata (Subho) Mukherjee, Xiaodong Liu, Debadeepta Dey, Wenhui Wang, Xiang Zhang, Ahmed H. Awadallah, Jianfeng Gao

PDF

LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models

The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models. However, finding …

Mojan Javaheripi, Gustavo H. de Rosa, Subhabrata Mukherjee, Shital Shah, Tomasz L. Religa, Caio C. T. Mendes, Sebastien Bubeck, Farinaz Koushanfar, Debadeepta Dey

PDF

FEAR: Ranking Architectures by their Feature Extraction Capabilities

Debadeepta Dey, Shital Shah, Sebastien Bubeck

PDF Code ArXiv

A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks

Angela S Lin, Sudha Rao, Asli Celikyilmaz, Elnaz Nouri, Chris Brockett, Debadeepta Dey, Bill Dolan

PDF

Blind Spot Detection for Safe Sim-to-Real Transfer

Ramya Ramakrishnan, Ece Kamar, Debadeepta Dey, Eric Horvitz, Julie Shah

PDF

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

Assemblies of modular subsystems are being pressed into service to perform sensing, reasoning, and decision making in high-stakes, …

Aditya Modi, Debadeepta Dey, Alekh Agarwal, Adith Swaminathan, Besmira Nushi, Sean Andrist, Eric Horvitz

PDF

Efficient forward architecture search

We propose a neural architecture search (NAS) algorithm, Petridish, to iteratively add shortcut connections to existing network layers. …

Hanzhang Hu, John Langford, Rich Caruana, Saurajit Mukherjee, Eric J Horvitz, Debadeepta Dey

PDF Code

Anytime neural networks via joint optimization of auxiliary losses

This work considers the trade-off between accuracy and test-time computational cost of deep neural networks (DNNs) via anytime …

Hanzhang Hu, Debadeepta Dey, J Andrew Bagnell, Martial Hebert

PDF

Overcoming blind spots in the real world: Leveraging complementary abilities for joint execution

Simulators are being increasingly used to train agents before deploying them in real-world environments. While training in simulation …

Ramya Ramakrishnan, Ece Kamar, Besmira Nushi, Debadeepta Dey, Julie Shah, Eric Horvitz

PDF

Reparameterized Variational Divergence Minimization for Stable Imitation

Dilip Arumugam, Debadeepta Dey, Alekh Agarwal, Asli Celikyilmaz, Elnaz Nouri, Bill Dolan

PDF

Vision-based Navigation with Language-based Assistance via Imitation Learning with Indirect Intervention

We present Vision-based Navigation with Language-based Assistance (VNLA), a grounded vision-language task where an agent with visual …

Khanh Nguyen, Debadeepta Dey, Chris Brockett, Bill Dolan

PDF Code

AirSim-W: A simulation environment for wildlife conservation with UAVs

Elizabeth Bondi, Debadeepta Dey, Ashish Kapoor, Jim Piavis, Shital Shah, Fei Fang, Bistra Dilkina, Robert Hannaford, Arvind Iyer, Lucas Joppa, et al.

Data-driven planning via imitation learning

IJRR Best Paper of the Year Shortlist

Sanjiban Choudhury, Mohak Bhardwaj, Sankalp Arora, Ashish Kapoor, Gireeja Ranade, Sebastian Scherer, Debadeepta Dey

PDF

Discovering blind spots in reinforcement learning

Agents trained in simulation may make errors in the real world due to mismatches between training and execution environments. These …

Ramya Ramakrishnan, Ece Kamar, Debadeepta Dey, Julie Shah, Eric Horvitz

PDF

Learn-to-score: Efficient 3d scene exploration by predicting view utility

Camera equipped drones are nowadays being used to explore large scenes and reconstruct detailed 3D maps. When free space in the scene …

Benjamin Hepp, Debadeepta Dey, Sudipta N Sinha, Ashish Kapoor, Neel Joshi, Otmar Hilliges

PDF

Near Real-Time Detection of Poachers from Drones in AirSim.

Elizabeth Bondi, Ashish Kapoor, Debadeepta Dey, James Piavis, Shital Shah, Robert Hannaford, Arvind Iyer, Lucas Joppa, Milind Tambe

Submodular trajectory optimization for aerial 3d scanning

Drones equipped with cameras are emerging as a powerful tool for large-scale aerial 3D scanning, but existing automatic flight planners …

Mike Roberts, Debadeepta Dey, Anh Truong, Sudipta Sinha, Shital Shah, Ashish Kapoor, Pat Hanrahan, Neel Joshi

PDF

Adaptive information gathering via imitation learning

In the adaptive information gathering problem, a policy is required to select an informative sensing location using the history of …

Sanjiban Choudhury, Ashish Kapoor, Gireeja Ranade, Sebastian Scherer, Debadeepta Dey

PDF

Flight dynamics-based recovery of a UAV trajectory using ground cameras

Artem Rozantsev, Sudipta N Sinha, Debadeepta Dey, Pascal Fua

PDF

Learning to gather information via imitation

The budgeted information gathering problem - where a robot with a fixed fuel budget is required to maximize the amount of information …

Sanjiban Choudhury, Ashish Kapoor, Gireeja Ranade, Debadeepta Dey

PDF

No-regret replanning under uncertainty

Wen Sun, Niteesh Sood, Debadeepta Dey, Gireeja Ranade, Siddharth Prakash, Ashish Kapoor

Safety-aware algorithms for adversarial contextual bandit

In this work we study the safe sequential decision making problem under the setting of adversarial contextual bandits with sequential …

Wen Sun, Debadeepta Dey, Ashish Kapoor

PDF

AirSim: High-fidelity visual and physical simulation for autonomous vehicles

Developing and testing algorithms for autonomous vehicles in real world is an expensive and time consuming process. Also, in order to …

Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

PDF Code Video

Vision and learning for deliberative monocular cluttered flight

Cameras provide a rich source of information while being passive, cheap and lightweight for small and medium Unmanned Aerial Vehicles …

Debadeepta Dey, Kumar Shaurya Shankar, Sam Zeng, Rupesh Mehta, M Talha Agcayazi, Christopher Eriksen, Shreyansh Daftry, Martial Hebert, J Andrew Bagnell

PDF Video

Predicting Sets and Lists: Theory and Practice

Increasingly, real world problems require multiple predictions while traditional supervised learning techniques focus on making a …

Debadeepta Dey

PDF

Predicting multiple structured visual interpretations

We present a simple approach for producing a small number of structured visual outputs which have high recall, for a variety of tasks …

Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J Andrew Bagnell

PDF

Gauss Meets Canadian Traveler: Shortest-Path Problems with Correlated Natural Dynamics

In a variety of real world problems from robot navigation to logistics, agents face the challenge of path optimization on a graph with …

Debadeepta Dey, Andrey Kolobov, Rich Caruana, Ece Kamar, Eric Horvitz, Ashish Kapoor

PDF

Knapsack constrained contextual submodular list prediction with application to multi-document summarization

Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a …

Jiaji Zhou, Stephane Ross, Yisong Yue, Debadeepta Dey, J Andrew Bagnell

PDF

Learning monocular reactive uav control in cluttered natural environments

Autonomous navigation for large Unmanned Aerial Vehicles (UAVs) is fairly straight-forward, as expensive sensors and monitoring devices …

Stéphane Ross, Narek Melik-Barkhudarov, Kumar Shaurya Shankar, Andreas Wendel, Debadeepta Dey, J Andrew Bagnell, Martial Hebert

PDF

Classification of plant structures from uncalibrated image sequences

This paper demonstrates the feasibility of recovering fine-scale plant structure in 3D point clouds by leveraging recent advances in …

Debadeepta Dey, Lily Mummert, Rahul Sukthankar

PDF

Contextual Sequence Prediction with Application to Control Library Optimization

Sequence optimization, where the items in a list are ordered to maximize some reward has many applications such as web advertisement …

Debadeepta Dey, Tian Yu Liu, Martial Hebert, J Andrew Bagnell

PDF

A cascaded method to detect aircraft in video imagery

Debadeepta Dey, Christopher Geyer, Sanjiv Singh, Matthew Digioia

PDF Extended Technical Report

Efficient Optimization of Control Libraries

A popular approach to high dimensional control problems in robotics uses a library of candidate “maneuvers” or “trajectories”. The …

Debadeepta Dey, Tian Yu Liu, Boris Sofman, Drew Bagnell

PDF

Passive, long-range detection of aircraft: towards a field deployable sense and avoid system

Debadeepta Dey, Christopher Geyer, Sanjiv Singh, Matt Digioia

PDF

Prototype sense-and-avoid system for UAVs

C Geyer, Debadeepta Dey, Sanjiv Singh
