ITML

Foundations of modern machine learning

We build mathematical frameworks to better understand the success of existing ML algorithms, and devise methods to overcome their limitations.
Our current focus is on foundation models (including large language models and vision-language models) studied across four directions.

Pillar 1

Internal representations & mechanisms of foundation models

We study what foundation models encode and how they compute it internally. This spans the geometry of learned representations, the theory of contrastive and self-supervised learning, compositional reasoning in vision-language models, and the mechanisms by which transformers process information. The goal is a mathematical account of why these models represent the world the way they do.

Pillar 2

Training & adaptation of foundation models

We design principled ways to train, fine-tune, and continually adapt foundation models. Our work covers fine-tuning that preserves in-context learning, the memorization capacity of fine-tuning, continual learning under distribution shift, and language-interfaced fine-tuning for non-language tasks. We aim for adaptation methods backed by provable guarantees rather than purely empirical recipes.

Pillar 3

Trustworthy & efficient foundation models

We make foundation models reliable and affordable to deploy. The aim is trustworthy behavior at lower cost, without giving up capability.

Pillar 4

Applications of foundation models

We bring foundation models into real-world domains alongside domain experts. Current applications include healthcare and psychiatry (cognitive behavioral therapy alignment, psychiatric interviews, cognitive-distortion classification), finance (question answering and retrieval-augmented generation), and law (Korean legal QA). These deployments create real impact while surfacing new theoretical questions.

Earlier research directions

Distributed ML

Distributed and federated learning over networks.

  • Buffer-based Gradient Projection for Continual Federated Learning TMLR · 2025
  • Election Coding for Distributed Learning: Protecting SignSGD against Byzantine Attacks NeurIPS · 2020
  • Attack of the Tails: Yes, You Really Can Backdoor Federated Learning NeurIPS · 2020
  • Hierarchical Coding for Distributed Computing IEEE ISIT · 2018

Information / Communication Theory

Coding theory for storage, computing, and wireless systems.

  • Capacity of Clustered Distributed Storage IEEE Trans. Information Theory · 2019
  • Secure Clustered Distributed Storage Against Eavesdropping IEEE Trans. Information Theory · 2019
  • On Reusing Pilots Across Interfering Cells in Massive MIMO IEEE Trans. Wireless Comm. · 2017
  • Pilot Reuse Strategy Maximizing the Weighted-Sum-Rate in Massive MIMO Systems IEEE JSAC · 2017

Funding

  • · IITP — Next-Generation AI development (2025–2029)
  • · NRF — Basic Research Laboratory (2024–2027)
  • · NRF — Outstanding Young Scientist Award (2024–2027)