Nikolaos Dimitriadis

Nikolaos Dimitriadis

Research Scientist

EPFL

Hello! I am Nikos

I am a research scientist specializing in post-training, model merging, and multi-task learning for foundation models. I completed my PhD at École Polytechnique Fédérale de Lausanne (EPFL), under the supervision of François Fleuret and Pascal Frossard. During my PhD, I completed two research internships at Google DeepMind, spanning LLM post-training and text-to-image generation. My work focuses on understanding weight-space and loss-landscape geometry to design efficient, scalable post-training and model-editing methods for large foundation models.

Before coming to Switzerland, I completed my undergraduate studies in Electrical and Computer Engineering at the National Technical University of Athens in Greece. I conducted my thesis (available here in Greek) at the CVSP lab under the supervision of Petros Maragos. The focus lied on using tropical geometry to analyze Morphological Neural Networks, studying the sparsity of their representations compared to their linear counterparts, their ability to enforce shape constraints such as monotonicity, and extending a training algorithm based on Difference-of-Convex Programming to multiclass problems.

I am also an avid classical guitar player! I love playing Baroque and romantic pieces, such as compositions by Agustín Barrios Mangoré. Check out this beautiful performance!

Download my resumé .

Interests
  • Post-Training
  • Model Merging
  • Multi-Task Learning
  • Model Editing
Education
  • PhD in Computer Science

    École Polytechnique Fédérale de Lausanne

  • MEng in Electrical Engineering and Computer Science, 2020

    National Technical University of Athens

Publications

Quickly discover relevant content by filtering publications.
(2026). Learning Many Tasks via Weight-Space Geometry. EPFL PhD Thesis (EPFL).

Cite DOI

(2026). Model Soups Need Only One Ingredient. International Conference on Machine Learning (ICML).

Cite URL

(2025). MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs. Advances in Neural Information Processing Systems (NeurIPS).

Cite URL

(2025). Pareto Low-Rank Adapters: Efficient Multi-Task Learning with Preferences. International Conference on Learning Representations (ICLR).

Cite PDF

(2025). Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning. arXiv preprint arXiv:2504.11268.

Cite

(2024). LiNeS: Post-training layer scaling prevents forgetting and enhances model merging. International Conference on Learning Representations (ICLR).

Cite PDF GitHub Repository Website

(2024). Localizing Task Information for Improved Model Merging and Compression. International Conference on Machine Learning (ICML).

Cite PDF Website GitHub repository

(2023). Benefits of Max Pooling in Neural Networks: Theoretical and Experimental Evidence. Transactions on Machine Learning Research (TMLR).

PDF Cite URL

(2023). Pareto Manifold Learning: Tackling Multiple Tasks via Ensembles of Single-Task Models. International Conference on Machine Learning (ICML).

PDF Cite Poster Slides GitHub repository

(2023). SequeL: A Continual Learning Library in PyTorch and JAX. CVPR Workshop on Continual Learning (CVPR-WCL).

PDF Cite Website GitHub repository

(2023). Flexible Channel Dimensions for Differentiable Architecture Search. arXiv (Preprint).

PDF Cite arXiv

(2022). U-Boost NAS: Utilization-boosted Differentiable Neural Architecture Search. European Conference on Computer Vision (ECCV).

PDF Cite Poster Slides Video GitHub repository

(2021). Advances in Morphological Neural Networks: Training, Pruning and Enforcing Shape Constraints. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

PDF Cite Poster Slides Video

Contact

dimitriadisnikolaos0[at]gmail.com