Ege Erdogan

PhD Student at the University of Amsterdam

DSC00157_small.jpg

Hi! I am a PhD student at the University of Amsterdam with Ana Lucic as part of the SIAS, CLC, and AI4Science groups.

My main research interest is mechanistic interpretability with a particular focus towards AI4Science. I am interested in two interrelated questions: how can interpretability of ML models facilitate scientific discovery, and how can scientific knowledge inform the design of interpretability methods?

I previously did my MSc at the Technical University of Munich and wrote my thesis on geometric generative models of neural network weights, advised by David Rügamer and Bastian Rieck.

Feel free to reach out at ege at erdogan.dev.

news

Sep 2025 🎉 Paper accepted at the NeurIPS Mech Interp and UniReps workshops!
Group Equivariance Meets Mechanistic Interpretability: Equivariant Sparse Autoencoders
May 2025 🎉 Paper accepted at FAccT 2025!
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods
May 2025 👨‍⚕️ Started my PhD at the University of Amsterdam!
Mar 2025 🎉 Paper accepted at the ICLR MLMP Workshop!
FreeFlow: Latent Flow Matching for Free Energy Difference Estimation
Jan 2025 🎓 Completed my MSc at TUM! Read my thesis here

selected publications

  1. Mech Interp ’25
    Group Equivariance Meets Mechanistic Interpretability: Equivariant Sparse Autoencoders
    Erdogan, Ege, and Lucic, Ana
    NeurIPS Mechanistic Interpretability and UniReps Workshops 2025
  2. FAccT ’25
    Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods
    Dhaini, Mahdi,  Erdogan, Ege, Feldhus, Nils, and Kasneci, Gjergji
    ACM Conference on Fairness, Accountability, and Transparency 2025
  3. MLMP ’25
    FreeFlow: Latent Flow Matching for Free Energy Difference Estimation
    Erdogan, Ege, Ralev, Radoslav, Rebensburg, Mika, Marquet, Céline, Klein, Leon, and Stark, Hannes
    ICLR Workshop on Machine Learning Multiscale Processes 2025
  4. GLFrontiers ’23
    Poisoning × Evasion: Symbiotic Adversarial Robustness for Graph Neural Networks
    Erdogan, Ege, Geisler, Simon, and Günnemann, Stephan
    New Frontiers in Graph Learning Workshop (at NeurIPS ’23) 2023
  5. WPES ’22
    UnSplit: Data-oblivious model inversion, model stealing, and label inference attacks against split learning
    Erdogan, Ege, Kupcu, Alptekin, and Cicek, A Ercument
    The 21st Workshop on Privacy in the Electronic Society (at ACM CCS ’22) 2022