Instructors: Prof. Dr. Nassir NavabDr. Shahrooz Faghihroohi, Dr. Azade Farshad, Yousef Yeganeh


Time: TBA

Registration

Announcements

Introduction

  • Deep Learning is growing tremendously in Computer Vision and Medical Imaging as well. Highly impacted journals in the medical imaging community, i.e. IEEE Transaction on Medical Imaging, recently published their special edition on Deep Learning [1]. The Seminar will propose a list of recent scientific articles related to the main current research topics in deep learning for Medical Applications, together with some interesting papers from other communities (CVPR, NeurIPS, ICCV, ICLR, ICML, ...).

Course Structure

In this Master Seminar (Hauptseminar), students select one scientific topic from the list provided by course organizers. The students should read the proposed sample papers by the tutors, find the topic-related articles, summarize and compare them in their presentation and blogpost:

  • Presentation: The selected paper is presented to the other participants (Maximum 25 minutes presentation, 10 minutes questions). You can use the CAMP templates for PowerPoint TUM-Template.pptx.
  • Blog Post: A blog post of 3000-3500 words excluding references, should be submitted before the deadline. The blog post must include all references used and must be written completely in your own words. Copy and paste will not be tolerated.
  • Attendance: Participants have to participate actively in all seminar sessions. Each presentation is followed by a discussion, and everyone is encouraged to actively participate.

Submission Deadline: You have to submit the blog post one week before the first presentation date and can modify it a bit until the last session of the course.

Schedule

TBA

List of Topics and Material

The proposed papers for each topic in this course are usually selected from the following venues/publications:


CVPR: Conference on Computer Vision and Pattern Recognition
ICLR: International Conference on Learning Representations
NeurIPS: Neural Information Processing Systems

TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence

TMI: IEEE Transaction on Medical Imaging
JBHI: IEEE Journal of Biomedical and Health Informatics
MedIA: Medical Image Analysis (Elsevier)

MICCAI: Medical Image Computing and Computer-Assisted Intervention.
BMVC: British Machine Vision Conference
MIDL: Medical Imaging with Deep Learning


List of topics

NoTopicSample PapersJournal/ ConferenceTutorStudentLink
1

Medical / Biomedical AI Agent

CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image AnalysisNeurIPS 2025
https://arxiv.org/abs/2505.20510
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-MakingNeurIPS 2024https://arxiv.org/abs/2404.15155
Enhancing diagnostic capability with multi-agents conversational (MAC)NPJ Digital Medicinehttps://www.nature.com/articles/s41746-025-01550-0
2

Any-to-Any Multimodal LLM

NExT-GPT: Any-to-Any Multimodal LLMICML 2024
https://arxiv.org/abs/2309.05519
Training Transitive and Commutative Multimodal (LoReTTa)NeurIPS 2023https://arxiv.org/abs/2305.14243
Meta-Transformer: A Unified Framework for Multimodal Learningarxivhttps://arxiv.org/abs/2307.10802
3

Audio Reconstruction/Generation from Video Sequences

Speech Audio Generation from Dynamic MRI via a Knowledge Enhanced Conditional Variational AutoencoderMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/2374_paper.pdf

MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisCVPR 2025https://arxiv.org/pdf/2412.15322
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Soundsarxivhttps://arxiv.org/pdf/2407.01494
4

Tissue biomechanics modelling in Deep Learning

Real-time simulation of viscoelastic tissue behavior with physics-guided deep learningarxiv
https://arxiv.org/pdf/2301.04614
Data-Driven Tissue- and Subject-Specific Elastic Regularization for Medical Image RegistrationMICCAI 2024

https://papers.miccai.org/miccai-2024/paper/3303_paper.pdf

Review of Machine Learning Techniques in Soft Tissue Biomechanics and BiomaterialsReview article

https://link.springer.com/article/10.1007/s13239-024-00737-y

5

Gaussian Splatting in minimally invasive surgery

T2GS: Comprehensive Reconstruction of Dynamic Surgical Scenes with Gaussian SplattingMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/5019_paper.pdf

SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian SplattingMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/1324_paper.pdf

EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian SplattingMICCAI 2024

https://papers.miccai.org/miccai-2024/paper/0791_paper.pdf

6

OCT-based retina representation, but where to put it?

Gaussian Primitive Optimized Deformable Retinal Image RegistrationMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/3875_paper.pdf

Retinal OCT Image Registration: Methods and ApplicationsIEEE RBME Vol. 16 2021

https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9531445

RetinaRegNet: A zero-shot approach for retinal image registrationComput. Biol. Med 2025

https://www.sciencedirect.com/science/article/pii/S001048252401730X

7Adaptations of generalist segmentation models to medical scenesUM-SAM: Unsupervised Medical Image Segmentation using Knowledge Distillation from Segment Anything ModelMICCAI 2025Diego Biagini

https://papers.miccai.org/miccai-2025/paper/2296_paper.pdf

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term TrackingMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/0617_paper.pdf

SR-SAM: Subspace Regularization for Domain Generalization of Segment Anything ModelMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/2210_paper.pdf

8Segmentation of delicate tissue layers in OCT images of the posterior eyeAutomated retinal boundary segmentation of optical coherence tomography images using an improved Canny operatorNature Sci Rep 2022

https://www.nature.com/articles/s41598-022-05550-y#citeas

Retinal OCT image segmentation with deep learning: A review of advances, datasets, and evaluation metricsCMIG 2025

https://www.sciencedirect.com/science/article/abs/pii/S0895611125000485

Weakly supervised segmentation of retinal layers on OCT images with AMD using uncertainty prototype and boundary regressionMedIA 2025

https://www.sciencedirect.com/science/article/abs/pii/S1361841525001197

9Understanding Neurosurgery (Pituitary Surgery)F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic SurgeryMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/1527_paper.pdf

Automatic summarization of endoscopic skull base surgical videos through object detection and hidden Markov modelingCMIG 2023

https://www.sciencedirect.com/science/article/pii/S0895611123000666

SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgeryarxivhttps://arxiv.org/pdf/2503.09474
10

Surgical Video Understanding

Future Slot Prediction for Unsupervised Object Discovery in Surgical VideoMICCAI 2025

https://papers.miccai.org/miccai-2025/paper/4725_paper.pdf

General surgery vision transformer: A video pre-trained foundation model for general surgeryarxivhttps://arxiv.org/pdf/2403.05949
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learningarxivhttps://arxiv.org/pdf/2408.07981
11implicit neural representation combined with diffusion modelsHighly accelerated MRI via implicit neural representation guided posterior sampling of diffusion modelsMedIA 25Mohammad Farid Azampour

https://www.sciencedirect.com/science/article/pii/S1361841524003232?casa_token=h74qJgOHDwYAAAAA:koHfi_l4bAZVBBE7hmP-WmtUEx-jqWuQBYmtS8XdqqhNgahHyXQ1I9cpnPCppDvf_KjxBYhYeg

Hyperdiffusion: Generating implicit neural fields with weight-space diffusionIccv 23

http://openaccess.thecvf.com/content/ICCV2023/papers/Erkoc_HyperDiffusion_Generating_Implicit_Neural_Fields_with_Weight-Space_Diffusion_ICCV_2023_paper.pdf

Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural RepresentationsMiccai 25https://arxiv.org/pdf/2504.19402

Literature and Helpful Links

A lot of scientific publications can be found online.

The following list may help you to find some further information on your particular topic:

Some publishers:

Libraries (online and offline):

Some further hints for working with references:

  • JabRef is a Java program for comfortable working with Bibtex literature databases. Handy feature: if you know the PubMed ID for an article, JabRef can import data from there (via "Web Search/Medline").
  • Mendeley is a cross-platform program for organising your references.

If you find useful resources that are not already listed here, please tell us, so we can add them for others. Thanks.

  • Keine Stichwörter