Instructors: Prof. Dr. Nassir Navab, Dr. Shahrooz Faghihroohi, Azade Farshad, Yousef Yeganeh

Time: TBA

Registration

Registration must be done through TUM Matching Platform (please pay attention to the Deadlines)
In order to increase your priority, please also apply via our own Registration system.
The maximum number of participants: 20.

Announcements

The presentation and blogpost guidelines are available here: TBA
The preliminary meeting slides can be found here: DLMA-PreliminaryMeeting-SS24.pdf
The preliminary meeting is scheduled for Feb 1st, 13:30 to 14:00 with the following Zoom link:

https://tum-conf.zoom-x.de/j/62875182420?pwd=SWlFM0Jya0dQVFNLeUVrUHg5cWhDUT09

Introduction

Deep Learning is growing tremendously in Computer Vision and Medical Imaging as well. Highly impacted journals in the medical imaging community, i.e. IEEE Transaction on Medical Imaging, recently published their special edition on Deep Learning [1]. The Seminar will propose a list of recent scientific articles related to the main current research topics in deep learning for Medical Applications, together with some interesting papers from other communities (CVPR, NeurIPS, ICCV, ICLR, ICML, ...).

Course Structure

In this Master Seminar (Hauptseminar), students select one scientific topic from the list provided by course organizers. The students should read the proposed sample papers by the tutors, find the topic-related articles, summarize and compare them in their presentation and blogpost:

Presentation: The selected paper is presented to the other participants (Maximum 25 minutes presentation, 10 minutes questions). You can use the CAMP templates for PowerPoint TUM-Template.pptx.
Blog Post: A blog post of 3000-3500 words excluding references, should be submitted before the deadline. The blog post must include all references used and must be written completely in your own words. Copy and paste will not be tolerated.
Attendance: Participants have to participate actively in all seminar sessions. Each presentation is followed by a discussion, and everyone is encouraged to actively participate.

Submission Deadline: You have to submit the blog post by the first presentation date (13.06) and can modify it a bit until the last session of the course.

Schedule (TBA)

Date

Session: Topics

Students

13.06.2024

Multimodal Generative Models

Causal Generative Models

Handling Motion in Medical Imaging with Spatio-Temporal Generative Models

Linus Salzmann

Simoleit Cameron

Julien Schulz

20.06.2024

Wavelet and Diffusion Models

Medical Image Reconstruction Using Diffusion Models

Diffusion-based 3D Shape completion

Leonhard Zirus

Janina Schellenberg

Johannes Thyroff

27.06.2024

Video Synthesis Using Diffusion Model

Fast diffusion models

Yugay Vasiliy

Zhang Shihong

04.07.2024

Deformable Image Registration with Implicit Neural Representations

Neural Implicit Representations for Medical Shapes

3d reconstruction in the context of medical applications

Zhang Xingyu

Laura Leschke

Arpi Arustamyan

11.07.2024

Can a Neural Network learn Physiology?

Fine-tuning Large Language Models using Reinforcement Learning

Graph Diffusion Models

Donnate Hooft

Tomislav Pavković

UNG Jacques

List of Topics and Material

The proposed papers for each topic in this course are usually selected from the following venues/publications:

CVPR: Conference on Computer Vision and Pattern Recognition
ICLR: International Conference on Learning Representations
NeurIPS: Neural Information Processing Systems

TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence

TMI: IEEE Transaction on Medical Imaging
JBHI: IEEE Journal of Biomedical and Health Informatics
MedIA: Medical Image Analysis (Elsevier)

MICCAI: Medical Image Computing and Computer-Assisted Intervention.
BMVC: British Machine Vision Conference
MIDL: Medical Imaging with Deep Learning

List of papers (TBA)

No	Topic	Sample Papers	Journal/ Conference	Tutor	Student	Link
1	Multimodal Generative Models	Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models	CVPR 2024	Azade Farshad , Yeganeh, Y. M.	Linus Salzmann	https://arxiv.org/pdf/2404.03913
		Instruct-Imagen: Image Generation with Multi-modal Instruction	CVPR 2024			https://arxiv.org/abs/2401.01952
		MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant	CVPR 2024			https://arxiv.org/abs/2403.04290
2	Causal Generative Models	Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting	ICML 2023	Azade Farshad , Yeganeh, Y. M.	Simoleit Cameron	https://openreview.net/pdf?id=Kw90j2pNSt
		Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models	CVPR 2024			https://arxiv.org/abs/2312.06685
		What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models	ICCV 2023			https://arxiv.org/abs/2310.06627
3	Graph Diffusion Models	Data-Centric Learning from Unlabeled Graphs with Diffusion Model	NeurIPS 2023	Azade Farshad , Yeganeh, Y. M.	UNG Jacques	https://proceedings.neurips.cc/paper_files/paper/2023/file/4290cccf23be59e42a575d026ccbeeb8-Paper-Conference.pdf
		Autoregressive Diffusion Model for Graph Generation	ICML 2023			https://proceedings.mlr.press/v202/kong23b/kong23b.pdf
		DiGress: Discrete Denoising diffusion for graph generation	ICLR 2023			https://arxiv.org/abs/2209.14734
4	Medical Image Reconstruction Using Diffusion Models	DiffGAN: An adversarial diffusion model with local transformer for MRI reconstruction	Magnetic Resonance Imaging 2024	Shahrooz Faghihroohi	Janina Schellenberg	https://www.sciencedirect.com/science/article/pii/S0730725X24000730
		DOLCE: A model-based probabilistic diffusion framework for limited-angle ct reconstruction	ICCV 2023			https://openaccess.thecvf.com/content/ICCV2023/papers/Liu_DOLCE_A_Model-Based_Probabilistic_Diffusion_Framework_for_Limited-Angle_CT_Reconstruction_ICCV_2023_paper.pdf
		Adaptive diffusion priors for accelerated MRI reconstruction	MedIA 2023			https://www.sciencedirect.com/science/article/pii/S1361841523001329
5	Video Synthesis Using Diffusion Model	Align your latents: High-resolution video synthesis with latent diffusion models	ICCV 2023	Shahrooz Faghihroohi	Yugay Vasiliy	https://openaccess.thecvf.com/content/CVPR2023/papers/Blattmann_Align_Your_Latents_High-Resolution_Video_Synthesis_With_Latent_Diffusion_Models_CVPR_2023_paper.pdf
		Structure and content-guided video synthesis with diffusion models	ICCV 2023			https://openaccess.thecvf.com/content/ICCV2023/papers/Esser_Structure_and_Content-Guided_Video_Synthesis_with_Diffusion_Models_ICCV_2023_paper.pdf
		Lumiere: A space-time diffusion model for video generation	Arxiv 2024			https://arxiv.org/html/2401.12945v2
6	Handling Motion in Medical Imaging with Spatio-Temporal Generative Models	CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy	TMI 2023	Magdalena Wysocki	Julien Schulz	https://ieeexplore.ieee.org/document/10315018
		Learning a Generative Motion Model From Image Sequences Based on a Latent Motion Matrix	TMI 2021			https://ieeexplore.ieee.org/document/9344838
		SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation	IPMI 2023			https://arxiv.org/pdf/2212.08228
7	Deformable Image Registration with Implicit Neural Representations	Deformable Image Registration with Geometry-informed Implicit Neural Representations	MIDL 2024	Magdalena Wysocki	Zhang Xingyu	https://proceedings.mlr.press/v227/harten24a/harten24a.pdf
		SINR: Spline-enhanced implicit neural representation for multi-modal registration	MIDL 2024			https://openreview.net/pdf?id=V5XDYSRcQP
		Robust Deformable Image Registration Using Cycle-Consistent Implicit Representations	TMI 2023			https://ieeexplore.ieee.org/abstract/document/10268959?casa_token=oOvfZKXswRgAAAAA:9rRoWYgKustP9wtK5PJo1_9k3HKWwMbo9I8EQausmHk3eNLeU_VYvl7Fdghu77Yp8YE7tN5mTA
8	Wavelet and Diffusion Models	Wavelet-Improved Score-Based Generative Model for Medical Imaging	TMI 2024	Mohammad Farid Azampour	Leonhard Zirus	https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10288274&casa_token=2avtrTfCV0gAAAAA:KvCCDS2zCZ44YoW5naT2ukf12WhGPI-vvS_keWa5_Qj-Cwu7j8f2EWtypTfYbNZGDrIQ4ILCx2I&tag=1
		Wavelet Score-Based Generative Modeling	NeurIPS 2022			https://proceedings.neurips.cc/paper_files/paper/2022/file/03474669b759f6d38cdca6fb4eb905f4-Paper-Conference.pdf
		Neural Wavelet-domain Diffusion for 3D Shape Generation	Siggraph 2022			https://arxiv.org/pdf/2209.08725.pdf
9	Neural Implicit Representations for Medical Shapes	4D Myocardium Reconstruction with Decoupled Motion and Shape Model	ICCV 2023	Magdalena Wysocki	Laura Leschke	https://openaccess.thecvf.com/content/ICCV2023/papers/Yuan_4D_Myocardium_Reconstruction_with_Decoupled_Motion_and_Shape_Model_ICCV_2023_paper.pdf
		MedShapeNet - A Large-Scale Dataset of 3DMedical Shapes for Computer Vision	Arxiv 2023 + MICCAI Workshop			https://arxiv.org/pdf/2308.16139
		ImplicitAtlas: Learning Deformable Shape Templates in Medical Imaging	ICPR 2022			https://openaccess.thecvf.com/content/CVPR2022/papers/Yang_ImplicitAtlas_Learning_Deformable_Shape_Templates_in_Medical_Imaging_CVPR_2022_paper.pdf
10	Diffusion-based 3D Shape completion	SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation	CVPR 2023	Miruna-Alexandra Gafencu	Johannes Thyroff	https://openaccess.thecvf.com/content/CVPR2023/papers/Cheng_SDFusion_Multimodal_3D_Shape_Completion_Reconstruction_and_Generation_CVPR_2023_paper.pdf
		Diffusion-SDF: Conditional Generative Modeling of Signed Distance Functions	ICCV 2023			https://openaccess.thecvf.com/content/ICCV2023/papers/Chou_Diffusion-SDF_Conditional_Generative_Modeling_of_Signed_Distance_Functions_ICCV_2023_paper.pdf
		3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models	TOG 2023			https://dl.acm.org/doi/pdf/10.1145/3592442
11	Fine-tuning Large Language Models using Reinforcement Learning	Training language models to follow instructions with human feedback	NeurIPS 2022	David Bani-Harouni	Tomislav Pavković	https://arxiv.org/abs/2203.02155
		Quark: Controllable Text Generation with Reinforced [Un]learning	NeurIPS 2022			https://arxiv.org/abs/2205.13636
		Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering	EMNLP 2022			https://arxiv.org/abs/2210.03078
12	Can a Neural Network learn Physiology?	The New Field of Network Physiology: Building the Human Physiolome	Frontiers in Network Physiology 2021	Francesca De Benetti	Donnate Hooft	https://www.frontiersin.org/articles/10.3389/fnetp.2021.711778/full
		Dynamic networks of cortico-muscular interactions in sleep and neurodegenerative disorders	Frontiers in Network Physiology 2023			https://www.frontiersin.org/articles/10.3389/fnetp.2023.1168677/full
		Dynamic networks of physiologic interactions of brain waves and rhythms in muscle activity	Human Movement Science 2022			https://www.sciencedirect.com/science/article/pii/S0167945722000513
13	3d reconstruction in context of medical applications	Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction	NeurIPS2024	Unbekannter Benutzer (ga87jay)	Arpi Arustamyan	https://proceedings.neurips.cc/paper_files/paper/2023/file/412732f172bdd5ad0efde2fafa110700-Paper-Datasets_and_Benchmarks.pdf
		3D reconstruction of proximal femoral fracture from biplanar radiographs with fractural representative learning	Scientific Reports 2023			https://www.nature.com/articles/s41598-023-27607-2
		A Deep-Learning Approach For Direct Whole-Heart Mesh Reconstruction	Medical Image Analysis 2021			https://arxiv.org/abs/2102.07899
14	Fast diffusion models	UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models	Neurips 2024	Mohammad Farid Azampour	Zhang Shihong	https://proceedings.neurips.cc/paper_files/paper/2023/file/9c2aa1e456ea543997f6927295196381-Paper-Conference.pdf
		Fast Sampling of Diffusion Models via Operator Learning	ICML 2023			https://proceedings.mlr.press/v202/zheng23d/zheng23d.pdf
		One-Step Diffusion Distillation via Deep Equilibrium Models	Neurips 2024			https://proceedings.neurips.cc/paper_files/paper/2023/file/82f05a105c928c10706213952bf0c8b7-Paper-Conference.pdf

Literature and Helpful Links

A lot of scientific publications can be found online.

The following list may help you to find some further information on your particular topic:

Some publishers:

Libraries (online and offline):

http://rzblx1.uni-regensburg.de/ezeit/ (Elektronische Zeitschriften Bibliothek)
Verbundkatalog des Bibliotheksverbundes Bayern (BVB)
Computer ORG
http://www.ub.tum.de/ (TUM Library)
- To get access onto the electronic library, see http://www.ub.tum.de/medien/ejournals/readme.html
- "proxy.biblio.tu-muenchen.de" mit Port 8080 (nur fuer http). Damit klappen zumindest portal.acm.org und computer.org meistens
Various proceedings of conferences in our AR-Lab, 03.13.036 (These proceedings are not for lending!)

Some further hints for working with references:

JabRef is a Java program for comfortable working with Bibtex literature databases. Handy feature: if you know the PubMed ID for an article, JabRef can import data from there (via "Web Search/Medline").
Mendeley is a cross-platform program for organising your references.

If you find useful resources that are not already listed here, please tell us, so we can add them for others. Thanks.

Seitenhierarchie

DLMA: Summer 2024