Instructors: Prof. Dr. Nassir NavabDr. Shahrooz Faghihroohi, Azade Farshad, Yousef Yeganeh


Time: TBA

Registration

Announcements

  • The presentation and blogpost guidelines are available here: TBA 
  • The preliminary meeting slides can be found here: DLMA-PreliminaryMeeting-SS24.pdf
  • The preliminary meeting is scheduled for Feb 1st, 13:30 to 14:00 with the following Zoom link: 

https://tum-conf.zoom-x.de/j/62875182420?pwd=SWlFM0Jya0dQVFNLeUVrUHg5cWhDUT09

Introduction

  • Deep Learning is growing tremendously in Computer Vision and Medical Imaging as well. Highly impacted journals in the medical imaging community, i.e. IEEE Transaction on Medical Imaging, recently published their special edition on Deep Learning [1]. The Seminar will propose a list of recent scientific articles related to the main current research topics in deep learning for Medical Applications, together with some interesting papers from other communities (CVPR, NeurIPS, ICCV, ICLR, ICML, ...).

Course Structure

In this Master Seminar (Hauptseminar), students select one scientific topic from the list provided by course organizers. The students should read the proposed sample papers by the tutors, find the topic-related articles, summarize and compare them in their presentation and blogpost:

  • Presentation: The selected paper is presented to the other participants (Maximum 25 minutes presentation, 10 minutes questions). You can use the CAMP templates for PowerPoint TUM-Template.pptx.
  • Blog Post: A blog post of 3000-3500 words excluding references, should be submitted before the deadline. The blog post must include all references used and must be written completely in your own words. Copy and paste will not be tolerated.
  • Attendance: Participants have to participate actively in all seminar sessions. Each presentation is followed by a discussion, and everyone is encouraged to actively participate.

Submission Deadline: You have to submit the blog post by your presentation date and can modify it until the last session of the course.

Schedule (TBA)

DateSession: TopicsStudents
13.06.2024

Multimodal Generative Models

Causal Generative Models

Handling Motion in Medical Imaging with Spatio-Temporal Generative Models

Linus Salzmann

Simoleit Cameron

Julien Schulz

20.06.2024

Wavelet and Diffusion Models

Medical Image Reconstruction Using Diffusion Models

Diffusion-based 3D Shape completion 

Leonhard Zirus

Janina Schellenberg

Johannes Thyroff

27.06.2024

Video Synthesis Using Diffusion Model

Graph Diffusion Models

Fast diffusion models

Yugay Vasiliy

UNG Jacques

Zhang Shihong

04.07.2024

Deformable Image Registration with Implicit Neural Representations

Neural Implicit Representations for Medical Shapes

3d reconstruction in the context of medical applications

Zhang Xingyu

Laura Leschke

Arpi Arustamyan

11.07.2024

Can a Neural Network learn Physiology?

Fine-tuning Large Language Models using Reinforcement Learning

Donnate Hooft

Tomislav Pavković

List of Topics and Material

The proposed papers for each topic in this course are usually selected from the following venues/publications:


CVPR: Conference on Computer Vision and Pattern Recognition
ICLR: International Conference on Learning Representations
NeurIPS: Neural Information Processing Systems

TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence

TMI: IEEE Transaction on Medical Imaging
JBHI: IEEE Journal of Biomedical and Health Informatics
MedIA: Medical Image Analysis (Elsevier)

MICCAI: Medical Image Computing and Computer-Assisted Intervention.
BMVC: British Machine Vision Conference
MIDL: Medical Imaging with Deep Learning


List of papers (TBA)

NoTopicSample PapersJournal/ ConferenceTutorStudentLink
1

Multimodal Generative Models

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image ModelsCVPR 2024Linus Salzmannhttps://arxiv.org/pdf/2404.03913
Instruct-Imagen: Image Generation with Multi-modal InstructionCVPR 2024https://arxiv.org/abs/2401.01952
MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual InvariantCVPR 2024https://arxiv.org/abs/2403.04290
2Causal Generative ModelsGenerative Causal Representation Learning for Out-of-Distribution Motion
Forecasting
ICML 2023Simoleit Cameronhttps://openreview.net/pdf?id=Kw90j2pNSt
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language ModelsCVPR 2024https://arxiv.org/abs/2312.06685
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language ModelsICCV 2023https://arxiv.org/abs/2310.06627
3Graph Diffusion ModelsData-Centric Learning from Unlabeled Graphs with Diffusion ModelNeurIPS 2023UNG Jacques

https://proceedings.neurips.cc/paper_files/paper/2023/file/4290cccf23be59e42a575d026ccbeeb8-Paper-Conference.pdf

Autoregressive Diffusion Model for Graph GenerationICML 2023https://proceedings.mlr.press/v202/kong23b/kong23b.pdf
DiGress: Discrete Denoising diffusion for graph generationICLR 2023https://arxiv.org/abs/2209.14734
4Medical Image Reconstruction Using Diffusion ModelsDiffGAN: An adversarial diffusion model with local transformer for MRI reconstructionMagnetic Resonance Imaging 2024Shahrooz Faghihroohi Janina Schellenberghttps://www.sciencedirect.com/science/article/pii/S0730725X24000730
DOLCE: A model-based probabilistic diffusion framework for limited-angle ct reconstructionICCV 2023

https://openaccess.thecvf.com/content/ICCV2023/papers/Liu_DOLCE_A_Model-Based_Probabilistic_Diffusion_Framework_for_Limited-Angle_CT_Reconstruction_ICCV_2023_paper.pdf

Adaptive diffusion priors for accelerated MRI reconstructionMedIA 2023https://www.sciencedirect.com/science/article/pii/S1361841523001329
5Video Synthesis Using Diffusion ModelAlign your latents: High-resolution video synthesis with latent diffusion modelsICCV 2023Shahrooz Faghihroohi Yugay Vasiliy

https://openaccess.thecvf.com/content/CVPR2023/papers/Blattmann_Align_Your_Latents_High-Resolution_Video_Synthesis_With_Latent_Diffusion_Models_CVPR_2023_paper.pdf

Structure and content-guided video synthesis with diffusion modelsICCV 2023

https://openaccess.thecvf.com/content/ICCV2023/papers/Esser_Structure_and_Content-Guided_Video_Synthesis_with_Diffusion_Models_ICCV_2023_paper.pdf

Lumiere: A space-time diffusion model for video generationArxiv 2024https://arxiv.org/html/2401.12945v2
6Handling Motion in Medical Imaging with Spatio-Temporal Generative ModelsCHeart: A Conditional Spatio-Temporal Generative Model for Cardiac AnatomyTMI 2023Julien Schulzhttps://ieeexplore.ieee.org/document/10315018
Learning a Generative Motion Model From Image Sequences Based on a Latent Motion MatrixTMI 2021https://ieeexplore.ieee.org/document/9344838
SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image GenerationIPMI 2023https://arxiv.org/pdf/2212.08228
7Deformable Image Registration with Implicit Neural RepresentationsDeformable Image Registration with Geometry-informed Implicit Neural RepresentationsMIDL 2024Zhang Xingyuhttps://proceedings.mlr.press/v227/harten24a/harten24a.pdf
SINR: Spline-enhanced implicit neural representation for multi-modal registrationMIDL 2024https://openreview.net/pdf?id=V5XDYSRcQP
Robust Deformable Image Registration Using Cycle-Consistent Implicit RepresentationsTMI 2023

https://ieeexplore.ieee.org/abstract/document/10268959?casa_token=oOvfZKXswRgAAAAA:9rRoWYgKustP9wtK5PJo1_9k3HKWwMbo9I8EQausmHk3eNLeU_VYvl7Fdghu77Yp8YE7tN5mTA

8Wavelet and Diffusion ModelsWavelet-Improved Score-Based Generative Model for Medical ImagingTMI 2024Mohammad Farid Azampour Leonhard Zirus

https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10288274&casa_token=2avtrTfCV0gAAAAA:KvCCDS2zCZ44YoW5naT2ukf12WhGPI-vvS_keWa5_Qj-Cwu7j8f2EWtypTfYbNZGDrIQ4ILCx2I&tag=1

Wavelet Score-Based Generative ModelingNeurIPS 2022

https://proceedings.neurips.cc/paper_files/paper/2022/file/03474669b759f6d38cdca6fb4eb905f4-Paper-Conference.pdf

Neural Wavelet-domain Diffusion for 3D Shape GenerationSiggraph 2022https://arxiv.org/pdf/2209.08725.pdf
9Neural Implicit Representations for Medical Shapes4D Myocardium Reconstruction with Decoupled Motion and Shape ModelICCV 2023Laura Leschke

https://openaccess.thecvf.com/content/ICCV2023/papers/Yuan_4D_Myocardium_Reconstruction_with_Decoupled_Motion_and_Shape_Model_ICCV_2023_paper.pdf

MedShapeNet - A Large-Scale Dataset of 3DMedical Shapes for Computer VisionArxiv 2023 + MICCAI Workshophttps://arxiv.org/pdf/2308.16139
ImplicitAtlas: Learning Deformable Shape Templates in Medical ImagingICPR 2022

https://openaccess.thecvf.com/content/CVPR2022/papers/Yang_ImplicitAtlas_Learning_Deformable_Shape_Templates_in_Medical_Imaging_CVPR_2022_paper.pdf

10Diffusion-based 3D Shape completion SDFusion: Multimodal 3D Shape Completion, Reconstruction, and GenerationCVPR 2023Miruna-Alexandra GafencuJohannes Thyroff

https://openaccess.thecvf.com/content/CVPR2023/papers/Cheng_SDFusion_Multimodal_3D_Shape_Completion_Reconstruction_and_Generation_CVPR_2023_paper.pdf

Diffusion-SDF: Conditional Generative Modeling of Signed Distance FunctionsICCV 2023

https://openaccess.thecvf.com/content/ICCV2023/papers/Chou_Diffusion-SDF_Conditional_Generative_Modeling_of_Signed_Distance_Functions_ICCV_2023_paper.pdf

3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion ModelsTOG 2023https://dl.acm.org/doi/pdf/10.1145/3592442
11Fine-tuning Large Language Models using Reinforcement LearningTraining language models to follow instructions with human feedbackNeurIPS 2022David Bani-HarouniTomislav Pavkovićhttps://arxiv.org/abs/2203.02155
Quark: Controllable Text Generation with Reinforced [Un]learningNeurIPS 2022https://arxiv.org/abs/2205.13636
Rainier: Reinforced Knowledge Introspector for Commonsense Question AnsweringEMNLP 2022https://arxiv.org/abs/2210.03078
12Can a Neural Network learn Physiology?The New Field of Network Physiology: Building the Human PhysiolomeFrontiers in Network Physiology 2021Francesca De BenettiDonnate Hoofthttps://www.frontiersin.org/articles/10.3389/fnetp.2021.711778/full
Dynamic networks of cortico-muscular interactions in sleep and neurodegenerative disordersFrontiers in Network Physiology 2023https://www.frontiersin.org/articles/10.3389/fnetp.2023.1168677/full
Dynamic networks of physiologic interactions of brain waves and rhythms in muscle activityHuman Movement Science 2022https://www.sciencedirect.com/science/article/pii/S0167945722000513
133d reconstruction in context of medical applicationsBenchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape ReconstructionNeurIPS2024Agnieszka Tomczak Arpi Arustamyan

https://proceedings.neurips.cc/paper_files/paper/2023/file/412732f172bdd5ad0efde2fafa110700-Paper-Datasets_and_Benchmarks.pdf

3D reconstruction of proximal femoral fracture from biplanar radiographs with fractural representative learningScientific Reports 2023https://www.nature.com/articles/s41598-023-27607-2
A Deep-Learning Approach For Direct Whole-Heart Mesh ReconstructionMedical Image Analysis 2021https://arxiv.org/abs/2102.07899
14Fast diffusion modelsUniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion ModelsNeurips 2024Mohammad Farid Azampour Zhang Shihong

https://proceedings.neurips.cc/paper_files/paper/2023/file/9c2aa1e456ea543997f6927295196381-Paper-Conference.pdf

Fast Sampling of Diffusion Models via Operator LearningICML 2023https://proceedings.mlr.press/v202/zheng23d/zheng23d.pdf
One-Step Diffusion Distillation via Deep Equilibrium ModelsNeurips 2024

https://proceedings.neurips.cc/paper_files/paper/2023/file/82f05a105c928c10706213952bf0c8b7-Paper-Conference.pdf


Literature and Helpful Links

A lot of scientific publications can be found online.

The following list may help you to find some further information on your particular topic:

Some publishers:

Libraries (online and offline):

Some further hints for working with references:

  • JabRef is a Java program for comfortable working with Bibtex literature databases. Handy feature: if you know the PubMed ID for an article, JabRef can import data from there (via "Web Search/Medline").
  • Mendeley is a cross-platform program for organising your references.

If you find useful resources that are not already listed here, please tell us, so we can add them for others. Thanks.