Instructors: Prof. Dr. Nassir NavabDr. Shahrooz Faghihroohi, Azade Farshad, Yousef Yeganeh


Time: TBA

Registration

Announcements

  • The presentation and blogpost guidelines are available here: Guide_DLMA SS2022.pdf 
  • The preliminary meeting slides can be found here: DLMAWS23-24.pdf
  • The preliminary meeting is scheduled for July 5th, 14:30 (Zoom link is visible on TUMonline).

Introduction

  • Deep Learning is growing tremendously in Computer Vision and Medical Imaging as well. Highly impacted journals in the medical imaging community, i.e. IEEE Transaction on Medical Imaging, recently published their special edition on Deep Learning [1]. The Seminar will propose a list of recent scientific articles related to the main current research topics in deep learning for Medical Applications, together with some interesting papers from other communities (CVPR, NeurIPS, ICCV, ICLR, ICML, ...).

Course Structure

In this Master Seminar (Hauptseminar), students select one scientific topic from the list provided by course organizers. The students should read the proposed sample papers by the tutors, find the topic-related articles, summarize and compare them in their presentation and blogpost:

  • Presentation: The selected paper is presented to the other participants (Maximum 25 minutes presentation, 10 minutes questions). You can use the CAMP templates for PowerPoint TUM-Template.pptx.
  • Blog Post: A blog post of 2500-3000 words excluding references, should be submitted before the deadline. The blog post must include all references used and must be written completely in your own words. Copy and paste will not be tolerated.
  • Attendance: Participants have to participate actively in all seminar sessions. Each presentation is followed by a discussion, and everyone is encouraged to actively participate.

Submission Deadline: You have to submit the blog post by the last session.

Schedule (TBA)

DateSession: TopicsStudents
14.12

Video Object Segmentation

Audio Segmentation and Sound Event Detection

Temporal Modeling for Longitudinal Medical Data

Zong, Xia

Sengün Gökce

Ruochen Li

21.12

Multimodal Learning with Functional and Structural MRI Analysis

Epileptic Seizure Detection and Prediction Using EEG

Learning-based Statistical Shape Model

El Alaoui Talibi, Ghita

Seidl, Máté

Tang, Yilin

11.01

Synthetic vessel generation

3D reconstruction from a single or biplanar images

Automatic C-arm Positioning/Pose estimation

Image Stitching Using Unsupervised/Semi-Supervised Learning


Obelleiro-Liz, Manuel

Jingtian Zhao

Ding Zhou

Krüger, Moritz

18.01

Physics-inspired Neural Networks for Medical Applications

Physics-inspired diffusion model

Representation Learning for Modeling Interactions

Aggarwal, Kunal

Weixuan Yuan

Chia-Chian Chan

25.01

Rethinking ultrasound confidence maps

Leveraging Knowledge for Medical Image Understanding in Radiology

Temporal Knowledge Graphs

Baller, Stephan

Khattab, Muhammad

Fatemeh Shamsoddini Ardekani

List of Topics and Material

The proposed papers for each topic in this course are usually selected from the following venues/publications:


CVPR: Conference on Computer Vision and Pattern Recognition
ICLR: International Conference on Learning Representations
NeurIPS: Neural Information Processing Systems

TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence

TMI: IEEE Transaction on Medical Imaging
JBHI: IEEE Journal of Biomedical and Health Informatics
MedIA: Medical Image Analysis (Elsevier)

MICCAI: Medical Image Computing and Computer-Assisted Intervention
BMVC: British Machine Vision Conference
MIDL: Medical Imaging with Deep Learning


List of papers (TBA)

NoTopicSample PapersJournal/ ConferenceTutorStudentLink
1Representation Learning for Modeling InteractionsPhysical Interaction: Reconstructing Hand-object Interactions with PhysicsSIGGRAPH 2022Chia-Chian Chanhttps://dl.acm.org/doi/abs/10.1145/3550469.3555421
What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactionsCVPR 2022https://arxiv.org/pdf/2204.00746.pdf
PIGNet: a physics-informed deep learning model toward generalized drug–target interaction predictionsChemical Science 2022https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8966633/
2Multimodal Learning with Functional and Structural MRI AnalysisStructural and Functional MRI Data Differentially Predict Chronological Age and Behavioral Memory PerformanceENeuro 2022El Alaoui Talibi, Ghitahttps://www.eneuro.org/content/9/6/ENEURO.0212-22.2022
Abnormal structural and functional network topological properties associated with left prefrontal, parietal, and occipital cortices significantly predict childhood TBI-related attention deficits: A semi-supervised deep learning studyFrontiers in Neuroscience 2023https://www.frontiersin.org/articles/10.3389/fnins.2023.1128646/full
Combined Structural MR and Diffusion Tensor Imaging Classify the Presence of Alzheimer’s Disease With the Same Performance as MR Combined With Amyloid Positron Emission Tomography: A Data Integration ApproachFrontiers in Neuroscience 2022https://www.frontiersin.org/articles/10.3389/fnins.2021.638175/full
3Temporal Modeling for Longitudinal Medical DataThe Queensland Twin Adolescent Brain Project, a longitudinal study of adolescent brain developmentNature 2023Ruochen Lihttps://www.nature.com/articles/s41597-023-02038-w
LSOR: Longitudinally-Consistent Self-Organized Representation LearningMICCAI 2023

https://link.springer.com/chapter/10.1007/978-3-031-43907-0_27#:~:text=Called%20Longitudinally%2Dconsistent%20Self%2DOrganized,assignments%20used%20by%20existing%20SOM).

Mixing Temporal Graphs with MLP for Longitudinal Brain Connectome AnalysisMICCAI 2023https://link.springer.com/chapter/10.1007/978-3-031-43895-0_73
4Video Object SegmentationLook Before You Match: Instance Understanding Matters in Video Object SegmentationCVPR 2023Zong, Xia

https://openaccess.thecvf.com/content/CVPR2023/html/Wang_Look_Before_You_Match_Instance_Understanding_Matters_in_Video_Object_CVPR_2023_paper.html

Unsupervised video object segmentation via prototype memory networkCVPR 2023

https://openaccess.thecvf.com/content/WACV2023/papers/Lee_Unsupervised_Video_Object_Segmentation_via_Prototype_Memory_Network_WACV_2023_paper.pdf

GL-Fusion: Global-Local Fusion Network for Multi-view Echocardiogram Video SegmentationMICCAI 2023https://arxiv.org/abs/2309.11144
5Temporal Knowledge GraphsLearning Meta-Representations of One-shot Relations for Temporal Knowledge Graph Link PredictionIJCNN 2023Fatemeh Shamsoddini Ardekanihttps://arxiv.org/pdf/2205.10621
Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation NetworksAAAI 2021https://ojs.aaai.org/index.php/AAAI/article/view/16604/16411
GenTKG: Generative Forecasting on Temporal Knowledge GrapharXiv 2023https://arxiv.org/pdf/2310.07793
6Image Stitching Using Unsupervised/Semi-Supervised LearningDepth-Aware Multi-Grid Deep Homography Estimation with Contextual CorrelationIEEE Transactions on CSVT 2022Krüger, Moritzhttps://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9605632
Unsupervised Deep Image Stitching: Reconstructing Stitched Features to ImagesTIP 2021https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9472883
Semi-supervised Deep Large-baseline Homography Estimation with Progressive Equivalence ConstraintAAAI 2023https://arxiv.org/abs/2212.02763
7Learning-based Statistical Shape ModelDeep implicit statistical shape models for 3d medical image delineationAAAI 2022Tang, Yilinhttps://ojs.aaai.org/index.php/AAAI/article/view/20110
Deep Structural Causal Shape ModelsECCV 2022https://arxiv.org/abs/2208.10950
Leveraging unsupervised image registration for discovery of landmark shape descriptorMedIA 2021https://www.sciencedirect.com/science/article/abs/pii/S1361841521002036
8Audio Segmentation and Sound Event Detection A review of deep learning techniques in audio event recognition (AER) applicationsMultimedia Tools and Applications 2023Sengün Gökcehttps://link.springer.com/article/10.1007/s11042-023-15891-z
You Only Hear Once: A YOLO-like Algorithm for Audio Segmentation and Sound Event DetectionApplied Sciences 2022https://www.mdpi.com/2076-3417/12/7/3293
THE COCKTAIL FORK PROBLEM: THREE-STEM AUDIO SEPARATION FOR REAL-WORLD SOUNDTRACKIEEE, ICASSP 2022https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9746005
9Physics-inspired diffusion modelsA Physics-informed Diffusion Model for High-fidelity Flow Field ReconstructionJournal of Computational Physics 2022Weixuan Yuanhttps://www.sciencedirect.com/science/article/pii/S0021999123000670
Physics-Driven Diffusion Models for Impact Sound Synthesis from VideosCVPR 2023

https://openaccess.thecvf.com/content/CVPR2023/papers/Su_Physics-Driven_Diffusion_Models_for_Impact_Sound_Synthesis_From_Videos_CVPR_2023_paper.pdf

PhysDiff: Physics-Guided Human Motion Diffusion ModelICCV 2023https://nvlabs.github.io/PhysDiff
10Leveraging Knowledge for Medical Image Understanding in RadiologyKiUT: Knowledge-injected U-Transformer for Radiology Report GenerationCVPR 2023Chantal PellegriniKhattab, Muhammad

https://openaccess.thecvf.com/content/CVPR2023/papers/Huang_KiUT_Knowledge-Injected_U-Transformer_for_Radiology_Report_Generation_CVPR_2023_paper.pdf

Knowledge-enhanced Visual-Language Pre-training on Chest Radiology ImagesNature Communications, 2023https://www.nature.com/articles/s41467-023-40260-7
Cross-modal Prototype Driven Network for Radiology Report GenerationECCV 2022https://link.springer.com/chapter/10.1007/978-3-031-19833-5_33
11Physics-inspired Neural Networks for Medical ApplicationsPhysics-informed neural networks for modeling physiological time series for cuffless blood pressure estimationNPJ Digital Medicine (Nature) 2023Francesca De Benetti Aggarwal, Kunalhttps://www.nature.com/articles/s41746-023-00853-4
WarpPINN: Cine-MR image registration with physics-informed neural networks.MIA 2023 https://arxiv.org/pdf/2211.12549.pdf
Physics-Informed Neural Networks for Brain Hemodynamic Predictions Using Medical ImagingTMI 2022https://ieeexplore.ieee.org/document/9740143
12Rethinking ultrasound confidencce maps ULTRASOUND CONFIDENCE MAPS OF INTENSITY AND STRUCTURE BASED ON DIRECTED ACYCLIC GRAPH AND ARTIFACT MODELSISBI 2021Baller, Stephanhttps://arxiv.org/pdf/2011.11956.pdf
Weakly Supervised Estimation of Shadow Confidence Maps in Fetal Ultrasound ImagingTMI 2019https://arxiv.org/pdf/1811.08164v3.pdf
Stochastic Neural Radiance Fields: Quantifying Uncertainty in Implicit 3D Representations3DV20121https://arxiv.org/pdf/2109.02123.pdf
13Synthetic vessel generationVesselVAE: Recursive Variational Autoencoders for 3D Blood Vessel SynthesisMICCAI 2023Agnieszka TomczakObelleiro-Liz, Manuelhttps://arxiv.org/pdf/2307.03592.pdf
Physiology-based simulation of the retinal vasculature enables annotation-free segmentation of OCT angiographsMICCAI 2022https://arxiv.org/pdf/2207.11102.pdf
Optional: Blood Vessel Geometry Synthesis using Generative Adversarial Networksarxiv 2018https://arxiv.org/pdf/1804.04381.pdf
14Epileptic Seizure Detection and Prediction Using EEGEfficient graph convolutional networks for seizure prediction using scalp EEGFrontiers in NeuroScience 2022Seidl, Mátéhttps://www.frontiersin.org/articles/10.3389/fnins.2022.967116/full
Patient-Specific Seizure Prediction via Adder Network and Supervised Contrastive LearningIEEE Transactions on Neural System 2022https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9787538
Data Augmentation for Seizure Prediction with Generative Diffusion ModelArxiv 2023https://arxiv.org/pdf/2306.08256.pdf
153D reconstruction from single imageImage-to-Graph Convolutional Network for Deformable Shape Reconstruction from a Single Projection ImageMICCAI 2021Jingtian Zhaohttps://arxiv.org/pdf/2108.12533.pdf
X2Vision : 3D CT Reconstruction from Biplanar X-Rays with Deep Structure PriorMICCAI 2023https://link.springer.com/chapter/10.1007/978-3-031-43999-5_66
X2CT-GAN: Reconstructing CT from Biplanar X-Rays with Generative Adversarial NetworksCVPR 2019

https://openaccess.thecvf.com/content_CVPR_2019/papers/Ying_X2CT-GAN_Reconstructing_CT_From_Biplanar_X-Rays_With_Generative_Adversarial_Networks_CVPR_2019_paper.pdf

16Automatic C-arm Positioning/Pose estimationShape-Based Pose Estimation for Automatic Standard Views of the KneeMICCAI2023Ding Zhouhttps://link.springer.com/chapter/10.1007/978-3-031-43990-2_45
Aneurysm Pose Estimation with Deep LearningMICCAI2023https://link.springer.com/chapter/10.1007/978-3-031-43895-0_51
C-arm positioning for standard projections during spinal implant placementMIA2022https://www.sciencedirect.com/science/article/pii/S136184152200202X


Literature and Helpful Links

A lot of scientific publications can be found online.

The following list may help you to find some further information on your particular topic:

Some publishers:

Libraries (online and offline):

Some further hints for working with references:

  • JabRef is a Java program for comfortable working with Bibtex literature databases. Handy feature: if you know the PubMed ID for an article, JabRef can import data from there (via "Web Search/Medline").
  • Mendeley is a cross-platform program for organising your references.

If you find useful resources that are not already listed here, please tell us, so we can add them for others. Thanks.

  • Keine Stichwörter