Instructors: Prof. Dr. Nassir NavabDr. Shahrooz Faghihroohi, Azade Farshad, Yousef Yeganeh


Time: TBA

Registration

Announcements

  • The presentation and blogpost guidelines are available here: Guide_DLMA SS2022.pdf 
  • The preliminary meeting slides can be found here:
  • The preliminary meeting is scheduled for Feb 2nd, 10:30 (Zoom link is visible on TUMonline).

Introduction

  • Deep Learning is growing tremendously in Computer Vision and Medical Imaging as well. Highly impacted journals in the medical imaging community, i.e. IEEE Transaction on Medical Imaging, recently published their special edition on Deep Learning [1]. The Seminar will propose a list of recent scientific articles related to the main current research topics in deep learning for Medical Applications, together with some interesting papers from other communities (CVPR, NeurIPS, ICCV, ICLR, ICML, ...).

Course Structure

In this Master Seminar (Hauptseminar), students select one scientific topic from the list provided by course organizers. The students should read the proposed sample papers by the tutors, find the topic-related articles, summarize and compare them in their presentation and blogpost:

  • Presentation: The selected paper is presented to the other participants (Maximum 25 minutes presentation, 10 minutes questions). You can use the CAMP templates for PowerPoint TUM-Template.pptx.
  • Blog Post: A blog post of 2500-3000 words excluding references, should be submitted before the deadline. The blog post must include all references used and must be written completely in your own words. Copy and paste will not be tolerated.
  • Attendance: Participants have to participate actively in all seminar sessions. Each presentation is followed by a discussion, and everyone is encouraged to actively participate.

Submission Deadline: You have to submit the blog post by the end of July 23rd.

Schedule (TBA)

DateSession: TopicsSlidesStudents
15.06

Recent Trends in Medical Image Segmentation

3D vessel segmentation

Structural Continuity in Segmentation


Huang, Pei-Ran

Sauer, Bjarne

Güvercin, Göktug

22.06

Exploring Latest Unsupervised Computer Vision Models for Segmentation

Self-supervised Volume Segmentation

Self-supervised graph representation learning


Klausen, Tobias

Altunbas, Begüm

Oytun Demirbilek

29.06

Image Superresolution Using Generative Models

Sound and Music Generative Models

Sensorless US compounding


Schauer, Robert

Victor Dzhagatspanyan

Sharma, Devansh

6.07

Application of Diffusion Models for Medical Imaging

Image to image translation with diffusion models

Sampling Methods in Diffusion Models


Cheng, JiaJian

Trigui Amal

Yeşilkaynak, Vahit Buğra

13.07

Converting weights of 2D Vision Transformer for 3D Image Classification

Natural Language Explanations for Vision and Vision-Language tasks

non-rigid 2d-3d registration

Image Stitching Using Unsupervised/Semi-Supervised Learning


Ben Chaaben, Zeineb

Marin Ruiz, Jorge

Yang, Shucheng

Güven Erkaya

20.07

Physics-inspired Neural Networks

Counterfactual Modelling

Confidence segmentation


Wagner, Jakob

Pennig, Lars

Yakal, Furkan

27.07TBA

List of Topics and Material

The proposed papers for each topic in this course are usually selected from the following venues/publications:


CVPR: Conference on Computer Vision and Pattern Recognition
ICLR: International Conference on Learning Representations
NeurIPS: Neural Information Processing Systems

TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence

TMI: IEEE Transaction on Medical Imaging
JBHI: IEEE Journal of Biomedical and Health Informatics
MedIA: Medical Image Analysis (Elsevier)

MICCAI: Medical Image Computing and Computer-Assisted Intervention
BMVC: British Machine Vision Conference
MIDL: Medical Imaging with Deep Learning


List of papers (TBA)

NoTopicSample PapersJournal/ ConferenceTutorStudentLink
1Counterfactual ModellingCOUNTERFACTUAL GENERATIVE NETWORKSICLR 2021Pennig, Larshttps://arxiv.org/pdf/2101.06046.pdf
Uncertainty Estimation and Out-of-Distribution Detection for Counterfactual Explanations: Pitfalls and SolutionsICML 2021https://arxiv.org/pdf/2107.09734.pdf
ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical ImagingArXiv 2023https://arxiv.org/pdf/2303.15421.pdf
2Sampling Methods in Diffusion ModelsFast Sampling of Diffusion Models with Exponential IntegratorICLR 2023Yeşilkaynak, Vahit Buğrahttps://arxiv.org/pdf/2204.13902.pdf
DENOISING DIFFUSION IMPLICIT MODELSICLR 2021https://arxiv.org/pdf/2010.02502.pdf
PSEUDO NUMERICAL METHODS FOR DIFFUSION MODELS ON MANIFOLDSICLR 2022https://arxiv.org/pdf/2202.09778.pdf
3Image Stitching Using Unsupervised/Semi-Supervised LearningDepth-Aware Multi-Grid Deep Homography Estimation with Contextual CorrelationIEEE Transactions on CSVT 2022Shahrooz Faghihroohi Güven Erkayahttps://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9605632
Unsupervised Deep Image Stitching: Reconstructing Stitched Features to ImagesTIP 2021https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9472883
Semi-supervised Deep Large-baseline Homography Estimation with Progressive Equivalence ConstraintAAAI 2023https://arxiv.org/abs/2212.02763
4Image Superresolution Using Generative ModelsDeep Constrained Least Squares for Blind Image Super-ResolutionCVPR 2022Shahrooz Faghihroohi Schauer, Robert

https://openaccess.thecvf.com/content/CVPR2022/html/Luo_Deep_Constrained_Least_Squares_for_Blind_Image_Super-Resolution_CVPR_2022_paper.html

Progressive Residual Learning with Memory Upgrade for Ultrasound Image Blind Super-resolutionIEEE BHI 2022https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9684683
Blind Image Super-Resolution: A Survey and BeyondPAMI 2023https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9870558
5Sound and Music Generative ModelsNoise2Music: Text-conditioned Music Generation with Diffusion ModelsArXive 2023Victor Dzhagatspanyanhttps://arxiv.org/pdf/2302.03917.pdf
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking HeadArXive 2023https://arxiv.org/pdf/2304.12995.pdf
AudioGen: Textually Guided Audio GenerationICLR 2023https://openreview.net/forum?id=CYK7RfcOzQ4
63D vessel segmentation3D vessel-like structure segmentation in medical images by an edge-reinforced networkMedIASauer, Bjarnehttps://www.sciencedirect.com/science/article/abs/pii/S1361841522002201
3D Graph-Connectivity Constrained Network for Hepatic Vessel SegmentationIEEE JBHIhttps://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6221020
Noisy Labels are Treasure: Mean-Teacher-Assisted Confident Learning for Hepatic Vessel SegmentationMICCAI 2021https://arxiv.org/pdf/2106.01860.pdf
7Sensorless US compounding3D freehand ultrasound without external tracking using deep learningMedIA 2018Mohammad Farid Azampour Sharma, Devansh https://www.sciencedirect.com/science/article/abs/pii/S1361841518303712
Development of Implicit Representation Method for Freehand 3D Ultrasound Image Reconstruction of Carotid VesselIUS 2022https://ieeexplore.ieee.org/abstract/document/9958448
RecON: Online learning for sensorless freehand 3D ultrasound reconstructionMedIA 2023https://www.sciencedirect.com/science/article/abs/pii/S1361841523000713
8Image to image translation with diffusion modelsDUAL DIFFUSION IMPLICIT BRIDGES FOR IMAGE-TO-IMAGE TRANSLATIONICLR 2023Trigui Amalhttps://arxiv.org/pdf/2203.08382.pdf
DIFFUSION-BASED IMAGE TRANSLATION USING DISENTANGLED STYLE AND CONTENT REPRESENTATIONICLR 2023https://arxiv.org/pdf/2209.15264.pdf
Palette: Image-to-Image Diffusion ModelsArXiv 2022https://arxiv.org/abs/2111.05826
9Structural Continuity in SegmentationDirectional Connectivity-based Segmentation of Medical ImagesCVPR 2023Güvercin, Göktughttps://arxiv.org/pdf/2304.00145.pdf
Introducing Soft Topology Constraints in Deep Learning-based Segmentation using Projected Pooling LossSPIE Medical Imaging 2023https://inria.hal.science/hal-03832309/file/proceedings_version.pdf
Exploring Discontinuity for Video Frame InterpolationCVPR 2023https://arxiv.org/pdf/2202.07291.pdf
10Self-supervised Volume SegmentationMasked Supervised Learning for Semantic SegmentationBMVC 2022Altunbas, Begümhttps://bmvc2022.mpi-inf.mpg.de/0417.pdf
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI ImagesMICCAI 2021https://arxiv.org/pdf/2201.01266.pdf
Volumetric Optimal Transportation by Fast Fourier TransformICLR 2023https://openreview.net/forum?id=EVrz7UM-ZDm
11Physics-inspired Neural NetworksPhysics-Driven Diffusion Models for Impact Sound Synthesis from VideosCVPR 2023Wagner, Jakobhttps://arxiv.org/pdf/2303.16897.pdf
Phase2vec: dynamical systems embedding with a physics-informed convolutional networkICLR 2023https://arxiv.org/pdf/2212.03857.pdf
DaxBench: Benchmarking Deformable Object Manipulation with Differentiable PhysicsICLR 2023https://arxiv.org/pdf/2210.13066.pdf
12Exploring Latest Unsupervised Computer Vision Models for SegmentationEmerging Properties in Self-Supervised Vision TransformersArXiv 2023Klausen, Tobiashttps://arxiv.org/pdf/2104.14294.pdf
DINOv2: Learning Robust Visual Features without SupervisionArXiv 2023https://arxiv.org/pdf/2304.07193.pdf
Segment AnythingArXiv 2023https://arxiv.org/pdf/2304.02643.pdf
13Recent Trends in Medical Image SegmentationAttention-enhanced Disentangled Representation Learning for Unsupervised Domain Adaptation in Cardiac SegmentationMICCAI 2022Unbekannter Benutzer (ge94wiy) Huang, Pei-Ranhttps://rdcu.be/cVRXJ

CRISP- Reliable Uncertainty Estimation for Medical Image SegmentationMICCAI 2022https://rdcu.be/cVVp2

Domain Specific Convolution and High Frequency Reconstruction Based Unsupervised Domain Adaptation for Medical Image Segmentation

MICCAI 2022https://rdcu.be/cVRXA
14non-rigid 2d-3d registration A Weakly Supervised Framework for 2D/3D Vascular Registration Oriented to Incomplete 2D Blood VesselsIEEE Transactions on Medical Robotics and Bionics. 2022Yang, Shucheng

https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9765532&casa_token=Hn0z5R7rWjYAAAAA:QyRY1H7CqD_bC40pAj7us2I74SjInCFI5tEcMj3geniECBfv9Wqw0EY9Zp4Arw1pRyOZU7Qtllc

Non-rigid registration based on hierarchical deformation of coronary arteries in CCTA imagesBiomedical Engineering Letters. 2023https://link.springer.com/article/10.1007/s13534-022-00254-8
CNN-based real-time 2D-3D deformable registration from a single X-ray projectionArXiv 2022https://arxiv.org/pdf/2212.07692.pdf
15Converting weights of 2D Vision Transformer for 3D Image ClassificationCan We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?ArxivMatthias KeicherBen Chaaben, Zeineb

[2209.07026] Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer? (arxiv.org)

Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image SegmentationArxiv

[2302.04303] Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation (arxiv.org)

COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom PretrainingsECCV 22 Workshop

COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings | SpringerLink

16Learning-based Statistical Shape ModelDeep implicit statistical shape models for 3d medical image delineationAAAI 2022Nonehttps://ojs.aaai.org/index.php/AAAI/article/view/20110
Deep Structural Causal Shape ModelsECCV 2022https://arxiv.org/abs/2208.10950
Leveraging unsupervised image registration for discovery of landmark shape descriptorMedIA 2021https://www.sciencedirect.com/science/article/abs/pii/S1361841521002036
17Application of Diffusion Models for Medical ImagingUnsupervised Denoising of Retinal OCT with Diffusion Probabilistic ModelSPIE Medical Imaging 2022Cheng, JiaJianhttps://arxiv.org/abs/2201.11760
On Conditioning the Input Noise for Controlled Image Generation with Diffusion ModelsArxivhttps://arxiv.org/pdf/2205.03859.pdf
Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion ModelsMICCAI 2022https://link.springer.com/chapter/10.1007/978-3-031-16452-1_67
18Self-supervised graph representation learningPrototype-based Embedding Network for Scene Graph GenerationCVPR 2023Oytun Demirbilekhttps://arxiv.org/pdf/2303.07096.pdf
Unbiased Scene Graph Generation in VideosCVPR 2023https://arxiv.org/pdf/2304.00733.pdf
Multi-task Self-supervised Graph Neural Networks Enable Stronger Task GeneralizationICLR 2023https://openreview.net/forum?id=1tHAZRqftM
19Natural Language Explanations for Vision and Vision-Language tasksNLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language TasksCVPR 2022David Bani-HarouniMarin Ruiz, Jorge

https://openaccess.thecvf.com/content/CVPR2022/html/Sammani_NLX-GPT_A_Model_for_Natural_Language_Explanations_in_Vision_and_CVPR_2022_paper.html

CLEVR-X: A Visual Reasoning Dataset for Natural Language ExplanationsICML 2022 workshophttps://link.springer.com/chapter/10.1007/978-3-031-04083-2_5
ALICE: Active Learning with Contrastive Natural Language ExplanationsEMNLP 2020https://aclanthology.org/2020.emnlp-main.355/
20Confidence segmentationAutomated and real-time segmentation of suspicious breast masses using convolutional neural networkPloS one, 2018Yakal, Furkanhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5955504/pdf/pone.0195816.pdf
Leveraging Uncertainty Estimates to Improve Segmentation Performance in Cardiac MRMiccai 2021https://hal.science/hal-03349833/document
NOVEL STRUCTURAL-SCALE UNCERTAINTY MEASURES AND ERROR RETENTION CURVES: APPLICATION TO MULTIPLE SCLEROSISISBI 2023https://arxiv.org/pdf/2211.04825.pdf


Literature and Helpful Links

A lot of scientific publications can be found online.

The following list may help you to find some further information on your particular topic:

Some publishers:

Libraries (online and offline):

Some further hints for working with references:

  • JabRef is a Java program for comfortable working with Bibtex literature databases. Handy feature: if you know the PubMed ID for an article, JabRef can import data from there (via "Web Search/Medline").
  • Mendeley is a cross-platform program for organising your references.

If you find useful resources that are not already listed here, please tell us, so we can add them for others. Thanks.

  • Keine Stichwörter