Instructors

Prof. Dr. Christian Wachinger, Tom Nuno Wolf, Fabian Bongratz, Bailiang Jian, Yitong LiEmre Kavak

Contact

If you have any questions regarding this seminar contact seminars@ai-med.de.

Announcements

Registration

Registration to the seminar is done via the TUM Matching Platform. Pay attention to the deadlines!!

Timeline

  • Feb 3, 2025, 1pm: pre-course meeting
  • April 30, 2025, 11am, Seminarraum Holbeinstrasse 11: Kickoff, assignment of papers (attendance is mandatory)
  • During the semester: meet your supervisor (optional but recommended)
  • July 3 & 4, 9am - 3pm, Seminarraum Holbeinstrasse 11: Block seminar (attendance is mandatory)

Topics

Paper IDTitlePublished inLinkGroup/SupervisorStudentAdditional Material
1Toward expert-level medical question answering with large language modelsNature Medicinehttps://www.nature.com/articles/s41591-024-03423-7BailiangJulia Seidl
2Towards accurate differential diagnosis with large language modelsNature Medicinehttps://www.nature.com/articles/s41586-025-08869-4BailiangThanh Dang
3Medical large language models are vulnerable to data-poisoning attacksNature Medicinehttps://www.nature.com/articles/s41591-024-03445-1BailiangSmail Smajlović
4Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-Ray Expert ModelsCVPRhttps://arxiv.org/abs/2404.04936YitongMelek Walha
5MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsICLRhttps://arxiv.org/abs/2410.13085YitongHimel Gosh
6Learning Causal Alignment for Reliable Disease DiagnosisICLRhttps://arxiv.org/abs/2310.01766YitongSimon Bohnen
7CausalDiff: Causality-Inspired Disentanglement
via Diffusion Model for Adversarial Defense
NeurIPShttps://arxiv.org/abs/2410.23091Emre

8

Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography

MICCAIhttps://arxiv.org/abs/2405.12255EmreTimofey Kuznetsov
9Self-supervised Vision Transformer are Scalable
Generative Models for Domain Generalization
MICCAIhttps://arxiv.org/abs/2407.02900EmreClemens Schwarzmann
10

VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis

Arxivhttps://arxiv.org/abs/2410.08397NunoAlexandra Marquardt
11

SynthSR: A public AI tool to turn heterogeneous clinical brain scans into high-resolution T1-weighted images for 3D morphometry

Science Advanceshttps://www.science.org/doi/10.1126/sciadv.add3607FabiKatherina Terefenko
12

SIM: Surface-based fMRI Analysis for Inter-Subject Multimodal Decoding from Movie-Watching Experiments

ICLRhttps://openreview.net/forum?id=OJsMGsO6ynFabiPaul Burkhardt
13

Learnable latent embeddings for joint behavioural and neural analysis

Naturehttps://www.nature.com/articles/s41586-023-06031-6FabiVivien Hopfhttps://cebra.ai/
14

A multimodal generative AI copilot for human pathology

Naturehttps://www.nature.com/articles/s41586-024-07618-3NunoLara Lanz
15

ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image

ECCVhttps://link.springer.com/chapter/10.1007/978-3-031-73661-2_12NunoHyunji Lee

Resources & Material

Giving talks

Doing a TED Talk: The Full Story

TEDx Speaker Guide

The secret structure of great talks

How to Deliver a Great TED Talk

Talk Like TED

Blog posts

ML-Neuro Guidelines for blog post

TUM guide on ChatGPT

BAIR blog

GDLMA blog posts 2021

ML-Neuro blog posts, Summer 2024

  • Keine Stichwörter