MICCAI 2024 ChallengeCXR-LT: Long-tailed, multi-label, and zero-shot classification on chest X-rays |
||
Location: Amethyste, Palmeraie Palace, Marrakesh, Morocco Time: 1:30pm (GMT+1), October 10, 2024 |
Chest radiography, like many diagnostic medical exams, produces a long-tailed distribution of clinical findings; while a small subset of diseases is routinely observed, the vast majority of diseases are relatively rare [1]. This poses a challenge for standard deep learning methods, which exhibit bias toward the most common classes at the expense of the important but rare "tail" classes [2]. Many existing methods [3] have been proposed to tackle this specific type of imbalance, though only recently has attention been given to long-tailed medical image recognition problems [4-6]. Diagnosis on chest X-rays (CXRs) is also a multi-label problem, as patients often present with multiple disease findings simultaneously; however, only a select few studies incorporate knowledge of label co-occurrence into the learning process [7-9, 12]. Since most large-scale image classification benchmarks contain single-label images with a mostly balanced distribution of labels, many standard deep learning methods fail to accommodate the class imbalance and co-occurrence problems posed by the long-tailed, multi-label nature of tasks like disease diagnosis on CXRs [2].
In the first iteration of CXR-LT held in 2023, we expanded upon the MIMIC-CXR-JPG [10,11] dataset by enlarging the set of target classes from 14 to 26, generating labels for 12 new rare disease findings by parsing radiology reports [13]. While this made for a challenging long-tailed, multi-label disease classification task that attracted 59 teams who contributed over 500 unique submissions, Radiology Gamuts Ontology documents over 4,500 unique radiological image findings. That is, the "true" distribution of all clinical findings on CXR is at least two orders of magnitude longer than what our -- or any existing dataset can offer. For this reason, we argue that the only way to truly tackle the long-tail of radiological image findings is to develop a model that can readily generalize to new classes in "zero-shot" fashion [14].
For this year's version of CXR-LT, we extract labels for an additional 19 rare disease findings (for a total of 377,110 CXR images, each with 45 disease labels) and introduce two new challenge tracks, featuring a zero-shot classification task.
In the first iteration of CXR-LT held in 2023, we expanded upon the MIMIC-CXR dataset by enlarging the set of target classes from 14 to 26, generating labels for 12 new rare disease findings by parsing radiology reports. For this year's version of CXR-LT, we extract labels for an additional 19 rare disease findings (for a total of 377,110 CXR images, each with 45 disease labels).
Given a CXR, our challenge includes three tasks, to be held as independent tasks:
For each task, models will be evaluated on the provided testing set using “macro-averaged” mean Average Precision (mAP).
The competition will be conducted through the CodaLab platform.
This competition uses data from MIMIC-CXR-JPG v2.0.0, which requires credentialing through PhysioNet and a signed Data Use Agreement (DUA) for MIMIC-CXR-JPG.
To participate in this competition, you must follow these steps:
If you have completed these steps correctly, you will be admitted to the competition and we will provide links to download the necessary data by email! You are not permitted to share these labels whatsoever.
May 01, 2024 Training data released and challenge (development phase) begins
Aug 01, 2024 Test labels released and final evaluation (testing phase) begins
Aug 04, 2024 Testing phase ends and competition is closed.
Aug 15, 2024 Top-performing teams invited to present at MICCAI 2024
Oct 10, 2024 MICCAI 2024 CXR-LT Challenge event
1:30 PM Introduction to CXR-LT
1:40 PM Team "zguo", Arizona State University
1:52 PM Team "XYPB", Yale University
2:04 PM Team "YYama", University of Tokyo
2:16 PM Team "yyge", University of Pennsylvania
2:28 PM Team "pamessina", Pontifical Catholic University of Chile
2:40 PM Team "tianjie_dai", Shanghai Jiao Tong University
2:52 PM Team "dongkyunk", Carnegie Mellon University
3:04 PM Team "yangz16", Rensselaer Polytechnic Institute
3:16 PM Team "ZhangRuichi", Xiamen University
3:28 PM Closing remarks
Please contact cxrltchallenge2024@gmail.com if you have any questions. This webpage template is by courtesy of Georgia.