MICCAI 2024 Challenge

CXR-LT: Long-tailed, multi-label, and zero-shot classification on chest X-rays

Location: Amethyste, Palmeraie Palace, Marrakesh, Morocco
Time: 1:30pm (GMT+1), October 10, 2024

Click here to participate in Task 1: https://codalab.lisn.upsaclay.fr/competitions/18601
Click here to participate in Task 2: https://codalab.lisn.upsaclay.fr/competitions/18603
Click here to participate in Task 3: https://codalab.lisn.upsaclay.fr/competitions/18604

Overview

Chest radiography, like many diagnostic medical exams, produces a long-tailed distribution of clinical findings; while a small subset of diseases is routinely observed, the vast majority of diseases are relatively rare [1]. This poses a challenge for standard deep learning methods, which exhibit bias toward the most common classes at the expense of the important but rare "tail" classes [2]. Many existing methods [3] have been proposed to tackle this specific type of imbalance, though only recently has attention been given to long-tailed medical image recognition problems [4-6]. Diagnosis on chest X-rays (CXRs) is also a multi-label problem, as patients often present with multiple disease findings simultaneously; however, only a select few studies incorporate knowledge of label co-occurrence into the learning process [7-9, 12]. Since most large-scale image classification benchmarks contain single-label images with a mostly balanced distribution of labels, many standard deep learning methods fail to accommodate the class imbalance and co-occurrence problems posed by the long-tailed, multi-label nature of tasks like disease diagnosis on CXRs [2].

In the first iteration of CXR-LT held in 2023, we expanded upon the MIMIC-CXR-JPG [10,11] dataset by enlarging the set of target classes from 14 to 26, generating labels for 12 new rare disease findings by parsing radiology reports [13]. While this made for a challenging long-tailed, multi-label disease classification task that attracted 59 teams who contributed over 500 unique submissions, Radiology Gamuts Ontology documents over 4,500 unique radiological image findings. That is, the "true" distribution of all clinical findings on CXR is at least two orders of magnitude longer than what our -- or any existing dataset can offer. For this reason, we argue that the only way to truly tackle the long-tail of radiological image findings is to develop a model that can readily generalize to new classes in "zero-shot" fashion [14].

For this year's version of CXR-LT, we extract labels for an additional 19 rare disease findings (for a total of 377,110 CXR images, each with 45 disease labels) and introduce two new challenge tracks, featuring a zero-shot classification task.


Shared Task

Dataset

In the first iteration of CXR-LT held in 2023, we expanded upon the MIMIC-CXR dataset by enlarging the set of target classes from 14 to 26, generating labels for 12 new rare disease findings by parsing radiology reports. For this year's version of CXR-LT, we extract labels for an additional 19 rare disease findings (for a total of 377,110 CXR images, each with 45 disease labels).

Task

Given a CXR, our challenge includes three tasks, to be held as independent tasks:

For all tasks, participants will be provided with a large, automatically labeled training set of >250,000 CXR images with 40 binary disease labels. While last year's CXR-LT was a success, we hope that CXR-LT 2024 can provide even further meaningful methodological advances toward clinically realistic multi-label, long-tailed, and zero-shot disease classification on CXR.

Evaluation

For each task, models will be evaluated on the provided testing set using “macro-averaged” mean Average Precision (mAP).

Online Evaluation

The competition will be conducted through the CodaLab platform.


To Participate

This competition uses data from MIMIC-CXR-JPG v2.0.0, which requires credentialing through PhysioNet and a signed Data Use Agreement (DUA) for MIMIC-CXR-JPG.

To participate in this competition, you must follow these steps:

If you have completed these steps correctly, you will be admitted to the competition and we will provide links to download the necessary data by email! You are not permitted to share these labels whatsoever.


Important dates

May 01, 2024    Training data released and challenge (development phase) begins

Aug 01, 2024    Test labels released and final evaluation (testing phase) begins

Aug 04, 2024    Testing phase ends and competition is closed.

Aug 15, 2024    Top-performing teams invited to present at MICCAI 2024

Oct 10, 2024    MICCAI 2024 CXR-LT Challenge event


Tentative Schedule

1:30 PM    Introduction to CXR-LT

1:40 PM    Team "zguo", Arizona State University

1:52 PM    Team "XYPB", Yale University

2:04 PM    Team "YYama", University of Tokyo

2:16 PM    Team "yyge", University of Pennsylvania

2:28 PM    Team "pamessina", Pontifical Catholic University of Chile

2:40 PM    Team "tianjie_dai", Shanghai Jiao Tong University

2:52 PM    Team "dongkyunk", Carnegie Mellon University

3:04 PM    Team "yangz16", Rensselaer Polytechnic Institute

3:16 PM    Team "ZhangRuichi", Xiamen University

3:28 PM    Closing remarks


Steering committee

George Shih
Weill Cornell Medicine
Zhiyong Lu
NIH/NLM/NCBI
Ronald M. Summers
NIH Clinical Center
Leo Anthony Celi
MIT/Harvard
Adam E. Flanders
Thomas Jefferson University

Organizers

Mingquan Lin
University of Minnesota
Greg Holste
UT Austin
Song Wang
UT Austin
Yiliang Zhou
Weill Cornell Medicine
Hao Chen
HKUST
Atlas Wang
UT Austin
Yifan Peng
Weill Cornell Medicine

References

  1. Zhou SK, Greenspan H, Davatzikos C, Duncan JS, Van Ginneken B, Madabhushi A, Prince JL, Rueckert D, Summers RM. A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises. Proceedings of the IEEE. 2021 Feb 26;109(5):820-38.
  2. Holste G, Wang S, Jiang Z, Shen TC, Shih G, Summers RM, Peng Y, Wang Z. Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study. In Data Augmentation, Labelling, and Imperfections: Second MICCAI Workshop, DALI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 22, 2022, Proceedings 2022 Sep 16 (pp. 22-32). Cham: Springer Nature Switzerland.
  3. Zhang Y, Kang B, Hooi B, Yan S, Feng J. Deep long-tailed learning: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023 Apr 19.
  4. Zhang R, Haihong E, Yuan L, He J, Zhang H, Zhang S, Wang Y, Song M, Wang L. MBNM: multi-branch network based on memory features for long-tailed medical image recognition. Computer Methods and Programs in Biomedicine. 2021 Nov 1;212:106448.
  5. Ju L, Wang X, Wang L, Liu T, Zhao X, Drummond T, Mahapatra D, Ge Z. Relational subsets knowledge distillation for long-tailed retinal diseases recognition. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part VIII 24 2021 (pp. 3-12). Springer International Publishing.
  6. Yang Z, Pan J, Yang Y, Shi X, Zhou HY, Zhang Z, Bian C. ProCo: Prototype-Aware Contrastive Learning for Long-Tailed Medical Image Classification. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18-22, 2022, Proceedings, Part VIII 2022 Sep 16 (pp. 173-182). Cham: Springer Nature Switzerland.
  7. Chen H, Miao S, Xu D, Hager GD, Harrison AP. Deep hierarchical multi-label classification of chest X-ray images. In International Conference on Medical Imaging with Deep Learning 2019 May 24 (pp. 109-120). PMLR.
  8. Wang G, Wang P, Cong J, Liu K, Wei B. BB-GCN: A Bi-modal Bridged Graph Convolutional Network for Multi-label Chest X-Ray Recognition. arXiv preprint arXiv:2302.11082. 2023 Feb 22.
  9. Chen B, Li J, Lu G, Yu H, Zhang D. Label co-occurrence learning with graph convolutional networks for multi-label chest x-ray image classification. IEEE Journal of Biomedical and Health Informatics. 2020 Jan 16;24(8):2292-302.
  10. Johnson AE, Pollard TJ, Greenbaum NR, Lungren MP, Deng CY, Peng Y, Lu Z, Mark RG, Berkowitz SJ, Horng S. MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs. arXiv preprint arXiv:1901.07042. 2019 Jan 21.
  11. PhysioNet. MIMIC-CXR-JPG - chest radiographs with structured labels [Internet]. Available from: https://physionet.org/content/mimic-cxr-jpg/2.0.0/.
  12. Moukheiber D, Mahindre S, Moukheiber L, Moukheiber M, Wang S, Ma C, Shih G, Peng Y, Gao M. Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays. In Data Augmentation, Labelling, and Imperfections: Second MICCAI Workshop, DALI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 22, 2022, Proceedings 2022 Sep 16 (pp. 112-122). Cham: Springer Nature Switzerland.
  13. CodaLab. CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays [Internet]. Available from: https://codalab.lisn.upsaclay.fr/competitions/12599.
  14. Holste G, Zhou Y, Wang S, Jaiswal A, Lin M, Zhuge S, Yang Y, Kim D, Nguyen-Mau TH, Tran MT, Jeong J. Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge. arXiv preprint arXiv:2310.16112. 2023 Oct 24.

Please contact cxrltchallenge2024@gmail.com if you have any questions. This webpage template is by courtesy of Georgia.