RadNLP 2024 (Natural Language Processing for Radiology) is a shared task in the international conference NTCIR-18, organized by the National Institute of Informatics in Japan.
We propose the tasks, publish the dataset, and call for solutions from participants. RadNLP 2024 aims to create open medical data and contribute insights back to medical and informatic communities.
RadNLP 2024 aims to automatically determine the stage (i.e., the degree of progression) of lung cancer from radiology reports.
Management of lung cancer is based on the stage, and radiology reports provide various related information by describing medical images such as CT and MRI.
However, radiology reports do not always specify the stage explicitly¹. This imposes extra workload on human experts for careful manual information extraction, which can be aided by automation.
All radiology reports in RadNLP 2024 diagnose lung cancer at the initial evaluation. There are no reports of lung cancer post-treatment evaluation or different cancer.
Our datasets contain NO personal health information. The radiology reports are not derived from real medical institutions but are created with crowdsourcing by diagnosing de-identified images on Radiopaedia². Task participants requires no complex applications to use our datasets.
2 Nakamura Y et al. Clinical Comparable Corpus Describing the Same Subjects with Different Expressions. Stud Health Technol Inform 2022:290:253-257.
RadNLP 2024 opens two independent tracks, English Track and Japanese Track:
Participants are welcome to join either the English track, the Japanese track, or both. Scoring and ranking will be conducted independently for each track.
Also, RadNLP 2024 consists of two tasks, the sub task and main task:
Sub task is a document segmentation to identify up to eight spans related to the following topics:
(i) Omittable – Spans that are clearly free of any positive findings or clearly unrelated to lung cancer staging. In this definition, “clearly” means the high clarity that does not require detailed knowledge of the lung cancer staging criteria.
(ii) Measure – Span describing mainly the existence and diameter of the primary lesion.
(iii) Extension – Span describing the range of the primary lesion’s extension outside the lung parenchyma.
(iv) Atelectasis – Span pointing out atelectasis or obstructive pneumonia.
(v) Satellite – Span pointing out intrapulmonary metastasis or lymphangiomatosis carcinomatosa.
(vi) Lymphadenopathy – Span pointing out enlarged regional lymph nodes.
(vii) Pleural – Span pointing out pleural/pericardial effusion/dissemination.
(viii) Distant – Span pointing out distant metastasis outside the lung parenchyma.
In NLP terminology, this sub task is a multi-label sentence binary classification.
The segmentation is at the sentence level, and it is possible for a topic span to be discontinuous or for the same sentence to belong to more than one topic.
Participants are requested to determine whether each sentence falls into categories (i) to (viii). If it does, mark it as “1,” and if it does not, mark it as “0.” Therefore, each sentence requires eight binary answers.
Note that every sentence will either fall into only category (i) Omittable, or into one or more of categories (ii) Measure to (viii) Distant.
The dataset for Sub Task consists of the following two files:
Sample data for Sub Task is available below:
[UPDATED] We calculate the following evaluation metrics, with the one marked with ★ used to sort the leaderboard:
Main task is a multi-label document classification to correctly determine T, N, and M categories for each radiology report.
In RadNLP 2024, we follow the staging criteria by the Japan Lung Cancer Society (JLCS). Note that the JLCS criteria closely aligns the global standard, namely the 8th edition of the TNM Classification of Malignant Tumours by the Union for International Cancer Control (UICC).
The dataset for Main Task consists of the following two files:
Sample data for Main Task is available below:
[UPDATED] We calculate the following evaluation metrics, with the one marked with ★ used to sort the leaderboard:
1. Prepare your email address and click the link below:
2. Read the instruction carefully and open the online registration form.
3. In the registration form, choose “Yes” in Question 12, and check the track(s) to join in Question 13.
* Deciding whether to participate in sub task and/or main task is not mandatory at the time of registration.
We expect that the sub task will support the main task.
Our aim is the NLP application for staging (i.e., the main task), which was also the focus of the last NTCIR-17 shared task.
However, the results from NTCIR-17 indicated that even the most advanced solutions at that time had potential for improvement.
Therefore, we are providing new sentence-level annotations (i.e., the sub task), in addition to document-level annotations, to enable participants to explore various new approaches.
Yes. It is fine for one person to join two or more different teams.
Yes. It is no problem to use the Japanese track’s dataset to solve the English track, or to use the English track’s dataset to solve the Japanese track.
If you re-use one track’s dataset in the other, please specify the detail in your system paper for the sake of reproducibility.
Our policy is that RadNLP 2024 is, rather than a competition, a workshop to welcome participants’ diverse approaches and share the insights widely.
Yes. We will not pose any specific limits on the use of any external resources, including models, dictionaries, corpora, or datasets.
If you use external resources, please specify the detail in your system paper for the sake of reproducibility.
Our policy is that RadNLP 2024 is, rather than a competition, a workshop to welcome participants’ diverse approaches and share the insights widely.
Should you use external resources, please handle them with a maximum consideration not to violate human rights, especially privacy.
No. We request every team to submit ONE system paper and make ONE presentation, even if you participate in both the English and Japanese tracks.
No. We request every team to submit ONE system paper and make ONE presentation, even if you solve both the main and sub tasks.
If you have any other questions, please feel free to contact us.
Department of Computational Diagnostic Radiology and Preventive Medicine, the University of Tokyo Hospital
Department of Radiology, Graduate School of Medicine, the University of Tokyo
Social Computing Laboratory, Nara Institute of Science and Technology (NAIST)
Social Computing Laboratory, Nara Institute of Science and Technology (NAIST)
Division of Radiology and Biomedical Engineering, Graduate School of Medicine, The University of Tokyo
Division of Radiology and Biomedical Engineering, Graduate School of Medicine, The University of Tokyo
Data Science Center, Jichi Medical University
Department of Radiology, The University of Tokyo Hospital
Department of Radiology, Graduate School of Medicine, the University of Tokyo
Social Computing Laboratory, Nara Institute of Science and Technology (NAIST)
Social Computing Laboratory, Nara Institute of Science and Technology (NAIST)
Kyushu University
Department of Advanced Imaging in Medical Magnetic Resonance, Graduate School of Medicine, Kyoto University
Institute of Diagnostic and Interventional Radiology, University Hospital Zurich
Department of Quantitative Biomedicine, University of Zurich
RadNLP 2024 has two preceding shared tasks, whose websites are available below:
E-mail: radnlp [at] googlegroups.com
Designed with WordPress