Social Computing Laboratory publishes Medical NLP Standard Text datasets = MedTxt as a collection of standard datasets in multiple languages including Japanese for various medical natural language processing tasks. With these datasets, we aim to promote research work and development of systems. We plan to further add multiple datasets for various other tasks. 

We currently provide the two following corpora.