Imaging data sets (artificial intelligence)

Changed by Candace Makeda Moore, 14 Jan 2020

Updates to Article Attributes

Body was changed:

The aggregation of an imaging data set is a critical step in building artificial intelligence (AI) for radiology. Imaging data sets are used in various ways, including training and/or testing algorithms. Many data sets for building convolutional neural networks for image identification involve at least thousands of images, but smaller data sets are useful for texture analysis, transfer learning, and other programs.
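As a rough illustration of using one aggregated data set for both training and testing, the sketch below splits a list of image identifiers into two non-overlapping subsets. The function name `split_data_set`, the `test_fraction` and `seed` parameters, and the file names are hypothetical and chosen only for this example; real projects may need more careful splitting schemes.

```python
import random


def split_data_set(image_ids, test_fraction=0.2, seed=0):
    """Shuffle image identifiers and split them into training and testing lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    ids = sorted(image_ids)      # sort first so the result does not depend on input order
    rng = random.Random(seed)    # local generator; a fixed seed makes the split reproducible
    rng.shuffle(ids)
    n_test = max(1, round(len(ids) * test_fraction))  # keep at least one test image
    return ids[n_test:], ids[:n_test]


# Hypothetical file names standing in for a small imaging data set.
images = [f"image_{i:03d}.png" for i in range(10)]
train, test = split_data_set(images, test_fraction=0.2)
print(len(train), "training images;", len(test), "testing images")
```

Fixing the seed is a design choice made here so that the same images land in the test subset on every run, which keeps evaluation results comparable between experiments.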

Many commercial AI products are built on proprietary or hospital-specific data sets that are not publicly available because of patient privacy concerns. There are, however, several data sets of radiological images and/or reports publicly available at the following websites:

Additionally, The Cancer Imaging Archive contains links to many open radiology data sets including the following:

  • -<a title="Computed Tomography Emphysema Database" href="http://image.diku.dk/emphysema_database/">Computed Tomography Emphysema Database </a>small images specifically for texture analysis</li>
  • +<a href="http://image.diku.dk/emphysema_database/">Computed Tomography Emphysema Database </a>small images specifically for texture analysis</li>
  • -<a href="https://openi.nlm.nih.gov/">OpenI - The Open Access Biomedical Image Search Engine</a>: data sets search engine </li>
  • +<a href="https://openi.nlm.nih.gov/">OpenI - The Open Access Biomedical Image Search Engine</a>: data sets search engine, API (application programming interface) to create customized data sets available at <a title="MedPix" href="https://medpix.nlm.nih.gov">MedPix </a>
  • +</li>
