MDS_HE_IA

Image analysis of H&E stained slides of patients with myelodysplastic syndrome (MDS) using convolutional neural networks and elastic net.

Visual inspection of Hematoxylin&Eosin (H&E) stained tissue samples is a standard procedure for histopathological diagnosis. We collected formalin-fized paraffin-embedded (FFPE) bone marrow (BM) trephine biopsies from 236 MDS patients, 87 MDS/myeloproliferative neoplasia (MPN) patients and 10 healthy controls. These were transformed into tissue microarrays (TMA) and stained with a standard in-house H&E staining. Each image was split into 500 equally-sized tiles and colors converted into grayscale for more robust analysis. Only non-white (= non-empty) tiles were retained for image analysis. Visual features were extracted at the tile level with VGG16 and Xception networks using imagenet weights. In addition, we analyzed each TMA image at the pixel-level using a trained Weka model with Fiji and at the cell level using Qupath (analytical design and settings are described in separate manuscript). Hence, we are able to associate high-resolution image analysis data from CNNs with more comprehensible image analysis features from pixel and cell level image analysis.

Visual features from the CNNs were fed into several elastic net model with predefined alpha penalization (0.0, 0.5 and 1.0) and lambda values were optimized for each. The dataset was divided at the TMA level into training (2/3) and test (1/3) datasets. In total 3 different penalization from 2 different CNN models were used to predict

overall survival
risk of AML
treatment response to azacitidine
diagnosis (between MDS and MDS/MPN)
mutation status for TP53, ASXL1, DNMT3A, SRSF2, TET2, RUNX1, SF3B1, NRAS/KRAS, IDH1/IDH2, STAG2
mutations in cell cycle, cell differentiation, DNA chromatin structurea and spliceosome regulation pathways
aberrant karyotype, chromosome 5q deletion, 7q deletion, 7 monosomy, 8 trisomy, 20q deletion, complex karyotype
patient age and gender
de novo vs. secondary MDS
ipssr score and ipssr cytopenia score

Only visual features from MDS patients were included for these analyses except for diagnosis prediction where also MDS/MPN patients included.

As visual features from the Xception network performed superiorly to VGG16, we used these features to analyze the natural embedding of MDS, MDS/MPN and control subjects in an unsupervised fashion. For this purpose, we used PCA and UMAP and clustered data with Phenograph and Kmeans clustering. Due to the non-parametric nature of visual data, we observed UMAP to perform more accurately. As tile-level and aggregated sample-level analyses differ both by matrix size and by purpose, we used Phenograph for tile-level feature inspection to increase granularity and k-means for sample-level analyses to to facilitate robust conclusions.

This repository included scripts to

preprocess image data (Python)
extract visual features (R)
build elastic net models (R)
generate AUC of prediction model ROCs and plot AUC data + generate most representative images for each prediction models to assess models + combine tile-level prediction models with pixel-level and cell-level image analysis data (R)
to run unsupervised analyses on visual feature matrices using UMAP and PCA and cluster these with Kmeans and Phenograph. Clusters are compared to each others with wilcoxon test (continuous variables) or chi2 test (categorical variables) (R)

Hardware

Steps 1, 4 and 5 have been analyzed with a MacBook Pro 2018 (macOS Mojave, 16 GB 2133 MHz LPDDR3, 2.3 GHz Intel Core i5)
Steps 2 and 3 were analyzed in a Linux kernel (CentOS Linux 7, 4 Cores/20 GB)

Scripts have been written and implemented by Oscar Brück.

Oscar Brück, MD
Hematology Research Unit Helsinki, University of Helsinki &
Data Admnistration, Helsinki University Hospital &
Hematology, Helsinki University Hospital
Helsinki, Finland

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
scripts		scripts
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MDS_HE_IA

About

Releases

Packages

Languages

obruck/MDS_HE_IA

Folders and files

Latest commit

History

Repository files navigation

MDS_HE_IA

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages