ISMBE 2020

Not-so-supervised Digital Cytopathology

author.DisplayName author.DisplayName 1 author.DisplayName 2 author.DisplayName 1 author.DisplayName 3 author.DisplayName 4 author.DisplayName 1 author.DisplayName 1
1Duke University, USA
2Duke University, USA
3Duke University, USA
4Duke University, USA

Background: Digital pathology is booming with numerous deep-learning applications to whole slide image (WSI) analysis. Most studies consider histopathology, pathology of whole tissue sections. In contrast, we focus on cellular-level cytopathology biopsies, containing separate, sparsely located informative groups of cells. These preoperative cytopathology biopsies are signi cantly more challenging for diagnosis, often resulting in unnecessary surgeries. We consider automatic prediction of thyroid malignancy from cytopathology WSIs, with the goal of reducing unnecessary surgeries.

Methods: We collected a dataset, > 50 terabytes in size, of 908 (799 training, 109 testing) WSIs; among the rst and largest datasets of cytopathology WSIs. Each WSI includes a binary label (malignant/benign), obtained from post-surgery analysis, and a human diagnostic score per WSI; the dataset also includes an overall 4494 local, patch level, labels annotated by a pathologist. We propose a maximum likelihood estimation framework via a \not-so-supervised" deep-learning algorithm: the WSIs are processed in patches, however, only a tiny number of patches are labeled. Our theoretical analysis establishes a link to multiple- and single-instance learning, leading to an improved training strategy.

Results: The algorithm's performance is compared to 3 cytopathologists using area under the curve (auc) (algorithm: 0:932, experts: 0:931; 0:917; 0:909 ), and average precision (ap) (algorithm: 0:872, experts: 0:848; 0:848; 0:759). By further combining algorithm and human decisions, we reduce indeterminate cases by 55% from 47 to 26.

Conclusion: The proposed algorithm achieves human-level performance and reduces inconclusive diagnoses. Future research includes a multi-center study for evaluating the e ect of di erent scanners and staining processes.









Powered by Eventact EMS