-
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective
Authors:
Shengjia Chen,
Gabriele Campanella,
Abdulkadir Elmas,
Aryeh Stock,
Jennifer Zeng,
Alexandros D. Polydorides,
Adam J. Schoenfeld,
Kuan-lin Huang,
Jane Houldsworth,
Chad Vanderbilt,
Thomas J. Fuchs
Abstract:
Recent advances in artificial intelligence (AI), in particular self-supervised learning of foundation models (FMs), are revolutionizing medical imaging and computational pathology (CPath). A constant challenge in the analysis of digital Whole Slide Images (WSIs) is the problem of aggregating tens of thousands of tile-level image embeddings to a slide-level representation. Due to the prevalent use of datasets created for genomic research, such as TCGA, for method development, the performance of these techniques on diagnostic slides from clinical practice has been inadequately explored. This study conducts a thorough benchmarking analysis of ten slide-level aggregation techniques across nine clinically relevant tasks, including diagnostic assessment, biomarker classification, and outcome prediction. The results yield the following key insights: (1) Embeddings derived from domain-specific (histological images) FMs outperform those from generic ImageNet-based models across aggregation methods. (2) Spatially aware aggregators enhance performance significantly when using ImageNet pre-trained models but not when using FMs. (3) No single model excels in all tasks, and spatially aware models do not show the general superiority that might be expected. These findings underscore the need for more adaptable and universally applicable aggregation techniques, guiding future research towards tools that better meet the evolving needs of clinical AI in pathology. The code used in this work is available at https://github.com/fuchs-lab-public/CPath_SABenchmark.
Submitted 10 July, 2024;
originally announced July 2024.
-
A Clinical Benchmark of Public Self-Supervised Pathology Foundation Models
Authors:
Gabriele Campanella,
Shengjia Chen,
Ruchika Verma,
Jennifer Zeng,
Aryeh Stock,
Matt Croken,
Brandon Veremis,
Abdulkadir Elmas,
Kuan-lin Huang,
Ricky Kwan,
Jane Houldsworth,
Adam J. Schoenfeld,
Chad Vanderbilt
Abstract:
The use of self-supervised learning (SSL) to train pathology foundation models has increased substantially in the past few years. Notably, several models trained on large quantities of clinical data have been made publicly available in recent months. This will significantly enhance scientific research in computational pathology and help bridge the gap between research and clinical deployment. With the increase in availability of public foundation models of different sizes, trained using different algorithms on different datasets, it becomes important to establish a benchmark to compare the performance of such models on a variety of clinically relevant tasks spanning multiple organs and diseases. In this work, we present a collection of pathology datasets comprising clinical slides associated with clinically relevant endpoints including cancer diagnoses and a variety of biomarkers generated during standard hospital operation from two medical centers. We leverage these datasets to systematically assess the performance of public pathology foundation models and provide insights into best practices for training new foundation models and selecting appropriate pretrained models.
Submitted 11 July, 2024; v1 submitted 8 July, 2024;
originally announced July 2024.
-
Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images
Authors:
Gabriele Campanella,
Ricky Kwan,
Eugene Fluder,
Jennifer Zeng,
Aryeh Stock,
Brandon Veremis,
Alexandros D. Polydorides,
Cyrus Hedvat,
Adam Schoenfeld,
Chad Vanderbilt,
Patricia Kovatch,
Carlos Cordon-Cardo,
Thomas J. Fuchs
Abstract:
Recent breakthroughs in self-supervised learning have enabled the use of large unlabeled datasets to train visual foundation models that can generalize to a variety of downstream tasks. While this training paradigm is well suited for the medical domain where annotations are scarce, large-scale pre-training in the medical domain, and in particular pathology, has not been extensively studied. Previous work in self-supervised learning in pathology has leveraged smaller datasets for both pre-training and evaluating downstream performance. The aim of this project is to train the largest academic foundation model and benchmark the most prominent self-supervised learning algorithms by pre-training and evaluating downstream performance on large clinical pathology datasets. We collected the largest pathology dataset to date, consisting of over 3 billion images from over 423 thousand microscopy slides. We compared pre-training of visual transformer models using the masked autoencoder (MAE) and DINO algorithms. We evaluated performance on six clinically relevant tasks from three anatomic sites and two institutions: breast cancer detection, inflammatory bowel disease detection, breast cancer estrogen receptor prediction, lung adenocarcinoma EGFR mutation prediction, and lung cancer immunotherapy response prediction. Our results demonstrate that pre-training on pathology data is beneficial for downstream performance compared to pre-training on natural images. Additionally, the DINO algorithm achieved better generalization performance across all tasks tested. The presented results signify a phase change in computational pathology research, paving the way into a new era of more performant models based on large-scale, parallel pre-training at the billion-image scale.
Submitted 10 October, 2023;
originally announced October 2023.
-
Tell me, what are you most afraid of? Exploring the Effects of Agent Representation on Information Disclosure in Human-Chatbot Interaction
Authors:
Anna Stock,
Stephan Schlögl,
Aleksander Groth
Abstract:
Self-disclosure counts as a key factor influencing successful health treatment, particularly when it comes to building a functioning patient-therapist connection. To this end, the use of chatbots may be considered a promising puzzle piece that helps foster respective information provision. Several studies have shown that people disclose more information when they are interacting with a chatbot than when they are interacting with another human being. If and how the chatbot is embodied, however, seems to play an important role influencing the extent to which information is disclosed. Here, research shows that people disclose less if the chatbot is embodied with a human avatar in comparison to a chatbot without embodiment. Still, there is little information available as to whether it is the embodiment with a human face that inhibits disclosure, or whether any type of face will reduce the amount of shared information. The study presented in this paper thus aims to investigate how the type of chatbot embodiment influences self-disclosure in human-chatbot interaction. We conducted a quasi-experimental study in which $n=178$ participants were asked to interact with one of three settings of a chatbot app. In each setting, the humanness of the chatbot embodiment was different (i.e., human vs. robot vs. disembodied). A subsequent discourse analysis explored differences in the breadth and depth of self-disclosure. Results show that non-human embodiment seems to have little effect on self-disclosure. Yet, our data also show that, contrary to previous work, human embodiment may have a positive effect on the breadth and depth of self-disclosure.
Submitted 23 July, 2023;
originally announced July 2023.
-
Towards smoother surfaces by applying subdivision to voxel data
Authors:
A. Michael Stock,
Sergio López-Ureña
Abstract:
In computed tomography, the approximation quality of a scan of a physical object is typically limited by the acquisition modalities, especially the hardware including X-ray detectors. To improve upon this, we experiment with a three-dimensional subdivision scheme to increase the resolution of the reconstructed voxel data. Subdivision schemes are often used to refine two-dimensional manifolds (mostly meshes), leading to smoother surfaces. In this work, we apply a refinement scheme to three-dimensional data first and only then start the surface extraction process. Thus, the main focus of this work is not subdivision surfaces but subdivision volumes. In the volumetric case, each subdivision iteration consumes eight times as much storage space as the previous one. Hence, we restrict ourselves to a single subdivision iteration. We evaluate the quality of the produced subdivision volumes using synthetic and industrial data. Furthermore, we consider manufacturing errors in the original and in the subdivision volumes, extract their surfaces, and compare the resulting meshes in critical regions. Observations show that our specific choice of a subdivision scheme produces smoothly interpolated data while also preserving edges.
Submitted 13 March, 2023;
originally announced March 2023.