Sujatha Alla

and 8 more

Histopathologists are experiencing a digital revolution in their field thanks to the digitization of Whole Slide Images (WSIs), which are microscope slides of tissue that can measure gigapixels in size. With so much high resolution data at their disposal, computer vision techniques can now be used to automate laboratory processes, create visual standards, and increase analysis throughput, all of which reduce the workload of pathologists [1]. The ”gold” standard in neuropathology, particularly for Alzheimer’s Disease—is pathological diagnosis made by looking at White Matter Inclusions (WSIs) in brain tissue. Semi-quantitative scoring in accordance with the standards established by the Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) is necessary in order to arrive at a pathology diagnosis. As to these criteria, the diagnosis of Alzheimer’s disease relies heavily on the density of neuritic (cored) Amyloid-β plaques. In order to support in the classification of Alzheimer’s disease, recent studies by Tang et al. [2] and Wong et al. [3] examine the use of Deep Learning (DL) algorithms to WSIs of brain regions that have been histochemically labelled. According to the results from [2], there is a link between semi-quantitative scoring performed by a skilled neuropathologist and scoring generated by a DL, coupled with strong precision and recall metrics. By using the methods of [2] to data from an alternative brain bank, Vizcarra et al. [4] validate it and discover a similar connection between DL and semi-quantitative scoring. The methodology’s successful validation shows potential for incorporating DL into the neuropathology pipeline. While this can shorten the time neuropathologists need to score WSIs, two studies [2], [3] point out a specific problem—the predominant Aβ morphologies are quite rare, with diffuse plaques predominating. Classified as cored plaques or Cerebral Amyloid Angiopathy (CAA), only 1-2% of candidate samples found using conventional computer vision algorithms meet these criteria. Consequently, the significantly imbalanced datasets that are employed may have a negative impact on the DL’s performance. Additionally, trying to increase minority class instances by extra annotation by highly qualified neuropathologists would be prohibitively costly and time-consuming. Therefore, we aim to address the problem of unbalanced datasets for Aβ morphology in brain tissue immune histochemically stained WSIs. As part of our approach, we want to investigate how minority examples of Aβ morphologies, like cored plaques and CAA, may be synthesized using generative modelling, namely Generative Adversarial Networks (GANs), in order to balance the dataset. By doing this, we might also shorten the amount of time needed to classify high-quality datasets. To the best of our knowledge, this is the initial attempt at generatively modeling for the distribution of Aβ morphologies in images. Our objective is to respond to the following questions:Is it possible to develop a GAN architecture that can synthesize unique, high-quality instances of Aβ morphologies that are identical to real samples?Does the GAN prevent the problem of memorization and is it able to sufficiently address the modes of the real data distribution?Are there ever instances of low quality? If so, is it possible to eliminate this through a selection process?Can we use datasets balanced by GAN-oversampling minority data to increase the reliability of downstream classifiers for Alzheimer’s disease?The paper is as follows in the next section we will see the background related to our study. In Section 3, the related works are presented. In Section 4, the materials and methods used are discussed. In Section 5, the experimental analysis along with results are presented and we conclude the paper in Section 6 with some conclusions and future works.