3,3′-Diaminobenzidine (DAB) was used as a chromogenic substrate, and the slides were counterstained using haematoxylin. == Image acquisition, management and analysis == Digital images were captured using the Aperio ScanScope XT Slide Scanner (Aperio Technologies, Vista, CA, USA) as previously described [10]. analysis algorithms were developed using MatLab 7 (MathWorks, Apple Hill Drive, MA, USA). A fully automated nuclear algorithm was developed to discriminate tumour from normal tissue and to quantify ER and PR expression in both cohorts. Random forest clustering was employed to identify optimum thresholds for survival analysis. == Results == The accuracy of the nuclear algorithm was initially confirmed by a histopathologist, who validated the output in 18 representative images. In these 18 samples, an excellent correlation was evident between the results obtained by manual and automated analysis (Spearman’s = 0.9,P< 0.001). Optimum thresholds for survival analysis were identified using random forest clustering. This revealed 7% positive tumour cells as the optimum threshold for the ER and 5% positive tumour cells for the PR. Moreover, a 7% cutoff level for the ER predicted a better response to tamoxifen than the currently used 10% threshold. Finally, linear regression was employed to demonstrate a more homogeneous pattern of expression for the ER (R= 0.860) than for the PR (R= 0.681). == Conclusions == In summary, we present data on the automated quantification of the ER and the PR in 743 primary breast tumours using a novel unsupervised image analysis algorithm. This novel approach provides a useful tool for the quantification of biomarkers on tissue specimens, as well as for objective identification of appropriate cutoff thresholds for biomarker positivity. It also offers the potential to identify proteins with a homogeneous pattern of expression. == Introduction == The oestrogen receptor (ER) remains the only reliable predictor of endocrine responsiveness in breast cancer, and is arguably the single most important predictive biomarker in clinical oncology today [1]. Moreover, one of the most studied ER-regulated genes is the progesterone receptor (PR). Approximately 70% to 80% of NFKB-p50 all invasive breast cancers are ER-positive and thus are considered likely to respond to endocrine therapy. The PR, which is positive in approximately 60% of cases, may be even more important in predicting response to anti-oestrogens [2]. Both premenopausal and postmenopausal women benefit from 5 years of treatment with the anti-oestrogen tamoxifen [3]. Current treatment guidelines for premenopausal women with hormone-responsive breast cancer advocate a combination of ovarian ablation/suppression or chemotherapy, followed by 5 years of tamoxifen treatment [4-6]. In hormone-responsive postmenopausal ladies, data from large prospective randomised controlled trials including aromatase inhibitors are now growing and herald fresh requirements in adjuvant endocrine treatment [7,8]. The International Expert Consensus on the Primary Therapy of Early Breast Cancer claims that tamoxifen may be an acceptable option, but that aromatase inhibitors have shown superiority over tamoxifen in postmenopausal breast cancer [5]. Irrespective of their NMDA-IN-1 menopausal status, arguably the solitary most important issue for any breast cancer patient is the assessment of her tumour hormone receptor status. The hormone receptor status is definitely routinely evaluated in all resected main tumours to assess the levels of ER and PR. Inmmunohistochemistry (IHC) performed on formalin-fixed cells sections is now the most commonly used assay, having replaced biochemical-based methods. The hormone receptor status is currently assessed by a pathologist; having a cutoff threshold of 10% positive tumour cells becoming commonly used to forecast responsiveness to adjuvant hormonal therapy. Such a threshold can lead to NMDA-IN-1 significant intra-observer variability. For example, one study of 172 German pathologists highlighted the difficulties that can arise from manual assessment, with 24% of ER staining interpreted as being falsely bad [9]. Improved image analysis systems possess the potential to circumvent the burden of interpretation and intra-observer NMDA-IN-1 variability, offering the potential to develop objective automated quantitative rating models for IHC. A move away from the semiquantitative manual rating models NMDA-IN-1 currently used should lead to less variability in results, to improved throughput and to the recognition of fresh prognostic subgroups, which may not have been evident following initial manual analysis alone [10]. In the present article we propose an automated approach, based on unsupervised learning, to accurately assess the ER and PR manifestation levels in an considerable cohort of breast tumor specimens. In particular, our approach utilizes a novel approach to the recognition of tumour nuclei, whereby nontumour constructions, including stromal parts and lymphocytic infiltrate, are instantly excluded from any analysis. Such an approach should allow for more accurate assessment of the IHC transmission. == Materials and methods == == Individuals and tumour samples == Two patient cohorts were used in the present study (Table1). The studies were authorized by the honest committees at Lund University or college and Linkping University or college. == Table 1. == Clinicopathological characteristics of Cohorts I and II Data in parentheses represent percentages unless normally stated. Cohort I (test cohort) consisted of 179 consecutive instances of invasive breast cancer diagnosed in NMDA-IN-1 the Division of Pathology, Malm University or college Hospital, Malm, Sweden,.