Advertisement
Thoracic: Lung Cancer| Volume 13, P389-410, March 2023

Download started.

Ok

Highly sensitive noninvasive early lung cancer detection using DNA methylation topology in sputum-derived epithelial cells

Open AccessPublished:December 09, 2022DOI:https://doi.org/10.1016/j.xjon.2022.11.018

      Abstract

      Objective

      Sputum is a source of exfoliated respiratory epithelial cells transformed early in lung carcinogenesis. Malignant cells are hypomethylated and contain less genomic 5-methylcytosine (5mC). Validating a test that recognizes and quantifies aberrantly hypomethylated cells in sputum, we assessed its potential as a screening tool for detecting early-stage non–small cell lung cancer.

      Methods

      Cells extracted from sputum were immunofluorescence labeled with an anti-5-methylcytosine antibody and counterstained with 4′,6-diamidino-2-phenylindole (DAPI) delineating global nuclear DNA (gDNA). Via confocal scanning and 3-dimensional image analysis, fluorescence 5mC and DAPI signals were measured in segmented cell nuclei, and a 5mC/DAPI co-distribution map was generated for each imaged cell. Cells were classified as hypomethylated based on 5mC load and 5mC/DAPI co-distribution. The proportion of hypomethylated epithelial cells in the sputum determines whether a patient has lung cancer.

      Results

      A total of 88 subjects were enrolled: 12 healthy subjects; 34 high-risk subjects with benign chronic lung disorders (10 with chronic obstructive pulmonary disease, 24 with idiopathic pulmonary fibrosis), and 43 subjects with non–small cell lung cancer (27 with stage I-II and 16 with stage III-IV). The test identified early-stage non–small cell lung cancer and distinguished it from the high-risk group with 95.8% (95% confidence interval, 78.9-99.9) sensitivity and 41.2% (95% confidence interval, 24.6-59.3) specificity applying only 5mC, 95.8% (95% confidence interval, 78.9-99.9) sensitivity and 26.5% (95% confidence interval, 12.9-44.4) specificity using solely 5mC/DAPI index, and 100% (95% confidence interval, 98.7-100) sensitivity and 26.1% (95% confidence interval, 26.2-27.8) specificity with the combined parameters.

      Conclusions

      We tested and validated a novel, noninvasive, highly sensitive screening test for non–small cell lung cancer. With the use of sputum, our test may impact lung cancer screening, evaluation of pulmonary nodules, and cancer surveillance algorithms.

      Video Abstract

      Graphical abstract

      Key Words

      Abbreviations and Acronyms:

      CI (confidence interval), COPD (chronic obstructive pulmonary disease), CT (computed tomography), DAPI (4′,6-diamidino-2-phenylindole), 5mC (5-methylcytosine), gDNA (global nuclear DNA), IPF (idiopathic pulmonary fibrosis), NSCLC (non–small cell lung cancer), PBS (phosphate-buffered saline), 3D (3-dimensional), 3D-qDMI (3-dimensional quantitative DNA methylation imaging)
      Figure thumbnail fx2
      The fluorescence microscopic image shows epithelial cells that have been extracted from human sputum and labeled for global DNA methylation (5mC, false-colored green), and global DNA (DAPI, false-colored blue); both markers are localized in the cell nucleus. Also, the cell bodies have been delineated by a cytoplasmic marker (false-colored red). The normal sputum contains a majority of methylated cells (cell 1 type in the fluorescence image), whereas the cancer sputum comprises a bulk of hypomethylated cells (cell 2 type). Methylated cells are characterized by a high 5mC/DAPI colocalization index (∂) and hypomethylated cells have a low index (< 15°). DAPI, 4′,6-diamidino-2-phenylindole; 5mC, 5-methylcytosine; gDNA, global nuclear DNA.
      With the use of sputum, we have developed a novel, highly sensitive screening test for NSCLC. This may impact lung cancer screening, evaluation of pulmonary nodules, and cancer surveillance algorithms.
      Currently, there are no reliable noninvasive, nonradiologic screening tests for NSCLC despite being the most common cause of cancer mortality. By using induced sputum specimens to quantify the proportion of hypomethylated cells, our novel test may be able to impact lung screening and surveillance by providing a high-sensitivity adjunct to the traditional algorithm.
      When detected early, patients with early-stage non–small cell lung cancer (NSCLC) can undergo complete surgical resection with a relatively high 5-year survival.
      • Goldstraw P.
      • Chansky K.
      • Crowley J.
      • Rami-Porta R.
      • Asamura H.
      • Eberhardt W.E.E.
      • et al.
      The IASLC lung cancer staging project: proposals for revision of the TNM Stage Groupings in the forthcoming (eighth) edition of the TNM classification for lung cancer.
      High-risk groups have been identified and include smokers and patients with chronic pulmonary diseases, such as idiopathic pulmonary fibrosis (IPF) and chronic obstructive pulmonary disease (COPD). Additionally, patients who have undergone resection for lung cancer have a high risk (up to 50%) of recurrence within the first 2 years after surgery and require close surveillance. Current approaches in diagnosing lung cancer include bronchoscopy, chest x-ray, computed tomography (CT), and positron emission tomography (PET); each is limited in detecting early-stage lung cancers. CT scans have a 20% to 30% risk of false-positive results.
      • Pinsky P.F.
      • Bellinger C.R.
      • Miller Jr., D.P.
      False-positive screens and lung cancer risk in the National Lung Screening Trial: implications for shared decision-making.
      In addition, CT imaging, even at lower doses (low-dose CT), may increase the risk of developing secondary malignancies due to repetitive exposure of ionizing radiation, as used for serial and frequent monitoring.
      • Lin E.C.
      Radiation risk from medical imaging.
      • Fabricant P.D.
      • Berkes M.B.
      • Dy C.J.
      • Bogner E.A.
      Diagnostic medical imaging radiation exposure and risk of development of solid and hematologic malignancy.
      • Brower C.
      • Rehani M.M.
      Radiation risk issues in recurrent imaging.
      Thus, there is opportunity for alternative or adjunctive technology to augment our current approach of diagnosing lung cancer.
      Sputum is a valuable tissue surrogate and a proliferative source of upper respiratory cells that undergo malignant transformation. Earlier trials using sputum cytology for noninvasive early detection of lung cancer were dependent on morphological changes of exfoliated epithelial respiratory cells; detection used staining methods such as Papanicolaou, May-Grunwald, Ziehl-Neelsen and Gomori-Methenamine, or noncommercial antibody-based colorimetric staining. These trials were not consistent in detecting lung cancer and provided no clinical advantage.
      • Flehinger B.J.
      • Melamed M.R.
      • Zaman M.B.
      • Heelan R.T.
      • Perchick W.B.
      • Martini N.
      Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Memorial Sloan-Kettering study.
      • Fontana R.S.
      • Sanderson D.R.
      • Taylor W.F.
      • Woolner L.B.
      • Miller W.E.
      • Muhm J.R.
      • et al.
      Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Mayo Clinic study.
      • Frost J.K.
      • Ball Jr., W.C.
      • Levin M.L.
      • Tockman M.S.
      • Baker R.R.
      • Carter D.
      • et al.
      Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Johns Hopkins study.
      • Kubik A.
      • Polak J.
      Lung cancer detection. Results of a randomized prospective study in Czechoslovakia.
      Methylation of certain genes in sputum of high-risk patients has been successful in detecting lung cancer cells.
      • Patz Jr., E.F.
      • Goodman P.C.
      • Bepler G.
      Screening for lung cancer.
      • Zochbauer-Muller S.
      • Minna J.D.
      • Gazdar A.F.
      Aberrant DNA methylation in lung cancer: biological and clinical implications.
      • Brock M.V.
      • Hooker C.M.
      • Ota-Machida E.
      • Han Y.
      • Guo H.
      • Ames S.
      • et al.
      DNA methylation markers and early recurrence in stage I lung cancer.
      • Tessema M.
      • Yu Y.Y.
      • Stidley C.A.
      • Machida E.O.
      • Schuebel K.E.
      • Baylin S.B.
      • et al.
      Concomitant promoter methylation of multiple genes in lung adenocarcinomas from current, former and never smokers.
      • Leng S.
      • Do K.
      • Yingling C.M.
      • Picchi M.A.
      • Wolf H.J.
      • Kennedy T.C.
      • et al.
      Defining a gene promoter methylation signature in sputum for lung cancer risk assessment.
      • Diaz-Lagares A.
      • Mendez-Gonzalez J.
      • Hervas D.
      • Saigi M.
      • Pajares M.J.
      • Garcia D.
      • et al.
      A novel epigenetic signature for early diagnosis in lung cancer.
      This approach has been limited as a general detection test because it is dependent on identifying specific abnormalities. The heterogeneity of lung cancer makes this initial approach challenging as applied methods average gene methylation across a large number of cells, thereby obscuring significant subtle information specific to a subgroup of cancerous cells in sputum.
      To address these limitations, we propose a novel approach that analyzes the global DNA methylation status of exfoliated respiratory epithelial cells in the sputum on a cell-by-cell basis (Figure 1 and Video Abstract). Global genome-wide DNA methylation in epithelial cells occurs early in tumor development and is ubiquitous across all major types of cancer.
      • Feinberg A.P.
      • Ohlsson R.
      • Henikoff S.
      The epigenetic progenitor origin of human cancer.
      Global hypomethylation affects largely repetitive DNA elements (comprising ∼50% of the human genome), results in genome instability, and parallels closely the degree of stage-defined malignancy.
      • Baylin S.B.
      • Jones P.A.
      Epigenetic determinants of cancer.
      • Ehrlich M.
      • Lacey M.
      DNA hypomethylation and hemimethylation in cancer.
      • Shen H.
      • Laird P.W.
      Interplay between the cancer genome and epigenome.
      The availability of more sensitive technologies, combining immunofluorescence labeling, high-resolution confocal scanning, and single-cell image cytometry for quantifying epigenetic biomarkers in extracted respiratory cells, allows the identification of aberrant cells, leading to the detection of lung cancer. The process involves inducing sputum, followed by staining isolated cells with fluorescent reporters that produce a specific pattern in the nuclei of labeled cells visualized by light microscopy and analyzed with a high-content imaging platform. The specific pattern produced is associated with the nuclear distribution of methylated DNA, which is altered in lung cells during early cancer development, differentiating it from normal respiratory cells.
      Figure thumbnail gr1
      Figure 1The study protocol and different histograms for 5mC load and 5mc/DAPI colocalization index for each of the study groups are described. There is a significantly increase in hypomethylated cells from the normal population to the high-risk benign to early-stage lung cancer to late-stage lung cancer. This similar phenomenon is illustrated in the scatter plots. 3D, 3-Dimensional; 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole.
      Tajbakhsh and colleagues,
      • Tajbakhsh J.
      • Mortazavi F.
      • Gupta N.K.
      DNA methylation topology differentiates between normal and malignant in cell models, resected human tissues, and exfoliated sputum cells of lung epithelium.
      who previously used this cell-phenotyping assay, identified DNA methylation patterns that are specific to cancerous cells of the lung epithelium. The matching sputum of patients with cancer contained cells with aberrant DNA methylation patterns that resembled the signature of cancer cells in the malignant tissue. The fundamentals of our high-content single-cell analysis—based on the 3-dimensional quantitative DNA methylation imaging (3D-qDMI) platform to expose and enumerate aberrantly methylated cells—have been well reported and the approach validated in basic studies using a variety of cells and tissues.
      • Tajbakhsh J.
      • Mortazavi F.
      • Gupta N.K.
      DNA methylation topology differentiates between normal and malignant in cell models, resected human tissues, and exfoliated sputum cells of lung epithelium.
      • Gertych A.
      • Wawrowsky K.A.
      • Lindsley E.
      • Vishnevsky E.
      • Farkas D.L.
      • Tajbakhsh J.
      Automated quantification of DNA demethylation effects in cells via 3D mapping of nuclear signatures and population homogeneity assessment.
      • Gertych A.
      • Farkas D.L.
      • Tajbakhsh J.
      Measuring topology of low-intensity DNA methylation sites for high-throughput assessment of epigenetic drug-induced effects in cancer cells.
      • Oh J.H.
      • Gertych A.
      • Tajbakhsh J.
      Nuclear DNA methylation and chromatin condensation phenotypes are distinct between normally proliferating/aging, rapidly growing/immortal, and senescent cells.
      • Tajbakhsh J.
      Covisualization of global DNA methylation/hydroxymethylation and protein biomarkers for ultrahigh-definition epigenetic phenotyping of stem cells.
      • Tajbakhsh J.
      • Gertych A.
      • Fagg W.S.
      • Hatada S.
      • Fair J.H.
      Early in vitro differentiation of mouse definitive endoderm is not correlated with progressive maturation of nuclear DNA methylation patterns.
      • Tajbakhsh J.
      • Stefanovski D.
      • Tang G.
      • Wawrowsky K.
      • Liu N.
      • Fair J.H.
      Dynamic heterogeneity of DNA methylation and hydroxymethylation in embryonic stem cell populations captured by single-cell 3D high-content analysis.
      • Stefanovski D.
      • Tang G.
      • Wawrowsky K.
      • Boston R.C.
      • Lambrecht N.
      • Tajbakhsh J.
      Prostate cancer diagnosis using epigenetic biomarkers, 3D high-content imaging and probabilistic cell-by-cell classifiers.
      Following up on the recent encouraging data with cells of the lung origin, we hypothesized that the detection and enumeration of aberrantly methylated cells in sputum may serve as an indication for early tumor development. We report on the first clinical study to assess the feasibility of using DNA methylation topology toward early lung cancer detection.

      Materials and Methods

      Study Design and Patients

      The study protocol was approved by the Cedars-Sinai Internal Review Board (#Pro00039831 and Pro00046876, approval date May 1, 2016, and January 23, 2018). The Institutional Review Board or equivalent ethics committee of Cedars-Sinai Medical Center approved the study protocol and publication of data. The patients provided informed written consent for the publication of the study data. This prospective study enrolled 88 subjects, divided into 3 groups: 11 (12%) in the healthy/normal group with no history of cancer or pulmonary diseases; 34 (39%) in the high-risk/benign group with benign chronic lung disorders (10 COPD, 24 IPF) without cancer; and 43 (49%) in the cancer group with patients who have NSCLC and have not received treatment (27 with stage I-II, 16 with stage III-IV). Subjects were recruited based on International Classification of Diseases 9th Revision diagnosis code. All diagnoses were confirmed via review.

      Sample Collection and Processing

      Figure 2 depicts the workflow and process. Sputum induction was performed by inhalation of hypertonic saline. The sputum collection process was standardized for all subjects. Samples were collected and kept at 4 °C until processing. A sputum of 1 to 5 mL of volume will yield thousands of epithelial cells and is sufficient for performing the test. Samples were diluted with Sputolysin (Calbiochem) Worksolution, a phosphate-buffered saline (PBS) solution containing 10 mmol/L dithiothreitol. After every incubation, the sample was centrifuged at 400g for 10 minutes at 4 °C to separate cellular and fluid phases. The cell pellet was resuspended in PBS and then transferred onto a microscope slide using a cytospin apparatus for cell attachment to the glass support. Samples were transferred onto glass slides and immediately fixed with 4% paraformaldehyde and kept in PBS at 2 °C to 8 °C until used in downstream processing. Fixed slides were subjected to an immunofluorescence assay.
      Figure thumbnail gr2
      Figure 2Schematic of workflow process. TIFFs, Tag image transfer files; 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole; 2D, 2-dimensional.

      Immunofluorescence Labeling

      Immunofluorescence was performed according to previously established protocols.
      • Oh J.H.
      • Gertych A.
      • Tajbakhsh J.
      Nuclear DNA methylation and chromatin condensation phenotypes are distinct between normally proliferating/aging, rapidly growing/immortal, and senescent cells.
      Primary and secondary antibody sets included unconjugated monoclonal mouse anti–5-methylcytosine (5mC) monoclonal antibody (Cat. No. AMM99021, Aviva Systems Biology) at 1 μg/mL concentration and Alexa488-linked donkey anti-mouse immunoglobulin-G (Cat. No. A-21202, Thermo Fisher Scientific) at 5 μg/mL concentration. The specificity/dynamic range of the antibodies was tested as previously reported.
      • Gertych A.
      • Oh J.H.
      • Wawrowsky K.A.
      • Weisenberger D.J.
      • Tajbakhsh J.
      3-D DNA methylation phenotypes correlate with cytotoxicity levels in prostate and liver cancer cell models.
      Specimens were counterstained with 4′,6-diamidino-2-phenylindole (DAPI) (Cat. No. D1306, Thermo Fisher Scientific) for detection of global nuclear DNA (gDNA). Cell Mask Red (Cat. No. H32712, Thermo Fisher Scientific) was used to delineate cell cytoplasm for identification of flat epithelial cells. By staining of the large skirt-like cytoplasm, epithelial cells can be easily identified and distinguished from immune cells with a significantly smaller cell body under a fluorescent microscope. The staining lends itself to convenient size exclusion of cells other than epithelial cells in an automated mode. Figure 3 shows impressions of labeled cells isolated from different sputum samples. For each sputum sample, we analyzed an average of 150 cells.
      Figure thumbnail gr3
      Figure 3Fluorescent images of selected sputum samples. A, High-resolution area on the slide with few cells showing a less crowded sample compared with cell sample in (B) with significantly more immune cells (monocytes), due to a higher degree of lung inflammation. 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole.

      Confocal Imaging and 3-Dimensional Image Analysis

      Subcellular-resolution images were acquired with a TCS SP5 X Supercontinuum confocal scanning microscope (Leica Microsystems). Serial optical sections were collected at increments of 500 nm. The output file format is a series of TIFF images that can be used for 3-dimensional (3D) image analysis. We used an in-house devised algorithm for pattern recognition and multi-parametric high-content analysis, which has been previously described.
      • Gertych A.
      • Wawrowsky K.A.
      • Lindsley E.
      • Vishnevsky E.
      • Farkas D.L.
      • Tajbakhsh J.
      Automated quantification of DNA demethylation effects in cells via 3D mapping of nuclear signatures and population homogeneity assessment.
      ,
      • Gertych A.
      • Farkas D.L.
      • Tajbakhsh J.
      Measuring topology of low-intensity DNA methylation sites for high-throughput assessment of epigenetic drug-induced effects in cancer cells.
      This image analysis tool operates in 3 steps: (1) Cells (within imaged cell populations) are processed for 3D segmentation; (2) fluorescence signal in 5mC and DAPI channels residing within nuclei of (only) epithelial cells are recorded; and (3) the system calculates the 5mC and gDNA content of the entire nucleus, and generates co-distribution maps (2-dimensional scatter plots) of 5mC signals and gDNA (represented by DAPI), discerned as 2-dimensional scatter plots for each individual cell in which each pixel on the plot has a 5mC-intensity value (y-axis) and a DAPI-intensity value (x-axis) representing 1 voxel in a cell nucleus (Figure 4). 5mC/DAPI signal colocalization index was calculated as the angle ∂ under the regression line and is considered a parameter for the 5mC/gDNA co-distribution.
      Figure thumbnail gr4
      Figure 4Global DNA methylation phenotyping of human sputum-derived cells with 3D-qDMI. Confocal images of fluorescently labeled cells. The cytoplasm is delineated by Cell Mask Red (red), the cell nucleus (global DNA) is delineated by DAPI (blue), and gDNA methylation is visualized by an antibody specific to 5mC (green). Left: The sputum of healthy individuals (normal) contains a majority of highly methylated cells (cell type 1) and sporadically a few hypomethylated cells (cell type 2). Right: In contrast, the sputum of a patient with lung cancer contains a large number of hypomethylated cells. The respective nuclear 5mC/DAPI co-distribution patterns, presented as scatter plots, show that normally methylated (type 1) cells in both sputum-donor groups display a steep regression line (defined by ∂) synonymous with a higher 5mC/DAPI colocalization index, whereas hypomethylated cells (type 2) produce a flatter regression line. Moreover, a frequent signature of the lung cancer–specific hypomethylated cells is the less dispersed and narrow co-distribution of 5mC and DAPI. 3D-qDMI was able to successfully distinguish between normal and aberrant cell types based on 2 5mC-biomarker parameters: global nuclear 5mC intensity and 5mC/gDNA co-distribution (index). 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole.

      Data and Statistical Analysis

      Sputum was characterized by its content of abnormal cells, and tests (binary classifiers) were established on the basis of the 2 parameters, 5mC and 5mC/DAPI colocalization (continuous measurements). For each parameter, thresholds for abnormal cells in sputum at the cell level and the percentage of abnormal cells at the patient level were derived. Patients were classified as “positive” or “negative” for lung cancer as follows: On the cell level, cells were considered abnormal if the cell value was less than a threshold where all individual cell values were considered candidate thresholds and normal if the cell value was equal to or above the threshold. On the patient level, the percentage of abnormal cells was calculated for each candidate threshold value of abnormal cells. Patients were classified as positive for lung cancer if the percentage of abnormal cells was greater or equal to a threshold where all individual values of the percentage of abnormal cells were considered candidate thresholds and negative for lung cancer if the percentage of abnormal cells was less than the established threshold. For each candidate threshold value of abnormal cells, a logistic regression model was fit at the patient level with the corresponding percentage of abnormal cells as an independent variable and 2 groups of interest as a dependent variable (eg, early-stage lung cancer vs high-risk patients with benign lung disease (COPD/IPF)), and then sensitivity and specificity were calculated at each value of the percentage of abnormal cells to discriminate between the 2 groups of interest. At each combination of candidate threshold values of abnormal cells and the percentage of abnormal cells, sensitivity and specificity were calculated and the optimal combination of threshold values that most differentiate 2 groups of interest were chosen on the basis of a minimum value for sensitivity of 90% or 95% while maximizing specificity across all possible combinations of thresholds.
      • Schafer H.
      Constructing a cut-off point for a quantitative diagnostic test.
      We chose the optimal combination of threshold values with the largest value for abnormal cells and smallest value for the percentage of abnormal cells when multiple combinations of threshold values produced the same sensitivity or specificity. For each parameter, sensitivity and specificity and corresponding 95% confidence intervals (CIs) were estimated at the optimal combination of thresholds, and patients were classified as “positive” or “negative.” Test results from the 2 parameters were combined in the following way: If a patient was classified as positive based on either parameter, then the patient was classified as positive. Otherwise, the patient was classified as negative. The performance of the combined test was examined with sensitivity and specificity, and corresponding 95% CIs were estimated. Internal validation for the combined test was conducted by estimating and correcting possible optimism in sensitivity and specificity (optimism-corrected sensitivity and specificity) using the bootstrap method with 100 replicates as described by Harrell and colleagues,
      • Harrell Jr., F.E.
      • Lee K.L.
      • Mark D.B.
      Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.
      ,
      • Harrel F.E.
      Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. Efficiency of Some Procedures for Logistic Regression Analysis.
      which provides stable estimates with lower bias than a split-sample procedure. This approach has been endorsed by Steyerberg and colleagues.
      • Steyerberg E.W.
      • Harrell Jr., F.E.
      • Borsboom G.J.
      • Eijkemans M.J.
      • Vergouwe Y.
      • Habbema J.D.
      Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.
      Data were resampled with replacement 100 times, each with the same sample size as the original data (bootstrap data). An optimal combination of 2 thresholds was chosen for each bootstrap data. Each optimal combination of thresholds was then applied to both the bootstrap data and original data, and corresponding sensitivity and specificity for the combined test were estimated for both. The difference between these 2 was calculated for sensitivity and specificity and then averaged over 100 bootstrap data, which is called an “optimism.” The optimism-corrected sensitivity and specificity were calculated as the original sensitivity and specificity minus the estimated optimism for sensitivity and specificity, respectively. Statistical analyses were performed using R version 4.0.5 and the R packages (OptimalCutpoints, epiR, tidyverse, boot, and parallel) with 2-sided tests at a significance level of .05.

      Pathologic Analysis

      For comparative cell phenotyping between our 3D approach and conventional cytology, each selected sputum sample was split into 2 equal aliquots (before sample processing). One aliquot was processed for 3D analysis as mentioned earlier. The other aliquot was transferred to the Cedars-Sinai Medical Center Cytopathology Lab for immediate processing. Each sample was fixed in 50 mL of Saccomanno's fixative (Polysciences) for 20 minutes. The specimen was blended at high speed for 2 to 7 seconds in the Waring blender (Cardinal Health), depending on the thickness of mucoid specimens. The blended sample was centrifuged 10 minutes in a benchtop centrifuge (Hettich) at 1866 rpm. Two smears were prepared from the resuspended sediment after decanting the supernatant and vortexing the sediment for 8 to 10 seconds in the vortex mixer. The smears were air dried, rinsed in 95% alcohol for 10 minutes, and stained with the Papanicolaou stain.
      • Koss L.G.
      • Melamed M.R.
      • Koss L.G.
      Koss' Diagnostic Cytology and Its Histopathologic Bases.
      The slides were screened by a cytotechnologist and reviewed by a cytopathologist. Cases were recorded as negative, atypical, suspicious, or malignant.

      Results

      Our study enrolled 11 healthy subjects with no history of cancer or pulmonary issues before testing, 34 high-risk patients with benign chronic pulmonary ailments (COPD/IPF), and 43 patients with pathologically confirmed lung cancer (Table 1). High-risk benign patients and patients with lung cancer were equally divided by gender with an average age of 68 years. The breakdown of subject demographics, lung cancer types, specific lobe, location within each lobe, and pathological stage is shown in Table 2.
      Table 1Study groups
      GroupCases (n)
      Healthy
       Age [range]
      31-52 y
       Male7
       Female4
      High-risk benign
       Age [range]
      34-87 y
       Male17
       Female17
       Pulmonary
      IPF24
      COPD10
      Lung cancer
       Age [range]
      42-93 y
       Male21
       Female22
       Stage
      I-II27
      III-IV16
      IPF, Idiopathic pulmonary fibrosis; COPD, chronic obstructive pulmonary disease.
      Table 2Patient demographics and lung cancer characteristics
      Patient demographicsn%
      Age
       Mean 67.6 y
      Range (42-93 y)
      Sex
       Male2151.2%
       Female2248.8%
      Race
       White2353.5%
       African-American614.0%
       Hispanic511.6%
       Asian511.6%
       Other49.3%
      Smoking
       Current1125.6%
       Former2660.5%
       Never614.0%
      Lung cancer typeTotal cases (n)Percentage (%)Mean size (cm) [range]Lobe distribution (n)Location within lobe (n)Stages (n)
      Non–small cell lung cancer432.6 [0.3-8.0]
      Adenocarcinoma2660.5%2.8 [0.3-8.0]
      Right upper lobe8Central8I8
      Right middle lobe2Middle6II4
      Right lower lobe6Peripheral12III5
      Right upper/middle lobe1IV9
      Left upper lobe8
      Left lower lobe1
      Squamous cell carcinoma1330.2%2.8 [1.2-5.7]
      Right upper lobe5Central3I8
      Right lower lobe3Middle4II3
      Left upper lobe3Peripheral6III2
      Left lower lobe2
      Adenosquamous carcinoma24.7%2.1 [1.2-2.9]Left upper lobe1Peripheral2I2
      Right lower lobe1
      Carcinoid12.3%1.6Left lower lobe1Middle1I1
      Large cell lung cancer12.3%1.2Left lower lobe1Middle1I1
      Measurements of global 5mC-intensity values and calculated 5mC/DAPI colocalization indices revealed that the sputum of patients with lung cancer contain a majority of aberrantly hypomethylated cells; sputum of normal cells had a majority of normally methylated cells. Hypomethylated cells are characterized by reduced 5mC intensity that ensues in flatter regression lines and ultimately smaller ∂ (on average <15°). It should be noted that hypomethylated epithelial cells were also found in normal sputum, although at a significantly smaller rate than the sputum of patients with lung cancer, as exemplified in Figure 4.
      By looking at the generated data on the cell level, we empirically determined cutoffs for the 2 parameters: 150 relative light units for 5mC-intensity and a colocalization index ∂ = 15°. The cutoffs were chosen to yield more than 50% hypomethylated cells for cancer samples and less than 50% hypomethylated cells for noncancer samples. Cells were characterized as hypomethylated if their measured parameter was less than the determined thresholds. Using these cutoffs, we calculated that healthy sputum contains approximately 18% hypomethylated cells (Figure 5, A). This fraction is increased to approximately 42% in the sputum of high-risk benign patients with chronic pulmonary disorders. In early-stage lung cancer, the number of hypomethylated cells increased to approximately 64%, and in advanced-stage lung cancer, this increased to 80%. The same trend in the proportions of hypomethylated cells can be seen for the 5mC/DAPI colocalization index: 26% in healthy donors, an increase to 66% in high-risk benign patients, 86% in early-stage lung cancer, and 93% in advanced lung cancer (Figure 5, B). However, we did not observe any correlation between the number of hypomethylated cells and the location of the tumor (central vs peripheral) or tumor size. Tumor characteristics are provided in Table E1. Of note, we found an average of 20% difference, for both parameters, between high-risk benign patients and patients diagnosed with early-stage lung cancer.
      Figure thumbnail gr5
      Figure 5Histograms showing mean values for the 2 parameters, 5mC load (global intensity) (A) and 5mC/DAPI colocalization index (B) for the different subject groups. Both parameters indicate that the number of hypomethylated cells proportionally increases from the healthy state over symptomatic (high-risk benign pulmonary diseases) to lung cancer and even progresses during cancer advancement. In particular, early lung cancer stages exhibiting at average approximately 25% more globally hypomethylated cells than high-risk benign conditions. This margin is significant in differentiating the 2 most important groups in early lung cancer detection. 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole; LCa, lung cancer.
      In a second analysis, we applied a statistical approach to identify optimal thresholds for the 2 parameters to define a hypomethylated (aberrant) cell and the percent of aberrant cells that would differentiate each specific patient study group (healthy, high-risk benign, and patient with cancer). The results are reported in Table E2, Table E3, Table E4, Table E5, Table E6, Table E7, Table E8, Table E9. Table 3 summarizes the optimal thresholds for the 2 parameters that most differentiate 2 groups of interest and the test performance for the 2 parameters and the combined test for the most important 6 differentiations. In all but 1 differentiation, the 5mC/DAPI colocalization index (parameter 2) has a better performance over 5mC intensity (parameter 1) and the combination of both parameters. Parameter 2 differentiates healthy subjects from high-risk benign cases with a 97.1% sensitivity and 63.6% specificity. The test differentiates healthy subjects from patients with stage I and II lung cancer with 95.8% sensitivity and 90.9% specificity. For the differentiation of the high-risk benign group versus the early-stage lung cancer group, 5mC-intensity demonstrates 95.8% sensitivity and 41.2% specificity increased over the 5mC/DAPI colocalization index. The angle under the regression line (5mC/DAPI colocalization index) associated with the definition of a hypomethylated cell was 6.41 or less, and the optimal threshold value for the percentage of abnormal cells associated with a cancer diagnosis was 30.30% for 5mC and 38.4% for 5mC/DAPI index.
      Table 3Summary of best performances for each differentiation and parameter
      Comparison5mC intensity (parameter 1)5mC/DAPI colocalization index (parameter 2)Combined test
      Optimism-corrected (internally validated) values.
      ThresholdSensitivity (95% CI)Specificity (95% CI)ThresholdSensitivity (95% CI)Specificity (95% CI)Sensitivity (95% CI)Specificity (95% CI)
      Aberrant cell
      Cells are classified as abnormal if value less than threshold value; normal otherwise.
      % of Aberrant cell
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Aberrant cell
      Cells are classified as abnormal if value less than threshold value; normal otherwise.
      % of Aberrant cell
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Healthy vs LCa stages I-II371.2462.9695.8 (78.9-99.9)54.5 (23.4-83.3)6.5941.5495.8 (78.9-99.9)90.9 (58.7-99.8)99.9 (98.7-100)46.5 (44.5-47.5)
      Healthy vs high-risk benign (COPD + IPF)118.592.0897.1 (84.7-99.9)27.3 (6.0-61.0)16.8945.2497.1 (84.7-99.9)63.6 (30.8-89.1)96.3 (95.7-97.3)27.5 (26.8-29.5)
      High-risk benign vs LCa stages I-II150.5730.3095.8 (78.9-99.9)41.2 (24.6-59.3)6.4138.4695.8 (78.9-99.9)26.5 (12.9-44.4)99.9 (98.7-100)26.1 (26.2-27.8)
      High-risk benign vs all LCa stages150.5730.3095.3 (84.2-99.4)41.2 (24.6-59.3)16.4169.1295.3 (84.2-99.4)44.1 (27.2-62.1)99.5 (98.9-100)35.3 (35.0-36.4)
      Controls (healthy + high-risk benign) vs LCa stages I-II150.5730.30100 (85.7-100)26.7 (14.6-41.9)6.5941.5495.8 (78.9-99.9)42.2 (27.7-57.8)99.9 (98.8-100)26.9 (26.3-27.5)
      Controls vs all LCa stages150.5730.3095.3 (84.2-99.4)40.0 (25.7-55.7)16.4169.1295.3 (84.2-99.4)55.6 (40.0-70.4)99.9 (99.2-100)38.2 (37.5-38.6)
      5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole; CI, confidence interval; LCa, lung cancer; COPD, chronic obstructive pulmonary disease; IPF, idiopathic pulmonary fibrosis.
      Optimism-corrected (internally validated) values.
      Cells are classified as abnormal if value less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.

      Comparative Analysis

      By applying standard sputum cytology, a total of 68% (n = 23 of 34) high-risk benign cases were examined by cytology, consisting of 16 IPF and 7 COPD cases. A total of 74% (n = 32 of 43) lung cancer cases were examined by cytology, including 23 early-stage and 9 advanced-stage cases. From the total of 34 patients with COPD/IPF, only samples of 23 of them were forwarded for sputum cytology; 17 (65%) were correctly identified as negative, and 6 (35%) were declared unsatisfactory due to scant cellularity. Furthermore, cytology found few atypical cells in only 1 stage IV adenocarcinoma case and misdiagnosed 27 lung cancer cases (19 stage I-II and 8 stage III-IV) as being negative for abnormal cells. Four early-stage lung cancer cases were declared unsatisfactory because of scant cellularity.

      Discussion

      Although DNA methylation imaging was introduced at the end of the 1990s, it did not gain popularity compared with other contemporaneously developed molecular methods, such as PCR-based, array-based, sequencing, high-pressure liquid chromatography, and mass spectrometry. The reasons may be as follows: DNA methylation imaging was applied in combination with radio-labeled or enzymatic reporters for detection, which lack sensitivity and multiplexing capability or affect repeatability/consistency of the assay, and did not provide enough significance in differential results because of low-image resolution. Unlike current molecular approaches and previous low-resolution imaging-based attempts that average 5mC measurements across large population of cells or only measure mean 5mC intensity values in cell nuclei, 3D-qDMI leverages the extraction of differential 5mC-relevant information by accounting for secondary effects of DNA methylation imbalances occurring throughout cellular transformation, specifically hypomethylation of global DNA. The latter mechanism elicits reorganization of the genome within cell nuclei, thus shifting 5mC/DAPI colocalization patterns.
      • Oh J.H.
      • Gertych A.
      • Tajbakhsh J.
      Nuclear DNA methylation and chromatin condensation phenotypes are distinct between normally proliferating/aging, rapidly growing/immortal, and senescent cells.
      ,
      • Espada J.
      • Esteller M.
      Epigenetic control of nuclear architecture.
      This phenomenon is well described in cell biological research but has not been used in cancer pathology. The image analysis we developed addresses this gap and displays the relevant changes as intensity distributions of the 2 types of signals that reflect the proposed mechanism: 5mC signals and gDNA represented by DAPI signals, because DAPI intercalates into adenine thymine-rich DNA, the main component of highly repetitive and compact heterochromatic sequences.
      By using the 3D-qDMI platform, we explored 5mC/DAPI codistribution patterns that could differentiate between normal and malignant in respective cell models, resected human tissues, and exfoliated sputum cells of the lung epithelium.
      • Tajbakhsh J.
      • Mortazavi F.
      • Gupta N.K.
      DNA methylation topology differentiates between normal and malignant in cell models, resected human tissues, and exfoliated sputum cells of lung epithelium.
      Because DNA methylation correlates with early events in carcinogenesis and tumor progression, this differential property of cells can serve as a signature marker in diagnostics and monitoring. Our approach quantifies the proportion of hypomethylated cells, based on the load and distribution of global 5mC (5mC + gDNA represented by DAPI) in the nuclei of epithelial cells extracted from sputum. By using 3D-qDMI, we demonstrate the ability of sputum to be tested for the early detection of lung cancer. 3D-qDMI allows for the rapid, parallel, single-cell resolution characterization of thousands of cells within heterogeneous sputum samples. This capability is highly favorable because it allows for the exclusion of nonepithelial cells from analysis, thus preventing data skewing caused by sample impurity from infiltrating hematopoietic cells and microorganisms. Another huge advantage of our approach is the bench to bedside clinical applicability. The test is a slide-based cytometric approach that is amenable to automation and scaling; it can seamlessly integrate into the workflow for lung cancer screening and serve as an aid in the diagnosis of lung cancer.
      We demonstrate the advantages of using 3D-qDMI over traditional sputum cytology analysis. By applying high-resolution, high-content 3D imaging analysis, our novel, noninvasive sputum-based screening test for NSCLC detects malignancy with a sensitivity greater than 95%. The detection of malignancy was obtained in all comparative group analyses. In addition, the test differentiated the high-risk benign group versus the early-stage lung cancer with high sensitivity group, which is the most critical distinction among all groups because it eliminates the outliers at both extremes (healthy population and patients with stage III-IV).
      Our results confirm that a sputum-based test is suitable for lung cancer detection. When original sputum samples were simultaneously processed by both methods, cytologic analysis was limited in cancer detection due to scant cellularity in 30% (n = 7) of high-risk benign cases and 12.5% (n = 4) of lung cancer cases. Sputum cytology using Papanicolaou staining demonstrated worse detection, because it only detected abnormal cells in 1 patient with stage IV lung cancer and had an overall false-negative rate of 84.4% (n = 27) in the lung cancer group, with 19 false-negatives in stage I and II and 8 false-negatives in stage III and IV.

      Study Limitations

      One limitation of the study includes using the same measurement at the cell level and the patient level without multilevel modeling. Therefore, no correlation can be made between the data of the 2 levels. Rather, abnormal versus normal cells were classified at the cell level, and for each possible threshold value of abnormal cells, a corresponding percentage of abnormal cells was calculated for each patient. Then, we examined its association with 2 groups of interest at the patient level, using a logistic regression model.

      Conclusions

      Given the ease of obtaining sputum samples from patients and the benefits of no radiation exposure, our novel test may impact lung cancer screening, evaluation of pulmonary nodules, detection of disease recurrence in patients postresection, and cancer surveillance algorithms. Using global DNA methylation topology in the cell nucleus discerned by the innovative 3D-qDMI platform may help leverage the potential of sputum in noninvasive early lung cancer detection. Additionally, our testing methodology may be a feasible option for samples showing scant cellularity, such as those obtained from induced sputum and CT-guided or navigational bronchoscopy fine-needle aspiration biopsies.

      Webcast

      You can watch a Webcast of this AATS meeting presentation by going to: https://www.aats.org/resources/1596.
      Figure thumbnail fx3

      Conflict of Interest Statement

      H.J.S. is a speaker for Intuitive, Medtronic/Covidien, and Pinnacle Biologics. J.T. has submitted a patent application relevant to the methodology applied in this study. All other authors reported no conflicts of interest.
      The Journal policy requires editors and reviewers to disclose conflicts of interest and to decline handling or reviewing manuscripts for which they may have a conflict of interest. The editors and reviewers of this article have no conflicts of interest.
      The authors thank our many collaborators at Cedars-Sinai who were instrumental in helping this study come to fruition. The authors thank Paul Noble, MD, Isabel Pedraza, MD, Jeremy Falk, MD, Rodney Mardirosian, Claudia Huiza, Charles Yoo, and Carissa Huynh and Yi Zhang MD (both at Biobank and Pathology, Cedars-Sinai Cancer Center). This project was supported by Cedars-Sinai Technology Ventures.

      Supplementary Data

      Appendix E1

      Table E1Patient characteristics
      TypeStageSize [cm]LobeLocationAge, yGenderEthnicitySmokingFEV1FEV1%DLCODLCO%Average 5mC level [relative light units]Average 5mC/DAPI index [degrees]
      Adenocarcinomalb2Left upper lobePeripheral61FWhiteFormer2.4111115.7287214.30.66
      Adeno squamousla2.9Left upper lobePeripheral57FWhiteNever2.8112023.6612381.033.17
      Adenocarcinomala1.5Right lower lobeMiddle63MWhiteFormer3.1611429.2112120.26.78
      Adenocarcinoma2a0.3Right upper lobeMiddle73FWhiteFormer1.8478N/AN/A50.70.02
      Adenocarcinomala0.8Left upper lobePeripheral75FWhiteFormer0.76399.94760.720.45
      Adenocarcinomala1.7Left upper lobeCentral66MWhiteCurrent0.69236.925282.681.53
      Adenocarcinoma2b2.7Right lower lobePeripheral60FHispanicCurrent1.54927.841111.451.15
      Squamous cell2a2.1Right lower lobePeripheral93MWhiteFormer1.49898.546251.826.68
      Adenocarcinoma4a3.1Right lower lobePeripheral70FHispanicNever2.1811413.967121.523.29
      Squamous cellla2.6Right lower lobeMiddle58FAfrican-AmericanCurrent2.229523101231.585.34
      Adenocarcinoma42.1Right lower lobePeripheral68MWhiteNever2.468624.393239.6712.98
      Adenocarcinoma4b1.8Right lower lobeMiddle67FOtherCurrent1.266912.261190.111.23
      Adenocarcinomalb2.1Left upper lobePeripheral65FAsianFormer0.94416.2629400.3124
      Adenocarcinoma2a1.9Left upper lobeMiddle75FWhiteCurrent1.9312015.279147.7412.94
      Adenocarcinomala2.5Right upper lobePeripheral65FHispanicFormer1.638416.881116.958.19
      Adenocarcinoma4b4.8Right upper lobeCentral58FAsianNever1.898914.288101.020.31
      Adenocarcinoma43.3Right lower lobePeripheral75MWhiteFormer2.571042084102.231.09
      Squamous cell3a1.7Right upper lobePeripheral82FWhiteFormer1.721311373158.182.75
      Squamous cellla2.6Right lower lobeMiddle58FAfrican-AmericanCurrent2.22952310164.820.56
      Adenocarcinoma4a2.5Right middle lobeCentral63MAsianFormer3.3112823.99537.690.25
      Carcinoidla1.6Left lower lobeMiddle69FWhiteFormer2.2613415.681156.234.81
      Adenocarcinoma3a1.7Right upper lobePeripheral57MWhiteCurrent2.357121.273199.776
      Squamous cellla1.3Left upper lobeMiddle86FWhiteFormer1.21927.6847150.3812.82
      Adenocarcinoma3a5.5Right upper/middle lobeCentral84MWhiteFormer2.4610614.26383.411.37
      Adenocarcinomala4.5Right upper lobeMiddle71FWhiteFormer1.2671365377.3817.29
      Adenosquamousla1.2Right lower lobePeripheral65MHispanicFormer3.0811117.768638.4820.44
      Adenocarcinomalb1.7Right upper lobePeripheral57MWhiteFormer2.3672186292.971.3
      Squamous cellla1.5Left lower lobePeripheral62MWhiteFormer3.091092283358.0510.43
      Squamous cell3b1.8Left lower lobeCentral84MWhiteFormerN/AN/AN/AN/A352.7119.52
      Squamous cellla1.2Right upper lobeMiddle64MWhiteCurrent0.72296.3726159.756.16
      Adenocarcinoma3b8Left lower lobeCentral42FHispanicNever1.77414.2166142.8811.3
      Adenocarcinoma3b5.7Right upper lobeCentral48MAfrican-AmericanFormerN/AN/AN/AN/A111.3913.76
      Large cell carcinomala1.2Left lower lobeMiddle73MOtherCurrent2.6173N/AN/A182.1622.62
      Squamous cell2b5.7Right upper lobeCentral66MOtherCurrentN/AN/AN/AN/A51.482.31
      Adenocarcinoma4a3.6Right middle lobePeripheral70FWhiteFormerN/AN/AN/AN/A2075.7
      Squamous celllb3.5Right upper lobePeripheral60MWhiteFormer3.9410624.379not availablenot available
      Adenocarcinoma4a1.8Left upper lobeCentral68MOtherFormerN/AN/AN/AN/A178.599.77
      Adenocarcinoma4a2.9Left upper lobeMiddle69MAsianFormerN/AN/AN/AN/A50.5211.18
      Adenocarcinoma2b2.5Left upper lobeCentral88MAfrican-AmericanFormer2.0210211.25552.51.96
      Adenocarcinoma3a0.8Right upper lobePeripheral67FAfrican-AmericanNever1.42769.541146.532.46
      Squamous celllb2.3Right upper lobePeripheral60MWhiteFormer3.3910418.7870124.795.89
      Squamous cell2a5.4Left upper lobeCentral74FAsianFormer0.55393.921266.079.22
      Squamous celllb4.3Left upper lobePeripheral69FAfrican-AmericanCurrent143943165.515.81
      FEV, Forced expiratory volume; DLCO, carbon monoxide diffusing capacity; 5mC, 5-methylcytosine; DAPI, 4′,6-diamidino-2-phenylindole; N/A, not available.
      Table E2Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for healthy versus high-risk benign (chronic obstructive pulmonary disease + idiopathic pulmonary fibrosis)
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC118.592.0897.1 (84.7-99.9)27.3 (6.0-61.0)NANA
      5mC/DAPI16.8945.2497.1 (84.7-99.9)63.6 (30.8-89.1)NANA
      CombinedNANA100 (89.7-100)27.3 (6.0-61.0)96.3 (95.7-97.3)27.5 (26.8-29.5)
      Minimum sensitivity of 90% while maximizing specificity5mC104.562.3891.2 (76.3-98.1)27.3 (6.0-61.0)NANA
      5mC/DAPI4.6124.2191.2 (76.3-98.1)72.7 (39.0-94.0)NANA
      CombinedNANA97.1 (84.7-99.9)27.3 (6.0-61.0)99.8 (98.1-100)28.0 (26.8-28.4)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindole; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E3Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for healthy versus early-stage lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC371.2462.9695.8 (78.9-99.9)54.5 (23.4-83.3)NANA
      5mC/DAPI6.5941.5495.8 (78.9-99.9)90.9 (58.7-99.8)NANA
      CombinedNANA100 (85.8-100)45.5 (16.7-76.6)99.9 (98.7-100)46.5 (44.5-47.5)
      Minimum sensitivity of 90% while maximizing specificity5mC228.5153.0391.7 (73.0-99.0)63.6 (30.8-89.1)NANA
      5mC/DAPI4.6541.0091.7 (73.0-99.0)90.9 (58.7-99.8)NANA
      CombinedNANA100 (85.8-100)54.5 (23.4-83.3)100 (98.4-100)54.3 (53.8-56.8)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E4Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for high-risk benign (chronic obstructive pulmonary disease + idiopathic pulmonary fibrosis) versus all stages of lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC150.5730.3095.3 (84.2-99.4)41.2 (24.6-59.3)NANA
      5mC/DAPI16.4169.1295.3 (84.2-99.4)44.1 (27.2-62.1)NANA
      CombinedNANA100 (91.8-100)35.3 (19.7-53.5)99.5 (98.9-100)35.3 (35.0-36.4)
      Minimum sensitivity of 90% while maximizing specificity5mC159.3240.3090.7 (77.9-97.4)50.0 (32.4-67.6)NANA
      5mC/DAPI14.1869.1290.7 (77.9-97.4)47.1 (29.8-64.9)NANA
      CombinedNANA93.0 (80.9-98.5)38.2 (22.2-56.4)92.3 (91.9-93.3)39.0 (37.8-39.6)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E5Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for high-risk benign versus early-stage lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC150.5730.3095.8 (78.9-99.9)41.2 (24.6-59.3)NANA
      5mC/DAPI6.4138.4695.8 (78.9-99.9)26.5 (12.9-44.4)NANA
      CombinedNANA100 (85.8-100)26.5 (12.9-44.4)100 (98.7-100)26.1 (26.2-27.8)
      Minimum sensitivity of 90% while maximizing specificity5mC205.3549.2491.7 (73.0-99.0)47.1 (29.8-64.9)NANA
      5mC/DAPI14.2869.1291.7 (73.0-99.0)47.1 (29.8-64.9)NANA
      CombinedNANA95.8 (78.9-99.9)41.2 (24.6-59.3)94.2 (92.2-96.5)41.4 (40.7-42.9)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E6Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for high-risk benign versus advanced lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC463.9288.24100 (82.4-100)58.8 (40.7-75.4)NANA
      5mC/DAPI17.0776.47100 (82.4-100)47.1 (29.8-64.9)NANA
      CombinedNANA100 (82.4-100)38.2 (22.2-56.4)99.9 (98.5-100)38.4 (38.1-39.7)
      Minimum sensitivity of 90% while maximizing specificity5mC174.6165.5294.7 (74.0-99.9)73.5 (55.6-87.1)NANA
      5mC/DAPI15.8279.0294.7 (74.0-99.9)55.9 (37.9-72.8)NANA
      CombinedNANA94.7 (74.0-99.9)52.9 (35.1-70.2)94.4 (92.9-95.6)53.1 (52.4-54.6)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E7Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for controls (healthy + high-risk benign) versus all stages of lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC150.5730.3095.3 (84.2-99.4)40.0 (25.7-55.7)NANA
      5mC/DAPI16.4169.1295.3 (84.2-99.4)55.6 (40.0-70.4)NANA
      CombinedNANA100 (91.8-100)33.3 (20.0-49.0)100 (99.2-100)38.2 (37.5-38.6)
      Minimum sensitivity of 90% while maximizing specificity5mC159.3240.3090.7 (77.9-97.4)53.3 (37.9-68.3)NANA
      5mC/DAPI14.1869.1290.7 (77.9-97.4)57.8 (42.2-72.3)NANA
      CombinedNANA94.0 (80.9-98.5)42.2 (27.7-57.8)94.7 (93.8-95.2)56.4 (55.1-56.6)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E8Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for controls versus early-stage lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC150.5730.30100 (85.7-100)26.7 (14.6-41.9)NANA
      5mC/DAPI6.5941.5495.8 (78.9-99.9)42.2 (27.7-57.8)NANA
      CombinedNANA100 (85.8-100)26.7 (14.6-41.9)100 (98.8-100)26.9 (26.3-27.5)
      Minimum sensitivity of 90% while maximizing specificity5mC205.3149.2491.7 (73.0-99.0)48.9 (33.7-64.2)NANA
      5mC/DAPI14.2869.1291.7 (73.0-99.0)57.8 (42.2-72.3)NANA
      CombinedNANA95.8 (78.9-99.9)42.2 (27.7-57.8)95.9 (94.4-96.7)42.2 (41.7-43.3)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Table E9Optimal threshold values for 5-methylcytosine and 5-methylcytosine 4′,6-diamidino-2-phenylindol colocalization index and estimated sensitivity and specificity for 5-methylcytosine, 5-methylcytosine 4′,6-diamidino-2-phenylindol, and the combined parameters at the optimal thresholds for controls versus advanced lung cancer
      Criterion for identifying optimal thresholds for parametersParameterOptimal threshold value for abnormal cells at the cell level
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Optimal threshold value for the percentage of abnormal cells at the patient level (%)
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.
      Estimated sensitivity (%) at the optimal thresholds (95% CI)Specificity (%) at the optimal thresholds (95% CI)Optimism-corrected estimated sensitivity (%) (95% CI)Optimism-corrected estimated specificity (%) (95% CI)
      Minimum sensitivity of 95% while maximizing specificity5mC463.9288.24100 (82.4-100)55.6 (40.0-70.4)NANA
      5mC/DAPI17.0776.47100 (82.4-100)57.8 (42.2-72.3)NANA
      CombinedNANA100 (82.4-100)37.8 (23.8-53.5)100 (98.7-100)37.9 (37.3-38.7)
      Minimum sensitivity of 90% while maximizing specificity5mC198.3169.6694.7 (74.0-99.9)73.3 (58.1-85.4)NANA
      5mC/DAPI15.8279.0294.7 (74.0-99.9)64.4 (48.8-78.1)NANA
      CombinedNANA94.7 (74.0-99.9)55.6 (40.0-70.4)94.6 (93.3-95.6)56.5 (54.9-56.7)
      CI, Confidence interval; 5mC/DAPI, 5-methylcytosine 4′,6-diamidino-2-phenylindol; NA, not available.
      Calls are classified as abnormal if value is less than threshold value; normal otherwise.
      Patients are classified as positive to cancer if the percentage of abnormal cells is threshold value or greater; negative otherwise.

      References

        • Goldstraw P.
        • Chansky K.
        • Crowley J.
        • Rami-Porta R.
        • Asamura H.
        • Eberhardt W.E.E.
        • et al.
        The IASLC lung cancer staging project: proposals for revision of the TNM Stage Groupings in the forthcoming (eighth) edition of the TNM classification for lung cancer.
        J Thorac Oncol. 2016; 11: 39-51
        • Pinsky P.F.
        • Bellinger C.R.
        • Miller Jr., D.P.
        False-positive screens and lung cancer risk in the National Lung Screening Trial: implications for shared decision-making.
        J Med Screen. 2018; 25: 110-112
        • Lin E.C.
        Radiation risk from medical imaging.
        Mayo Clin Proc. 2010; 85 (quiz 1146): 1142-1146
        • Fabricant P.D.
        • Berkes M.B.
        • Dy C.J.
        • Bogner E.A.
        Diagnostic medical imaging radiation exposure and risk of development of solid and hematologic malignancy.
        Orthopedics. 2012; 35: 415-420
        • Brower C.
        • Rehani M.M.
        Radiation risk issues in recurrent imaging.
        Br J Radiol. 2021; 94: 20210389
        • Flehinger B.J.
        • Melamed M.R.
        • Zaman M.B.
        • Heelan R.T.
        • Perchick W.B.
        • Martini N.
        Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Memorial Sloan-Kettering study.
        Am Rev Respir Dis. 1984; 130: 555-560
        • Fontana R.S.
        • Sanderson D.R.
        • Taylor W.F.
        • Woolner L.B.
        • Miller W.E.
        • Muhm J.R.
        • et al.
        Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Mayo Clinic study.
        Am Rev Respir Dis. 1984; 130: 561-565
        • Frost J.K.
        • Ball Jr., W.C.
        • Levin M.L.
        • Tockman M.S.
        • Baker R.R.
        • Carter D.
        • et al.
        Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Johns Hopkins study.
        Am Rev Respir Dis. 1984; 130: 549-554
        • Kubik A.
        • Polak J.
        Lung cancer detection. Results of a randomized prospective study in Czechoslovakia.
        Cancer. 1986; 57: 2427-2437
        • Patz Jr., E.F.
        • Goodman P.C.
        • Bepler G.
        Screening for lung cancer.
        N Engl J Med. 2000; 343: 1627-1633
        • Zochbauer-Muller S.
        • Minna J.D.
        • Gazdar A.F.
        Aberrant DNA methylation in lung cancer: biological and clinical implications.
        Oncologist. 2002; 7: 451-457
        • Brock M.V.
        • Hooker C.M.
        • Ota-Machida E.
        • Han Y.
        • Guo H.
        • Ames S.
        • et al.
        DNA methylation markers and early recurrence in stage I lung cancer.
        N Engl J Med. 2008; 358: 1118-1128
        • Tessema M.
        • Yu Y.Y.
        • Stidley C.A.
        • Machida E.O.
        • Schuebel K.E.
        • Baylin S.B.
        • et al.
        Concomitant promoter methylation of multiple genes in lung adenocarcinomas from current, former and never smokers.
        Carcinogenesis. 2009; 30: 1132-1138
        • Leng S.
        • Do K.
        • Yingling C.M.
        • Picchi M.A.
        • Wolf H.J.
        • Kennedy T.C.
        • et al.
        Defining a gene promoter methylation signature in sputum for lung cancer risk assessment.
        Clin Cancer Res. 2012; 18: 3387-3395
        • Diaz-Lagares A.
        • Mendez-Gonzalez J.
        • Hervas D.
        • Saigi M.
        • Pajares M.J.
        • Garcia D.
        • et al.
        A novel epigenetic signature for early diagnosis in lung cancer.
        Clin Cancer Res. 2016; 22: 3361-3371
        • Feinberg A.P.
        • Ohlsson R.
        • Henikoff S.
        The epigenetic progenitor origin of human cancer.
        Nat Rev Genet. 2006; 7: 21-33
        • Baylin S.B.
        • Jones P.A.
        Epigenetic determinants of cancer.
        Cold Spring Harb Perspect Biol. 2016; 8
        • Ehrlich M.
        • Lacey M.
        DNA hypomethylation and hemimethylation in cancer.
        Adv Exp Med Biol. 2013; 754: 31-56
        • Shen H.
        • Laird P.W.
        Interplay between the cancer genome and epigenome.
        Cell. 2013; 153: 38-55
        • Tajbakhsh J.
        • Mortazavi F.
        • Gupta N.K.
        DNA methylation topology differentiates between normal and malignant in cell models, resected human tissues, and exfoliated sputum cells of lung epithelium.
        Front Oncol. 2022; 12: 991120
        • Gertych A.
        • Wawrowsky K.A.
        • Lindsley E.
        • Vishnevsky E.
        • Farkas D.L.
        • Tajbakhsh J.
        Automated quantification of DNA demethylation effects in cells via 3D mapping of nuclear signatures and population homogeneity assessment.
        Cytometry A. 2009; 75: 569-583
        • Gertych A.
        • Farkas D.L.
        • Tajbakhsh J.
        Measuring topology of low-intensity DNA methylation sites for high-throughput assessment of epigenetic drug-induced effects in cancer cells.
        Exp Cell Res. 2010; 316: 3150-3160
        • Oh J.H.
        • Gertych A.
        • Tajbakhsh J.
        Nuclear DNA methylation and chromatin condensation phenotypes are distinct between normally proliferating/aging, rapidly growing/immortal, and senescent cells.
        Oncotarget. 2013; 4: 474-493
        • Tajbakhsh J.
        Covisualization of global DNA methylation/hydroxymethylation and protein biomarkers for ultrahigh-definition epigenetic phenotyping of stem cells.
        Methods Mol Biol. 2020; 2150: 79-92
        • Tajbakhsh J.
        • Gertych A.
        • Fagg W.S.
        • Hatada S.
        • Fair J.H.
        Early in vitro differentiation of mouse definitive endoderm is not correlated with progressive maturation of nuclear DNA methylation patterns.
        PLoS One. 2011; 6: e21861
        • Tajbakhsh J.
        • Stefanovski D.
        • Tang G.
        • Wawrowsky K.
        • Liu N.
        • Fair J.H.
        Dynamic heterogeneity of DNA methylation and hydroxymethylation in embryonic stem cell populations captured by single-cell 3D high-content analysis.
        Exp Cell Res. 2015; 332: 190-201
        • Stefanovski D.
        • Tang G.
        • Wawrowsky K.
        • Boston R.C.
        • Lambrecht N.
        • Tajbakhsh J.
        Prostate cancer diagnosis using epigenetic biomarkers, 3D high-content imaging and probabilistic cell-by-cell classifiers.
        Oncotarget. 2017; 8: 57278-57301
        • Gertych A.
        • Oh J.H.
        • Wawrowsky K.A.
        • Weisenberger D.J.
        • Tajbakhsh J.
        3-D DNA methylation phenotypes correlate with cytotoxicity levels in prostate and liver cancer cell models.
        BMC Pharmacol Toxicol. 2013; 14: 11
        • Schafer H.
        Constructing a cut-off point for a quantitative diagnostic test.
        Stat Med. 1989; 8: 1381-1391
        • Harrell Jr., F.E.
        • Lee K.L.
        • Mark D.B.
        Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.
        Stat Med. 1996; 15: 361-387
        • Harrel F.E.
        Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. Efficiency of Some Procedures for Logistic Regression Analysis.
        Springer Series in Statistics, 2001
        • Steyerberg E.W.
        • Harrell Jr., F.E.
        • Borsboom G.J.
        • Eijkemans M.J.
        • Vergouwe Y.
        • Habbema J.D.
        Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.
        J Clin Epidemiol. 2001; 54: 774-781
      1. R: A Language and Environment for Statistical Computing. [computer program]. R Foundation for Statistical Computing, 2020
        • Koss L.G.
        • Melamed M.R.
        • Koss L.G.
        Koss' Diagnostic Cytology and Its Histopathologic Bases.
        5th ed. Lippincott Williams & Wilkins, 2006
        • Espada J.
        • Esteller M.
        Epigenetic control of nuclear architecture.
        Cell Mol Life Sci. 2007; 64: 449-457