1Department of Pathology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
2Center for Cancer Genome Discovery, Asan Institute for Life Sciences, Seoul, , Korea
3Samsung Genome Institute, Sungkyunkwan University School of Medicine, Seoul, Korea
4Department of Pathology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea
5Department of Pathology, Yonsei University College of Medicine, Seoul, Korea
6Department of Pathology, Seoul National University College of Medicine, Seoul, Korea
© 2017 The Korean Society of Pathologists/The Korean Society for Cytopathology
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Subject | Database/Algorithm | URL |
---|---|---|
Germline polymorphism | dbSNP, dbVar | |
1000 Genomes Project | ||
ExAC | ||
Cancer-specific somatic variants | COSMIC | |
cBioPortal | ||
My Cancer Genome | ||
CIViC | ||
Personalized Cancer Therapy, MD Anderson Cancer Center | ||
Genotype-phenotype association, not limited to cancer | ClinVar | |
Human Gene Mutation Database | ||
Leiden Open Variation Database | ||
In silico functional prediction | dbNSFP (pre-computed in silico functional prediction and annotation of non-synonymous SNVs) | |
Ensembl Variant Effector Predictor |
Element | Example |
---|---|
Patient identification | Registration number, age, gender, ordering physician |
Specimen type | FFPE, fresh frozen |
Pathologic diagnosis | Lung adenocarcinoma, clorectal cancer |
Tissue sample identification | Specimen number, block number |
Important dates | Date on reception or on report |
Percentage of tumor nuclei of the sample used | 30% |
Variants found | Variant details according to HGVS mutation nomenclature |
Version of reference genome used | hg19 build 36 |
NGS method used | Amplicon-based or hybridization capture-based |
Key quality control metrics | Mean target coverage, percentage of selected bases, duplication rate |
Genes or genomic regions included in the panel | Exonic regions of gene A, B, C, etc. |
Interpretation and summary | EGFR L858R variants predict response to EGFR tyrosine kinase inhibitors (Erlotinib or Gefitinib) |
Type of material | Advantage | Disadvantage |
---|---|---|
Genomic DNA from cell line | Large amount available | May have heterogeneity associated with cell line maintenance |
Similar complexity to patient’s DNA | Possible genomic instability over time | |
May have well-characterized variants | ||
Genomic DNA from patient’s sample | Identical condition to real samples | Not necessarily renewable |
Limited amount of DNA | ||
Well characterized genetic information may be limited | ||
Synthetic DNA | Can synthesize a broad range of sequences and variations | Does not represent actual human cancer genome |
May not perform as actual human cancer DNA due to differences in sequence complexity | ||
Can make sequence templates with complex regions (deletions or duplications) | ||
Will not cover all regions of tested genome | ||
Large amount available | May exhibit higher variant calls due to errors in synthesis | |
Electronic reference data files | Can engineer any wanted sequence files | Reference only for data analysis step |
Requires many reference datasets to mimick many types of sequence data | ||
Data files may not be interoperable among different platforms |
Item | Checklist | Consequences of non-conformity | Improvement suggestions |
---|---|---|---|
Tissue sample adequacy | Criteria for inadequate specimen | Testing inadequate specimens may lead to a waste of time and money or depletion of available samples. | Check sample adequacy rigorously before testing. |
Minimum tumor content | Request further sampling in case of inadequate samples. | ||
Appropriate sample handling including fixation and transportation | Inadequate amount or tumor content can lead to false-negative test results. | ||
Nucleic acid extraction | DNA quantity and quality in terms of amplifiable DNA | DNA with suboptimal quality may inhibit sequencing reaction. | Failed samples should be reported as such and further material might be requested with specified requirement. |
Small amount or fragmentation of DNA may lead to poor quality sequencing data with insufficient or uneven coverage and/or high duplication rate. | |||
Trying another validated extraction method may often helps. | |||
Sample identification | Sample identity tracking throughout all steps | Misidentification of samples could lead to incorrect patient management. | If there is any concern about sample identity, starting over from DNA extraction may be necessary. |
Introduction of polymorphic SNP markers into gene panel and running another genotyping method with the same marker set might be helpful. | |||
Library preparation | Minimum library concentration | Poor sequencing library may lead to insufficient or uneven coverage. | Consider modification of library preparation method or an alternative method to verify any uncertain results. |
Libraries with poor complexity or bias may result in false-negatives. False-positives may also occur due to potential amplification bias. | |||
Sequencing | Criteria for minimum sequencing depth and other quality metrics (% reads mapped to target regions, % of targets with specified coverage, duplication rate) | Inadequate coverage is associated with higher levels of uncertainty of the test results. | Repeat sequencing with existing library or start over from DNA extraction step. |
Genomic regions with insufficient local coverage may lead to inaccurate results for variants located in those regions. | Verification of uncertain results with another method may be helpful, especially, in case of actionable variants. | ||
Variant detection and review | Variant allele frequency, local sequencing depth and quality score | Failure to filter out sequencing artifacts may lead to false-positive results. | Manually review of clinically important variants even if computational algorithms called no mutation on them. |
Presence of the same variant in forward and reverse strands | Clinically important variants may sometimes be missed. | Any ambiguous or unexpected results should be reviewed by laboratory scientists and pathologists. | |
Mapping quality of sequencing reads | Verify variants with another method, if applicable. | ||
Potential sequencing artifacts | |||
Bioinformatics | Correct pipeline and version | Using outdated or inadequate software can lead to false-positive or false-negative results. | Update software on a regular basis. |
Appropriate version and build of human reference sequence | |||
Cross-contamination? | |||
Reporting | Endorsed by an authorized competent pathologist? | Variants with clinical significance may be reported erroneously, leading to inappropriate treatment. | Responsible pathologists should be given enough time and opportunities for education and training. |
Quality metrics | General considerations upon validation | Actions to be considered during quality controls |
---|---|---|
Depth of coverage | Minimum coverage threshold required for desired sensitivity and specificity across targeted regions should be established. | When the coverage threshold could not be met in a suspicious variant call, additional validation by an alternate method (e.g., |
Sanger sequencing) should be considered. | ||
Uniformity of coverage | Coverage across targeted regions should be as uniform as possible to produce reliable results across all targeted genomic regions. | Uniformity of coverage often decreases with samples having low DNA amount or highly degraded DNA. |
When coverage uniformity significantly deviates from that established during initial validation, this may indicate potential analytical errors. | ||
GC bias | GC content affects the efficiency of the sequencing reactions and will affect the uniformity of coverage. | GC bias should be monitored in every run to detect any change in test performance. |
The amount of GC bias should be established across the targeted genomic regions. | ||
Base call quality scores | Informatics filters should be in place to eliminate any reads having a base call quality score below the acceptable quality score. | Quality scores are not readily comparable from one sequencing platform to another. |
Mapping quality | During validation, it is important to make sure that the test analyzes the reads that map specifically to the targeted genomic regions. | The proportion of reads that do not map to targeted regions should be monitored during each run. |
Informatics filters should be in place to eliminate any reads that map to non-targeted regions. | Poor mapping quality may come from non-specific amplification, capture of off-target DNA, or contamination. | |
Proportion of duplicated reads | Informatics filters should be in place to eliminate duplicate reads resulting from clonal amplification during alignment. | The amount of duplicate reads should be monitored to prevent skewing of allelic fractions. |
This list is not comprehensive and only provides some examples. All websites were last accessed on January 3, 2017. ExAC, Exome Aggregation Consortium; COSMIC, Catalog of Somatic Mutations in Cancer; CIViC, Clinical Interpretations of Variants in Cancer; NSFP, nonsynonymous functional prediction; SNV, single nucleotide variation.
NGS, next-generation sequencing; FFPE, formalin-fixed paraffin embedded; HGVS, Human Genome Variation Society; EGFR, epidermal growth factor receptor.
Adapted by permission from Macmillan Publishers Ltd: [Nature Biotechnology] Gargis NGS, next-generation sequencing.
NGS, next-generation sequencing; QC, quality control.
Adapted by permission from Macmillan Publishers Ltd: [Nature Biotechnology] Gargis NGS, next-generation sequencing.