Skip to main content

Glossary of Terms and Abbreviations

Written by Support

For clarity and consistency across the Franklin platform, this article provides a comprehensive list of definitions, acronyms, and abbreviations commonly used throughout the system. It is intended to support your understanding of terminology related to genomic and bioinformatic work, platform-specific features, and data sources.

Genomics & bioinformatics terms

Term/abbreviation

Description

ACMG

American College of Medical Genetics and Genomics

Actionability

A measure of whether a piece of genetic information known to the clinician is clinically meaningful and worth acting on with the patient

Actionable Gene

A gene responsible for a given disease that has direct or indirect therapeutic applications

Alignment

The process of arranging sequencing reads against a reference genome to identify similarities and differences

Allele

One of two or more alternative forms of a gene or genetic sequence at the same locus on a chromosome

AMP

Association for Molecular Pathology

API

Application Programming Interface: set of definitions and protocols for building and integrating application software

ASCO

American Society of Clinical Oncology

Assay

A set of predefined configurations and protocols used in laboratory analysis, specifically designed to focus on and prioritize specific areas of interest

B

Benign - one of the possible classes of a genetic variant according to its likelihood of being pathogenic; see the term “Variant Classification”

BAM

Binary Alignment Map

BWA

Burrows-Wheeler Alignment algorithm - industry standard open-source aligner

Carrier Screening

Genetic testing to determine if a person carries a gene mutation that could be passed to offspring

CDS

Clinical Decision Support - health information technology that provides clinicians with knowledge and person-specific information to enhance health and health care

ClinGen

Clinical Genome Resource

CMA

Chromosomal Microarray Analysis

CNA

Copy Number Aberration

CNV

Copy Number Variation - genomic alterations where sections of DNA are duplicated or deleted

Coverage

The average number of times a given base in the genome is sequenced; also referred to as sequencing depth

Curated Data

Genetic data that has been reviewed, organized, standardized, and annotated by experts to ensure accuracy, consistency, and clinical relevance prior to use in interpretation

DNA

Deoxyribonucleic Acid - the molecule that carries the genetic information used in the growth, development, functioning, and reproduction of all living organisms

ECS

Expanded Carrier Screening

ESHG

European Society of Human Genetics

FASTQ

A text-based format for storing both biological sequences and their quality scores

FreeBayes

Open-source variant calling tool

GA4GH

Global Alliance for Genomics and Health

GATK

Genome Analysis Toolkit - industry-standard variant calling software

GenCC

Gene Curation Coalition

Genetic Profile

A collection of characterized genetic variants and markers of an individual derived from genomic analysis, used to support clinical interpretation

Genotype

The complete set of genetic information of an organism, or the genetic constitution at a specific locus, typically described in terms of alleles present

Germline Variant

A genetic variant inherited from a parent that is present in all cells

GIAB

Genome in a Bottle

GRCh37

Genome Reference Consortium Human Build 37

GRCh38

Genome Reference Consortium Human Build 38

Haplotype

A group of genes within an organism that was inherited from a single parent

HGMD

Human Gene Mutation Database

HGVS

Human Genome Variation Society - a standardized nomenclature system for describing gene variations

HPO

Human Phenotype Ontology

HRD

Homologous Recombination Deficiency - the inability of a cell to effectively repair DNA double-strand breaks using the homologous recombination repair pathway

Indel

Insertion or deletion of nucleotides in the genome

ISCN

International System for Human Cytogenomic Nomenclature

Knowledge Base

A centralized, structured repository of curated genetic information, panels, classified variants, and references used to support variant interpretation and clinical decision making within Franklin

LP

Likely Pathogenic - one of the possible classes of a genetic variant according to its likelihood of being pathogenic; see the term “Variant Classification”.

MAF

Minor Allele Frequency - the frequency at which the second-most-common allele occurs in a given population

MANE

Matched Annotation from NCBI and EBI

MMR

Mismatch Repair - system within cells that corrects errors that spontaneously occur during DNA replication

MNV

Multi-Nucleotide Variant

MOI

Mode of Inheritance - the manner in which a genetic trait or disease is passed from one generation to another

MSI

Microsatellite Instability - a measure of predisposition to mutation that results from impaired DNA Mismatch Repair

NGS

Next-Generation Sequencing - a general term used to describe high throughput sequencing technologies

NIST

National Institute of Standards and Technology

NMD

Nonsense-Mediated Decay - a cellular mechanism that degrades mRNA transcripts containing premature stop codons

P

Pathogenic - one of the possible classes of a genetic variant according to its likelihood of being pathogenic; see the term “Variant Classification”

P/M/DN

Parental / Maternal / De novo – Inheritance indication of a variant. The variant might be inherited paternally or maternally; In cases where the variant is a novel mutation, not inherited paternally or maternally, it is indicated as De Novo

Panel Sequencing

Targeted sequencing of specific sets of genes relevant to conditions

Pathogenicity

The potential capacity of genetic variants to cause disease

Phenotype

The observable characteristics or traits of an organism resulting from the interaction of its genotype with the environment

Pipeline

A sequence of automated computational processes applied to sequencing data (e.g., alignment, variant calling, annotation, filtration) to transform raw reads into interpretable results

PM (PM1, PM2, PM3, PM4, PM5, PM6)

Pathogenic Moderate indication - indication category of moderate pathogenicity according to ACMG/AMP guidelines. Includes the criteria PM1, PM2, PM3, PM4, PM5, PM6

PP (PP1, PP2, PP3, PP4, PP5)

Pathogenic Supporting indication - indication category of supporting pathogenicity according to ACMG/AMP guidelines. Includes the criteria PP1, PP2, PP3, PP4, PP5

PS (PS1, PS2, PS3, PS4)

Pathogenic Strong indication - indication category of strong pathogenicity according to ACMG/AMP guidelines. Includes the criteria PS1, PS2, PS3, PS4

Pseudonymization

Data processing technique in which personally identifiable information is replaced with one or more artificial identifiers so that the data can no longer be attributed to a specific individual without additional information kept separately

QC

Quality Control - the processes and metrics used to assess the reliability and integrity of sequencing and analysis data (e.g., read depth, mapping quality, variant confidence scores)

Reference Genome

A representative digital nucleic acid sequence assembled as a standard against which sequencing reads are aligned and compared (e.g., GRCh37, GRCh38)

RNA

Ribonucleic Acid - a nucleic acid that plays a central role in the coding, decoding, regulation, and expression of genes

SAM

Sequence Alignment Map

SNP

Single Nucleotide Polymorphism

SNP Calling

Process to determine positions where polymorphisms exist compared to a reference sequence

SNV

Single Nucleotide Variant

Somatic Variant

A genetic variant acquired during a person's lifetime, not inherited; can occur in any cell except germ cells and is not passed to children

SV

Structural Variants - large genomic alterations typically encompassing at least 50 bp

TMB

Tumor Mutational Burden - a genetic characteristic measuring the number of non-inherited mutations per million bases of investigated genomic sequence

UPD

Uniparental Disomy - genetic abnormality where both copies of a chromosome are inherited from one parent

VAF

Variant Allele Frequency

Variant Classification

The categorization of a genetic variant according to its likelihood of being pathogenic (e.g., Benign, Likely Benign, VUS, Likely Pathogenic, Pathogenic) based on established criteria such as ACMG/AMP

VCF

Variant Call Format - output file format used to store genetic sequence variations

VICC

Variant Interpretation for Cancer Consortium

VUS

Variant of Uncertain Significance

WES

Whole Exome Sequencing - sequencing focusing on coding regions of the genome

WGS

Whole Genome Sequencing - comprehensive analysis of the entire genome

Platform-specific terms

Term/abbreviation

Definition

Active User

A unique Franklin user account that has logged in and performed at least one analytical action within a defined monitoring period, as used in platform utilization and post-market surveillance metrics

Analytical Version

Version identifier that changes when Franklin changes its secondary analysis (alignment, variant calling). New versions require analytical performance validation

Benchmark

Pre-defined performance thresholds used to evaluate the quality and accuracy of analysis results

Case

A patient sample analysis instance in Franklin, containing genetic data and associated metadata

Clinical Version

Version identifier that changes when Franklin changes its tertiary analysis (classification, priority, benchmark thresholds). New versions require clinical performance validation

DB

Database

Hard Panel

Hard panels are applied before the annotation phase, and once implemented, they restrict the scope of analysis, preventing any expansion. This panel type is primarily used for regulatory compliance, ensuring that only the variants within the panel are reported and avoiding secondary findings. Hard panels require all variants found on the panel to be reported and are useful for fixed regulatory requirements, trimming the analysis before annotation for strict adherence to compliance standards

Lab Integration

The interface between Franklin and laboratory workflows, ranging from manual uploads to API integration to full laboratory information system integration

Non-Standard Case

A case using a custom template workflow, typically accessed via sample bulk upload

NR

Not Relevant - the marking of a variant as irrelevant for analysis, can be set using the “NR” button

Primary Analysis

Initial processing of raw sequencing data by the sequencer

Secondary Analysis

Alignment of sequencing reads to reference genome and variant calling

Soft Panel

A soft panel is dynamic and acts as a filter to restrict the view of variants. It can be applied after the case has been generated, allowing for flexibility in addressing specific variants of interest. Additional soft panels can be added to refine the analysis further. Soft panels mask genes and offer the option to decide which variants to report, as seen in cases where broad tumor profiles, like TSO500, are used with soft panels applied post-analysis to maintain existing ordering procedures while expanding the scope of the analysis

Solution

The Solution in Franklin refers to the complete, end-to-end genetic analysis and interpretation process. The solution covers everything from raw data to an entirely generated report. It includes all necessary components, such as workflows, panels, assays, and associated configurations, and encompasses both secondary and tertiary analysis. Each solution is designed to address specific clinical diagnostic or research needs, providing a seamless and integrated experience. Solutions are tailored to categories like germline or oncology, ensuring users have a comprehensive set of tools aligned with their clinical or research objectives

Solution Category

Highest-level categorization of genetic analysis in Franklin (e.g., Oncology, Reproductive Health, Genetics)

Standard Case

Pre-defined case type available through the "New Case" user flow, accessible to all users

Template Workflow

A pre-configured set of parameters (pipeline, kit, sequencer, file inputs, QC metrics) tailored for specific genetic tests

Tertiary Analysis

Annotation, filtering, and interpretation of variants

Validated Workflow

Validated Workflow in Franklin refers to the combination of a specific kit and a corresponding workflow that has been verified for accuracy and performance. Examples of validated workflows include TSO500 by Illumina, Agilent CGP, Thermo Fisher OCA, and Oncomine Comprehensive. Franklin offers support for more than 500 validated workflows, each of which resides within an Assay. These workflows are essential for ensuring that each analysis is conducted according to the specifications of the kit used, making it possible to consistently deliver high-quality, reliable results

Workbench

The Workbench is the primary interface for case analysis, providing essential information needed to evaluate a sample. Once processing is complete, the evaluation begins from this page. When the "WB" (abbreviation for Workbench) icon appears within the variant list, it means that the variant is included in the workbench

Workflow

A pre-configured set of parameters including pipeline, kit, sequencer, file inputs, and quality control metrics

Data sources and references

Source

Description

ClinGen

Clinical Genome Resource - NIH-funded resource dedicated to building an authoritative central resource that defines the clinical relevance of genes and variants

ClinVar

Public archive of reports on relationships between human genetic variants and phenotypes, maintained by NCBI

COSMIC

Catalogue of Somatic Mutations in Cancer - database of somatic mutations found in human cancers

dbSNP

Database of Single Nucleotide Polymorphisms - NCBI database of genetic variation

gnomAD

Genome Aggregation Database - database aggregating and harmonizing exome and genome sequencing data from large-scale sequencing projects

OMIM

Online Mendelian Inheritance in Man - comprehensive compendium of human genes and genetic phenotypes

RefSeq

Reference Sequence Database - NCBI collection of curated, non-redundant genomic DNA, transcript, and protein sequences

1000 Genomes

Population-level genomic reference dataset providing allele frequencies across diverse human populations

APOGEE

Machine-learning meta-predictor used for pathogenicity prediction of mitochondrial variants

BayesDel

Deleteriousness meta-score used to predict the pathogenicity of missense and splice-site variants

BRCA Exchange

Public resource aggregating BRCA1 and BRCA2 variant data and expert classifications from ENIGMA and other consortia

DANN

Deep-learning based pathogenicity score for variants across the genome

dbscSNV

Database of splice-consensus single nucleotide variants used for splice-site variant prediction

dbVAR

NCBI database of human genomic structural variation

DECIPHER

Database of submicroscopic chromosomal imbalances and phenotypes, used for clinical interpretation of CNVs

DGV

Database of Genomic Variants - catalogue of structural variation in the human genome

ESP 6500

NHLBI Exome Sequencing Project dataset providing allele frequencies from ~6,500 exomes

ExAC

Exome Aggregation Consortium - aggregation of exome sequencing data providing allele frequencies (predecessor of gnomAD)

FATHMM

Functional Analysis through Hidden Markov Models - tool predicting the functional consequences of coding and non-coding variants

FDA Approved Drugs

Public list of drugs approved by the U.S. Food and Drug Administration used for therapeutic annotation

fitCons

Score estimating the probability that a genomic position is under natural selection (fitness consequences)

Gene2Phenotype

Curated dataset of gene-disease associations maintained by EBI, used to support clinical interpretation

GenoCanyon

Functional potential score predicting the significance of non-coding variants

GERP

Genomic Evolutionary Rate Profiling - conservation score indicating evolutionary constraint at a given genomic position

MetaLR

Logistic-regression meta-predictor of missense variant deleteriousness

MetaSVM

Support-vector-machine meta-predictor of missense variant deleteriousness

MitoMap

Compendium of polymorphisms and mutations of the human mitochondrial DNA

MitoTip

Prediction score for the pathogenicity of mitochondrial tRNA variants

Monarch

Monarch Initiative - integrated platform linking genes, phenotypes and diseases across species

MutationAssessor

Tool predicting the functional impact of amino acid substitutions based on evolutionary conservation

MutationTaster

Prediction tool evaluating the disease-causing potential of DNA sequence alterations

Orphanet

European portal for information on rare diseases and orphan drugs

PharmGKB

Pharmacogenomics Knowledgebase - curated resource on the impact of genetic variation on drug response

PolyPhen-2

Polymorphism Phenotyping v2 - tool predicting the possible impact of amino acid substitutions on the structure and function of a human protein

PrimateAI

Deep-learning based pathogenicity prediction for missense variants, trained on non-human primate variation

PubMed

NIH-maintained database of biomedical literature references and abstracts

REVEL

Rare Exome Variant Ensemble Learner - ensemble meta-predictor for missense variant pathogenicity

SIFT

Sorting Intolerant From Tolerant - tool predicting whether an amino acid substitution affects protein function

SpliceAI

Deep-learning tool predicting splice-altering effects of genetic variants

TopMed BRAVO

Trans-Omics for Precision Medicine BRAVO - allele frequency dataset derived from the TOPMed program

TrAP

Transcript-inferred Pathogenicity score - prediction score for the impact of variants on splicing and protein function

UK10K

UK10K Project - dataset providing allele frequencies from ~10,000 UK-based individuals

UniProt

Universal Protein Resource - comprehensive, high-quality database of protein sequence and functional information

VARITY

Meta-predictor of missense variant pathogenicity based on machine learning

Did this answer your question?