Single nucleotide polymorphisms, commonly referred to as SNPs (pronounced "snips"), represent the prevailing form of genetic variation within the human genome. When a cell undergoes division to produce a new cell, it first duplicates its DNA to ensure each new cell inherits a complete set of genetic instructions. However, errors can occur during this replication process, akin to typographical errors, leading to alterations in the DNA sequence at specific points known as single nucleotide polymorphisms or SNPs. Each SNP denotes a variance in a single DNA building block, or nucleotide. For instance, a SNP might replace a cytosine (C) nucleotide with a thymine (T) nucleotide within a particular DNA fragment. SNPs are pervasive in human DNA, occurring roughly once every 1,000 nucleotides on average, resulting in approximately 4 to 5 million SNPs within an individual's genome. To be classified as SNPs, these variations must be present in at least 1% of the population, and scientists have identified over 600 million SNPs across global populations.
Primarily, SNPs manifest in the non-coding regions of DNA between genes. They serve as valuable biomarkers, and when located in regulatory regions within or adjacent to genes, they may directly influence gene function, potentially impacting disease susceptibility and progression.
While many SNPs exhibit negligible impact on health or development, certain genetic variations have emerged as pivotal in human health research. SNPs play a crucial role in predicting an individual's reaction to specific medications, sensitivity to environmental factors like toxins, and susceptibility to diseases. Moreover, SNPs serve as invaluable tools in tracing the transmission of genetic variants linked to diseases within familial lineages.
Cutting-edge technologies, such as high-throughput sequencing and long-read sequencing, employed by CD Genomics, facilitate the robust analysis of SNP and SNV genotyping. This advanced sequencing approach allows for comprehensive and efficient examination of genetic material, providing valuable insights into the molecular landscape and potential biomarkers associated with various conditions.
Single nucleotide polymorphisms (SNPs) are categorized based on the specific nucleotide substitutions they entail. Here are the common types of SNPs:
Beyond nucleotide substitutions, SNPs can also be classified based on their genomic location or their potential impact on gene function. Some notable categories include:
Understanding the various types and implications of SNPs is essential for unraveling their role in genetic diversity, disease susceptibility, and individual traits.
Recommended reading:
An Overview of SNP Genotyping Technologies.
How to Choose Suitable SNP Genotyping Method.
Single nucleotide variants (SNVs) represent alterations involving a single nucleotide within a DNA sequence, presenting in three primary patterns: single nucleotide substitution, single nucleotide deletion, and single nucleotide insertion. Substitution entails the mutation of one nucleotide into another, deletion involves the removal of a single nucleotide at a specific genomic location, and insertion denotes the repeated occurrence of a single nucleotide at a particular genomic site.
Within the realm of genomic variation analysis in cancer, a single nucleotide variant distinctive to cancer cells compared to normal tissue signifies a somatic mutation, termed SNV.
Types of Genetic Variation. (Nesta et al., 2021)
SNP, or single nucleotide polymorphism, refers to the substitution of a specific position in the DNA sequence with another single nucleotide (adenine, guanine, cytosine, or thymine). It stands as the most prevalent form of variation in the genome.
SNV (single nucleotide variant): This term denotes a variant at a solitary nucleotide position within the genome, irrespective of its frequency within the population. It serves as a neutral descriptor indicating a deviation from the reference sequence.
SNP (single nucleotide polymorphism): Also signifying variation at a lone nucleotide position, SNP characterizes a variation that is widespread in the population. Typically, for a variant to be termed a SNP, its prevalence in a given population generally surpasses 1%.
All SNPs are SNVs as they all denote variation at a single nucleotide. However, not all SNVs qualify as SNPs since not all single nucleotide variants are prevalent within the population.
Copy number variation (CNV) refers to alterations in segments of DNA within the genome, each spanning a thousand base pairs or more. These variations can result in individuals possessing either more, fewer, or no copies of a particular gene or DNA segment compared to the typical two copies. CNVs can encompass entire genes or larger genomic regions, potentially impacting gene dosage and function.
SNPs (single nucleotide polymorphisms) and CNVs represent distinct forms of genetic diversity, yet both wield significant influence over an individual's genotype and potential phenotype. When investigating the genetic underpinnings of certain diseases or traits, researchers often explore the relationship between these two variations. Integrating insights from both SNPs and CNVs enables a more comprehensive understanding of genetic factors.
In genomic association studies, SNP arrays serve as invaluable tools, capable of detecting not only SNPs but also CNVs. By leveraging these arrays, researchers can simultaneously assess the impact of both types of genetic variants within the same study cohort. This integrated approach enhances the depth and precision of genetic investigations, shedding light on the intricate interplay between SNPs and CNVs in shaping biological traits and disease susceptibility.
SNP and CNV. (Mérot et al., 2020)
The importance of SNP and SNV genotyping lies in its ability to unravel the intricate genetic landscape underlying traits, diseases, and individual variations.
PCR-RFLP stands as a classic method for SNP detection. Initially, the target fragment undergoes amplification via PCR, followed by digestion of the PCR product utilizing a specific restriction endonuclease. Should a SNP be present and alter the endonuclease cleavage site, the length of the digested fragment undergoes modification. These alterations are discernible through electrophoresis, allowing for the identification of SNP variations.
The TaqMan approach employs a distinct fluorescently labeled probe to discern SNPs. During PCR, if the target SNP site perfectly matches the probe, disassembly occurs, releasing fluorescence. The intensity of this fluorescence serves as an indicator for determining the SNP species, offering a sensitive and accurate genotyping solution.
Advancements in sequencing technology have enabled the simultaneous detection of thousands of SNPs. Deep sequencing of either the entire genome or specific gene regions facilitates the identification of a vast number of SNPs, providing comprehensive insights into genetic variations and their implications.
Principle of the WGS-SNP strategy. (Doitsidou et al., 2010)
Gene microarray represent another high-throughput method capable of detecting millions of SNPs concurrently. The sample's DNA undergoes fragmentation and hybridization with pre-designed probes. By assessing the signal intensity post-hybridization, the type of SNP can be determined swiftly and efficiently.
Recommended reading: The Applications of SNP Microarray.
Sanger sequencing, a traditional DNA sequencing technique, remains a reliable method for SNP detection. Initially, the target region undergoes PCR amplification before sequencing via the Sanger method. By comparing the sequencing results with a reference sequence, SNPs can be accurately identified, providing valuable genetic information.
In summary, various genotyping techniques offer diverse approaches for detecting SNPs and SNVs, each possessing distinct advantages in terms of sensitivity, throughput, and accuracy. These methods play a pivotal role in unraveling the complexities of genetic variations, thereby advancing our understanding of genetics and genomics.
References: