Skip to main content
Social Sci LibreTexts

3.5: Genomics, Epigenetics, and Genetic Testing

  • Page ID
    158733
    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    GENOMICS

    The genome is all of the genetic material for an organism. In the case of humans, this includes 46 chromosomes and mtDNA. The human genome contains approximately three billion base pairs of DNA and has regions that are both noncoding and coding. Scientists now estimate that the human genome contains 20,000–25,000 protein-coding genes, with each chromosome containing a few hundred to a few thousand genes. As our knowledge of heredity increases, researchers have begun to realize the importance of epigenetics, or changes in gene expression that do not result in a change of the underlying DNA sequence. Epigenetics research is also crucial for unraveling gene regulation, which involves complex interactions between DNA, RNA, proteins, and the environment.

    Definition: genome

    All the genetic information of an organism.

    The vast majority of the human genome is noncoding, meaning there are no instructions to make a protein or RNA product in these regions. Historically, noncoding DNA was referred to as “junk DNA” because these vast segments of the genome were thought to be irrelevant and non-functional. However, continual improvement of DNA sequencing technology along with world-wide scientific collaborations and consortia have contributed to our increased understanding of how the genome functions. Through these technological advances and collaborations, we have since discovered that many of these noncoding DNA regions are involved in dynamic genetic regulatory processes.

    Definition: sequencing

    A molecular laboratory procedure that produces the order of nucleotide bases (i.e., sequences).

    Genomics is a diverse field of molecular biology that focuses on genomic evolution, structure and function; gene mapping; and genotyping (determining the alleles present). Evolutionary genomics determined that humans and chimpanzees share a significant portion of shared DNA sequence (about 98.8%). Given the phenotypic differences between humans and chimpanzees, having a DNA sequence difference of 1.2% seems surprising. However, a lot of genomics research is also focused on understanding how noncoding genomic regions influence how individual genes are turned “on” and “off” (i.e., regulated). Therefore, although DNA sequences are identical, regulatory differences in noncoding genetic regions (e.g., promoters) are believed to be largely responsible for the physical differences between humans and chimpanzees.

    Definition: genotyping

     A molecular procedure that is performed to test for the presence of certain alleles or to discover new ones.

    Further understanding of genomic regulatory elements can lead to new therapies and personalized treatments for a broad range of diseases. For example, targeting the regulatory region of a pathogenic gene to “turn off” its expression can prevent its otherwise harmful effects. Such molecular targeting approaches can be personalized based on an individual’s genetic makeup. Genome-wide association studies (GWAS) seek to determine genes that are linked to complex traits and diseases and typically require significant computational efforts. This is because millions of DNA sequences must be analyzed and GWAS sometimes include thousands of participants. During the beginning of the genomics field, most of the large-scale genomics studies only included North American, European, and East Asian participants and patients. Researchers are now focusing on increasing ethnic diversity in genomic studies and databases. In turn, accuracy of individual disease risk across all human populations will be improved and more rare-disease-causing alleles will be identified.

    Definition: pathogenic

    A genetic mutation (i.e., allele) that has a harmful phenotypic disease-causing effect.

    EPIGENETICS

    All cells within your body have the same copy of DNA. For example, a brain neuron has the same DNA blueprint as does a skin cell on your arm. Although these cells have the same genetic information, they are considered specialized. The reason all cells within the body have the same DNA but different morphologies and functions is that different subsets of genes are turned “on” and “off” within the different cell types. A more precise explanation is that there is differential expression of genes among different cell types. In the case of neuronal cells, a unique subset of genes are active that allow them to grow axons to send and receive messages. This subset of genes will be inactive in non-neuronal cell types such as skin cells. Epigenetics is a branch of genetics that studies how these genes are regulated through mechanisms that do not change the underlying DNA sequence. Special Topics: Epigenetics and X Chromosome Inactivation details a well-known example of epigenetic regulation.

    The prefix epi means “on, above, or near,” and epigenetic mechanisms such as DNA methylation and histone modifications occur on, above, or near DNA. The addition of a methyl group (—CH₃) to DNA is known as DNA methylation (Figure 3.5.1). DNA methylation and other modifications made to the histones around which DNA are wrapped are thought to make chromatin more compact. This DNA is inaccessible to transcription factors and RNA polymerases, thus preventing genes from being turned on (i.e., transcribed). Other histone modifications have the opposite effect by loosening chromatin, which makes genes accessible to transcription factors.

    Definition: DNA methylation

    Methyl groups bind DNA, which modifies the transcriptional activity of a gene by turning it “on” or “off.”

    It is important to note that environmental factors can alter DNA methylation and histone modifications and also that these changes can be passed from generation to generation. For example, someone’s epigenetic profile can be altered during a stressful time (e.g., natural disasters, famine, etc.), and those regulatory changes can be inherited by the next generation. Moreover, our epigenetic expression profile changes as we age. For example, certain places in our genome become “hyper” or “hypo” methylated over time. Identical twins also have epigenetic profiles that become more different as they age. Researchers are only beginning to understand what all of these genome-wide epigenetic changes mean. Scientists have also discovered that changes in epigenetic modifications can alter gene expression in ways that contribute to diseases. It is also important to note that, unlike DNA mutations (which permanently change the nucleotide sequence), epigenetic changes can be easily reversed. A lot of research now focuses on how drugs can alter or modulate changes in DNA methylation and histone modifications to treat diseases such as cancer.

    Definition: epigenetic profile

    The methylation pattern throughout a genome—that is, which genes (and other genomic sites) are methylated and unmethylated.

    Different types of epigenetic histone tail modifications that can tighten (top) and loosen (bottom) the chromatin of DNA.Figure \(\PageIndex{1}\): Different types of epigenetic histone tail modifications that can tighten (top) and loosen (bottom) the chromatin of DNA.
    Special Topic: Epigentics and X Chromosome Inactivation
    3.6.2.jpgFigure \(\PageIndex{2}\): A multicolored coat pattern as the result of X chromosome inactivation during development.

    Mary Lyon was a British geneticist that presented a hypothesis for X chromosome inactivation (called the Lyon hypothesis) based on her work and other studies of the day. Females inherit two X chromosomes, one from each parent. Males have one functional X chromosome; however, this does not mean females have more active genes than males. During the genetic embryonic development of many female mammals, one of the X chromosomes is inactivated at random, so females have one functional X chromosome. The process of X chromosome inactivation in females occurs through epigenetic mechanisms, such as DNA methylation and histone modifications. Recent studies have analyzed the role of a long noncoding RNA called X-inactive specific transcript (XIST), which is largely responsible for the random silencing of one of the X chromosomes. The presence of two X chromosomes is the signal for XIST RNA to be expressed so that one X chromosome can be inactivated. However, some cells may have an active paternal X chromosome while other cells may have an active maternal X chromosome. This phenomenon is easily seen in calico and tortoiseshell cats (Figure 3.5.2). In cats, the gene that controls coat color is found on the X chromosome. During early embryo development, random inactivation of X chromosomes gives rise to populations of cells that express black or orange, which results in the unique coat patterning. Therefore, calico cats are typically always female.

    In order to assist with public health efforts, newborn screening for genetic diseases have been available in the United States for over 50 years. One of the first available genetic tests was to confirm a phenylketonuria (PKU) diagnosis in infants, which is easily treatable with a dietary change. Currently, each state decides what genes are included on newborn screening panels and some states even have programs to help with infant medical follow-ups.

    There are now hundreds of laboratories that provide testing for a few thousand different genes that can inform medical decisions for infants and adults. What has made this industry possible are the advancements in technology and decreased cost to patients. Moreover, genetic testing has been made available publicly to anyone without the assistance of medical professionals.

    GENETIC TESTING

    Polymerase Chain Reaction (PCR) and Sanger Sequencing

    One of the most important inventions in the genetics field was polymerase chain reaction (PCR). In order for researchers to visualize and therefore analyze DNA, the concentration must meet certain thresholds. In 1985, Kary Mullis developed PCR, which can amplify millions of copies of DNA from a very small amount of template DNA (Figure 3.40). For example, a trace amount of DNA at a crime scene can be amplified and tested for a DNA match. Also, aDNA is typically degraded, so a few remaining molecules of DNA can be amplified to reconstruct ancient genomes. The PCR assay uses similar biochemical reactions to our own cells during DNA replication.

    Definition: polymerase chain reaction (PCR)

    A molecular biology procedure that can make copies of genomic DNA segments. A small amount of DNA is used as a starting template and is then used to make millions of copies.

    Gel electrophoresis used to visualize DNA after PCR amplification.Figure \(\PageIndex{3}\): Gel electrophoresis used to visualize DNA after PCR amplification.

    In Sanger sequencing, PCR sequences can be analyzed at the nucleotide level with the help of fluorescent labeling. Several different types of alleles and genetic changes can be detected in DNA by using this analysis. Figure 3.5.4 shows someone who is heterozygous for a single nucleotide allele. These methods continue to be used extensively alongside larger-scale genome technologies.

    Definition: Sanger sequencing

    A process that involves the usage of fluorescently labeled nucleotides to visualize DNA (PCR fragments) at the nucleotide level.

    Sanger sequencing results showing a heterozygous DNA nucleotide.Figure \(\PageIndex{4}\): Sanger sequencing results showing a heterozygous DNA nucleotide.

    Genetic Biotechnology and Clinical Testing

    Microarray chip with fluorescent labeled probes that hybridize with DNA to detect homozygous and heterozygous nucleotides throughout the genome.Figure \(\PageIndex{5}\): Microarray chip with fluorescent labeled probes that hybridize with DNA to detect homozygous and heterozygous nucleotides throughout the genome.

    Genetic innovations are transforming the healthcare industry. However, the different types of technology and the results of these tests often include a learning curve for patients, the public, and medical practitioners. Microarray technology, when DNA samples are genotyped (or “screened”) for specific alleles, has been available for quite some time (Figure 3.42). Presently, microarray chips can include hundreds of alleles that are known to be associated with various diseases. The microarray chip only binds with a DNA sample if it is “positive” for that particular allele and a fluorescent signal is emitted, which can be further analyzed.

    Definition: microarray technology

    A genotyping procedure that utilizes a microarray chip, which is a collection of thousands of short nucleotide sequences attached to a solid surface that can probe genomic DNA.

    If a patient is suspected of having a rare genetic condition that cannot be easily diagnosed or the diagnosis is entirely unknown, whole genome sequencing may be recommended by a doctor. Next-generation sequencing (NGS) is a newer technology that can screen the entire genome by analyzing millions of sequences within a single machine run (Figure 3.5.6). However, sequencing the entire genome yields a significant amount of data and information. Therefore, clinical NGS genetic testing typically only includes a small subset of the genome known to have pathogenic disease-causing mutations.

    Definition: next-generation sequencing (NGS)

    A genotyping technology that involves producing millions of nucleotide sequences (from a single DNA sample) that are then read with a sequencing machine. It can be used for analyzing entire genomes or specific regions and requires extensive program-based applications.

    3.7.4.pngFigure \(\PageIndex{6}\): Next-generation sequencing machines.

    There is a diversity of clinical genetics tests available to assist patients with making medically informed decisions about family planning and health, including assistance with in vitro fertilization (IVF) procedures and embryo genetic screening. To ensure accuracy, it is highly important that all clinical laboratories are continually regulated. The Clinical Laboratory Improvement Amendments (CLIA) are United States federal standards that all human laboratory testing clinics must follow. A major benefit provided by some clinical genetic testing companies is access to genetic counselors, who have specialized education and training in medical genetics and counseling. Both partners are usually tested to see if there is a risk for passing on a disease to a child. Counselors use their skillset to aid patients and doctors with risk assessment for genetic diseases and interpretation of genetic testing results. Genetic counselors also guide and support patients when making impactful medical decisions.

    Direct-to-Consumer (DTC) Genetic Testing

    A positive result for a genetic allele associated with an increased risk for celiac disease.Figure \(\PageIndex{7}\): A positive result for a genetic allele associated with an increased risk for celiac disease.

    Genetic testing that is performed without the guidance of medical professionals is called direct-to-consumer (DTC) genetic testing. Companies that sell affordable genome sequencing products to the public continue to increase in number and popularity. These companies have marketing campaigns typically based on the notion of “personal empowerment,” which can be achieved by “knowing more about your DNA.” For example, if you are identified as having a slightly increased risk for developing celiac disease (Figure 3.5.7), then you may be motivated to modify your dietary consumption by removing gluten from your diet. Another scenario is that you could test positive for a known pathogenic BRCA1 or BRCA2 allele. In this case, you may want to follow up with additional testing from a medical facility, which could lead to life-altering decisions. DNA sequencing products for entertainment purposes are also available. For example, by having your “Vinome” analyzed, you can discover if you are more predisposed to liking Pinots as your favorite wine. Additionally, an individual and their partner can be genotyped to predict what physical characteristics their baby might inherit.

    DTC testing typically lacks genetic counselor services and regulations are not as strict. This has led to some controversies including company genetic products that provide health information. The company 23andMe was the first on the market to offer DTC health testing, and in 2013, the U.S. Food and Drug Administration (FDA) intervened. 23andMe worked toward complying with FDA regulations and then gained approval to offer testing on a few medically related genes. In 2017, 23andMe offered a “Late-Onset Alzheimer’s Disease” genetic risk report. Such offerings have been criticized because customers could receive results they may not fully be able to interpret. In turn, this could increase the stress of participants (sometimes called the “burden of knowing”) and could also lead to unnecessary medical intervention. In order to address this issue, 23andMe now provides disclaimers and also interactive learning modules that customers must complete if they wish to view certain genotyping results. However, individuals who tested positive for a disease-causing allele have also been able to successfully seek medical help. The potential for harm and the proposed benefits of DTC testing continue to be a topic of debate and investigation.

    An example of ancestry percentage results provided to customers.Figure \(\PageIndex{8}\): An example of ancestry percentage results provided to customers.

    Ancestry percentage tests are also widely popular (Figure 3.5.8). Customers are genotyped and their alleles are assigned to different groups from around the world (Chapter 4 will discuss human biological variation in further detail). However, the scientific significance and potential harm of ancestry percentage tests have been called into question. For example, most alleles tested are not exclusive to one population, and populations may be defined differently depending on the testing companies. If an allele is assigned to the “Irish” population, there is a good chance that the allele may have evolved in a different cultural group or region that pre-dates the formation of the country Ireland. In other words, genetic variation often pre-dates the origins the population and geographical names of the region used by genetic testing companies. Another critique is that someone’s identity need not include biological relationships. Individuals also have the option to find and connect online with individuals with whom they share portions of their genome, which has resulted in both positive and negative outcomes. Another interesting consideration is that law enforcement is currently developing forensic techniques that involve mining DTC genomic databases for the purpose of identifying suspects linked to crimes. Regardless of these various considerations, there are now millions of individuals worldwide who have “unlocked the secrets” of their DNA, and the multi-billion dollar genomics market only continues to grow.

    As you have seen in this chapter, DNA provides instructions to our cells, which results in the creation and regulation of proteins. Understanding these fundamental mechanisms is important to being able to understand how the evolutionary process works (see Chapter 4) and how humans vary from one another (see Chapters 13 and 14). It is also the advancement in genetic technologies—including ancient DNA studies, genomics, and epigenetics—that has led to new anthropological understandings about our biological relationships to other living (extant) and extinct primates. Many of these genetic discoveries will be covered in the chapters to come.

    FIGURE ATTRIBUTIONS

    Figure 3.5.1 Epigenetic Control (Biology 2e, Figure 16.7) by OpenStax is used under a CC BY 4.0 License.

    Figure 3.5.2 “Rue” the calico cat by Hayley Mann is under a CC BY-NC 4.0 License.

    Figure 3.5.3 PCR electrophoresis gel by Hayley Mann is under a CC BY-NC 4.0 License.

    Figure 3.5.4 Sanger sequencing with heterozygous result by Hayley Mann is under a CC BY-NC 4.0 License.

    Figure 3.5.5 DNA microarray by Guillaume Paumier (user:guillom) is used under a CC BY-SA 3.0 License.

    Figure 3.5.6 Illumina Hiseq 2,000 sequencers, BGI Hong Kong sequencing room by Scotted400 is used under a CC BY 3.0 License.

    Figure 3.5.7 Positive carrier result for celiac disease allele by Hayley Mann is under a CC BY-NC 4.0 License.

    Figure 3.5.8 DNA ancestry percentage test results by Hayley Mann is under a CC BY-NC 4.0 License.


    This page titled 3.5: Genomics, Epigenetics, and Genetic Testing is shared under a CC BY-NC 4.0 license and was authored, remixed, and/or curated by Beth Shook, Katie Nelson, Kelsie Aguilera, & Lara Braff, Eds. (Society for Anthropology in Community Colleges) via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.