"

12 The Genetic Code and the Tree of Life

Two key classes of chemicals have been seen to operate in all known living things on Earth: amino acids and nucleic acids.  Amino acids are the fundamental building blocks of proteins, which are essential for most biological processes.  Nucleic acids make up the genetic code of all living things on Earth, providing the molecular basis for evolution. Once life formed on Earth, over time it became more complex…eventually leading to humans capable of pondering their own existence. All life on Earth descends from the very first instance of life. Natural selection drove species to form and adapt to new environments and ecological niches. The rise of oxygen in the atmosphere about 2.5 billion years ago caused a proliferation of new life forms that had more efficient metabolic pathways. We will look at some of the milestones in the formation of more complex life throughout Earth’s 4.5 billion year history.

Learning Objectives

By the end of this chapter, you will be able to:

  • Explain the structure of amino acids and proteins
  • Discuss the importance of proteins for the functioning of life
  • Describe the basic features of DNA and RNA and their components
  • Discuss the role of an organism’s genetic code and the impact of genetic mutations
  • Describe the three different domains of the tree of life
  • Explain the difference between prokaryotes and eukaryotes
  • Explain how endosymbiosis leads to more complex cells
  • Describe how cyanobacteria increased the amount of oxygen in the Earth’s early atmosphere
  • Explain how aerobic respiration is more efficient than anaerobic respiration

Amino Acids to Proteins

Amino acids and the proteins they combine to form are found in every living organism on Earth. Proteins are needed to make critical biological reactions happen on timescales relevant to life. Proteins regulate the processes that drive life. Though proteins are found across all living things and carry out a wide variety of roles, every protein on Earth is made up of only twenty different amino acids.

Amino Acids

Amino acids are molecules that are defined by a specific structure; all amino acids consist of a central carbon atom forming four bonds to: (1) a hydrogen atom, (2) an amino group (-NH2), (3) a carboxyl group (-COOH), and (4) a changeable side chain (see Figure 1 below). In organic chemistry, molecules with a carboxyl group are called carboxylic acids. This along with the amino group gives these compounds the name amino acid.

Figure 1 – The components of an amino acid. Each amino acid contains an amino group (blue) and a carboxyl group (red), which gives them the name amino acid. Amino acids acquire different properties by hosting different side chains (green), often denoted by an R or called the R-group. This allows for the wide variety of functionality seen in proteins.

The amino and carboxyl groups allow amino acids to bond to one another through a process of dehydration, i.e. a process that releases water (H2O).  Figure 2 below diagrams the reaction with structural formulas.  Water is formed by the loss of an -OH group and a hydrogen (H) atom.

This process forms a new bond, called a peptide bond, between the carbon and nitrogen atoms.  Chains of amino acids are joined by peptide bonds; these chains are therefore also called polypeptide chains. Figure 3 shows an example of a polypeptide chain.  Polypeptide chains are then folded into proteins.  Note that Figure 3 uses a mixture of a structural and molecular formula; specifically, not all bonds to Hydrogen atoms are shown with a line.

 

Figure 2 – The formation of a peptide bond. The -OH from the carboxyl group and the -H from the amino group react to form the peptide bond (purple). These atoms are released as a molecule of water, H2O.
Figure 3 – A polypeptide chain. Pictured above is the chain of the first seven amino acids that make up the OPN1LW gene, which allows humans to distinguish red colors. The different amino acids in the chain feature different side chains, shown and labeled in green. This is simply an example, so there is no need to memorize the amino acid side chains or their names. Can you identify the repeating pattern of the backbone of the chain?

The side chains give each amino acid a unique functionality. Examples of amino acids and their side chains are shown in Figure 3 in green.  Amino acids can be positively or negatively charged, water-repellent, bulky, bent into different configurations, or have other properties depending on their side chains. These differences help the polypeptides fold as they form proteins, bind to specific compounds, or chemically react in different ways.

Despite the fact that more than 500 amino acids exist on Earth, living things on Earth incorporate only 20 different amino acids to form the vast array of proteins that are used to regulate chemical reactions in all aspects of life. This is similar in spirit to the concept that even though the English languages only uses 26 letters, those same letters make up the words that compose millions of books.

Want to know more: Essential Amino Acids, Body Building, and You

Of the 20 amino acids used by life on Earth, the human body is capable of synthesizing all but nine. These nine amino acids are known as the essential amino acids. It is important to include sources of these amino acids either from meat or plants in a healthy diet since the body has no other source for them.

In fact, many products sell amino acids as supplements targeted towards endurance athletes and body builders. Scientists have been able to trace different amino acids and the role they play in muscle contraction or recovery to identify what the body needs after the coach has said “last set'” for the third set in a row. For example, glutamine is drained during intense physical activity. If the body’s glutamine stores become depleted, the body begins to break down muscle cells to compensate.

Proteins

It is commonly said that you are what you eat, but perhaps more correctly, you are what your proteins decide to do. Proteins are the driving force behind the processes of life. Many proteins act as enzymes, which are highly specified molecules that allow complicated organic reactions to progress more easily.

In most organic reactions, the molecules involved must first assume an unfavorable, intermediate configuration (see Figure 5 below) before progressing to the finished product. Enzymes bind to these starting molecules and act to stabilize the intermediary state. This makes it easier for molecules to progress to the desired final products. Enzymes are the primary why to increase reaction rates for biochemical processes.

Figure 5 – An example of how enzymes operate. If the goal is to bend a metal bar until it breaks, the shape of an enzyme will encourage the bent shape of the metal rather than either the starting bar of the ending broken pieces. In this example, the bent bar is the transition state that is stabilized.  If there was an enzyme meant to bend a metal bar into a circular spring, what might that enzyme look like?

In addition to acting as enzymes, proteins also fulfill several other important roles. Proteins are involved with cell signaling, which helps different cells in the body work together. Antibodies are proteins that work with the body’s immune system to recognize and destroy foreign substances that might cause illness. Structural proteins give shape or rigidity to cells, such as those that make up our nails or hair. Motor proteins allow for the movement of single-celled organisms. In short, proteins are critical for all of the basic functions of life.

 

Chirality of Amino Acids and Proteins

The central carbon in an amino acid can serve as a chiral center because it is typically bound to four different groups. Recall that chirality is defined as the property of an object that can not be superimposed on its mirror image, like how your palm-up hands cannot lay exactly on one another.  The one exception is the amino acid, glycine, whose hydrogen side chain makes it a symmetric molecule.

The chirality of amino acids means there exists both left-handed and right-handed amino acids (Figure 6 below). Oddly, while either configuration is possible, life on Earth only uses left-handed amino acids.

Figure 6 – The two possible configurations of amino acids with differing chiralities. The R group is pointing out of the page in both cases.  Can you think of something that is chiral, such as amino acids or your hands?

Studies have been done to investigate how the chirality of protein affects how it is made and performs. Synthesis of both left-handed and right-handed amino acids is not only possible, it is chemically equivalent. From an energy standpoint, protein that is composed entirely of right-handed amino acids should function just as well as proteins made of left-handed amino acids. The only difference is that using only one type of chirality could add an extra layer of regulation in biochemical reactions that may help to reduce synthesis errors.

How did life come to pick left-handed proteins over right-handed ones? There are many competing theories as to the origins of this inequality. One idea is that left-handed amino acids are slightly more water soluble (i.e., easier to dissolve in water), which could have made them easier to incorporate into early life.  Amino acids may also have been affected by the polarization of light.

Want to know more: Polarization

The polarization of light defines how the wave component of light oscillates relative to the direction in which the light is moving. Figure 7 shows light traveling from the bottom-right corner to the top-left corner.  It shows how the polarization of light can be (1) incoherent, as in the right-most third of Figure 7, (2) oscillate back and forth in only one direction, i.e., linearly polarized as in the center of Figure 7, or (3) circularly polarized as is shown in the first third of Figure 7. Different types of filters (shown in Figure 7 as blue squares) help give rise to these types of polarization. In the early Solar System, it is thought that dust grains could have caused all light to be circularly polarized.

Figure 7 – Three different polarizations of light.  The polarization of light defines how the wave component of light (red curve) oscillates relative to the direction in which the light is moving (bottom right to top left in the above figure). Most light, as shown in the bottom and right of the figure, is randomly polarized.  This polarization can be changed with different filters, represented by blue squares in the figure.  A linear polarizer causes the light to move back and forth along one direction, is shown in the center of the figure.  A quarter wave plate results in circularly polarized light, as shown in the top and left of the figure.

It is possible that this circularly polarized light might have been more damaging to right-handed amino acids or more favorable to left-handed ones. Amino acids found in space, for example on meteorites, also exhibit an excess of left-handed molecules. If polarized sunlight gave rise to this imbalance, it could have tipped the scales to the left for life on Earth. Regardless of what established the original inequality, biological processes probably accentuated the imbalance.

Nucleic Acids to Genes

Nucleic acids are used throughout life on Earth to transfer genetic information during cell replication.  This genetic information defines the nature and structure of organisms.

Genes that are shared through cell replication and reproduction is why offspring look like their parents.  Differences in genetic information can give rise to differences in the traits of an organism.  For example, a change in genetic information may result an organism that is taller/shorter or fur color that is lighter/darker.

The process of gene replication—specifically gene mutation, which describes the errors that occur during gene replication—allow for different traits to arise.  Some traits may allow an organism to survive better in a given environment.  Lighter fur may be useful to blend in with a snowy environment; darker fur may be more helpful in a dark forest.

Organisms with these advantageous traits are more likely to be able to reproduce and pass on their favorable traits.  This process of natural selection allows organisms to adapt and evolve to different environments.  This ability to grow better equipped to surviving in an environment, also known as Darwinian evolution, is thought to be a key aspect of all living things.

Nucleotides

The nucleic acids found in all known living things on Earth take two forms: ribonucleic acid (RNA) and deoxyribonucleic acid (DNA).  Nucleic acids are long, complex chains made up of nucleotides, a specific kind of molecule as diagramed in Figure 8.  Nucleotides consist of three components: (1) a central sugar, (2) an interchangeable base, and (3) a phosphate group.

Figure 8 – RNA and DNA consist of nucleotides with three components: a central sugar (blue), a phosphate group (red), and an interchangeable base (green). The difference between the ribose sugar, found in RNA, and the deoxyribose sugar, found in DNA, is highlighted in purple.  What do you think the prefix deoxy- means in this context?

RNA and DNA differ by the central sugar of the nucleotides that makes them up.  RNA uses ribose sugar while DNA uses deoxyribose sugar, which has one less oxygen molecule (note the atoms bonded to the 2nd carbon atom of each nucleotide shown in Figure 8).

The base of a nucleotide changes depending on the specific nucleotide, much like how different amino acids have different side chains.  In living things on Earth, five different bases are used.  RNA uses adenine, cytosine, guanine, and uracil (ACGU).  DNA also uses adenine, cytosine and guanine, but uses thymine in place of uracil (ACGT).  These bases bond to each other in specified ways.  Cytosine always binds to guanine.  Adenine bonds to the thymine in DNA and the uracil in RNA.

The structural formula for the five different bases are drawn in Figure 9.  Nucleotide bases fall into two categories: double-ringed purines (A,G) and single-ring pyrimidines (C, T, U).  Some of these bases were named after the material where they were first discovered. For instance, guanine was first discovered from guano, a fancy name for bird poop.

Figure 9 – The five bases known to be used in nucleotides.  The bases are separated into purines, with two rings (left), and pyrimidines, with one ring (right). A dotted, blue line indicates where each base binds to the central sugar of a nucleotide. Thymine is used in DNA while uracil is specific to RNA. The other three bases are used in both RNA and DNA. What is the only difference between thymine and uracil?

A phosphate group consists of a phosphorus atom bound to four oxygen atoms. This configuration contains many high energy bonds.  The energy stored in this phosphate group allows nucelotides to undergo the reaction that links nucleotides together to form the long-chain nucleic acids that are RNA and DNA.  These nucleic acids have a specific structure that was revealed only in the 1950s.

Want to know more: ATP – Cellular Energy Banks

An influx of energy is required to carry out many of the processes necessary for life.  You eat to enable your body to create stores of energy to use for these reactions.  You eat to create adenosine triphosphate, or ATP.

ATP, shown in Figure 10, is the most widely used energy carrier in all living organisms on Earth.  ATP is also a nucleotide consisting of (1) a ribose sugar, (2) an adenine base, and (3) three phosphate groups bonded in a chain.  When energy is needed for a chemical reaction, one of the high-energy phosphate to oxygen bonds in the chain is broken.  This converts ATP to ADP, adenosine diphosphate, where only two phosphate groups remain.

Phosphate groups are used across living organisms on Earth to supply the energy for necessary reactions.  Can you think of reasons why having a common source of energy across reactions would be beneficial?  (Hint: would you rather have different charging cables for each of your devices, or one common charging cable?)

Figure 10 – The chemical structure of adenosine triphosphate, ATP. The three phosphate groups are shown in red. What bond breaks to release a phosphate group, which consists of one phosphorus (P) atom and four oxygen (O) atoms?
How do we know: DNA’s structure

The exact structure of DNA was revealed through a series of insights that built on one another. When DNA was broken up into its constituent nucleotides, it was discovered that certain bases always appeared in the same proportions. The number of adenine and thymine nucleotides was always equal, and the number of cytosine nucleotides was always equal to the number of guanine nucleotides. In 1949, Erwin Chargoff, an Austro-Hungarian-born American biochemist, sought to explain this observation with the idea of base pairing—the idea that in DNA adenine is always bonded to thymine and cytosine is always bonded to guanine.

Observations of DNA using x-ray crystallography further revealed the structure of the molecule. X-ray crystallography is a complicated technique akin to shining a flashlight into a hall of mirrors and determining where the mirrors are placed based on the way that the light bounces around.

Rosalind Franklin, a British chemist, had a background in physical chemistry that she used to improved on x-ray crystallography techniques in the mid 1900s.  Franklin produced unprecedentedly precise x-ray crystallography images (see Figure 11) while working in the lab of Maurice Wilkins. Her most famous photo, known as “Photo 51”, held the key to DNA’s structure.  This photo was heralded by J.D. Bernal, the father of x-ray crystallography in biochemistry, as “among the most beautiful x-ray photographs of any substance ever taken.

Figure 11 – DNA: picture and pioneer. (left) The x-ray crystallography image of DNA taken by Rosalind Franklin (right) that confirmed the double helix model.

The theoretical biochemists James Watson and Francis Crick used the idea of base pairing and Franklin’s images to reveal the double-helix structure of DNA. Watson discovered that the adenine-thymine bond was exactly the same length as the cytosine-guanine bond, which helped him form the picture of each base pair as rungs of a ladder. Crick helped to develop a mathematical model for the pattern that a helical structure would produce with x-ray crystallography.

In 1951, Crick and Watson began to work together.  When Maurice Wilkins showed them Rosalind Franklin’s Photo 51, they were able to piece together the double-helix model of DNA (see Figure 12).

Figure 12 – The double helix of DNA.  The base pairs adenine-thymine (green and red respectively) and cytosine-guanine (blue and black respectively) form the rungs of the twisted ladder.  The central deoxyribose sugar and phosphate of the nucleotides forms the backbone of the twisted ladder.

 

The double-helix backbone of DNA is composed of the sugar and phosphate components of nucleotides. The bases stick out from this backbone and bind to their appropriate counterpart through weak hydrogen bonds. In the most common form, the bases appear parallel to each other, like a well-designed stairwell. The double-stranded nature of DNA affords a rigid, stable, and long-lived structure. During DNA replication, each strand is checked against the other to reduce copying errors or accidental mutations.

Translating the Genetic Code

The genetic code in DNA is translated into instructions for how to manufacture proteins with the help of RNA. Messenger RNA (mRNA) transcribes the code from where DNA is located in the cell and carries this information to the ribosome.  Ribosomes are the molecules responsible for fabricating proteins in a cell.

A ribosome can read the genetic code from mRNA and translate it to the necessary amino acids to build a protein.  Ribosomes themselves are made up in part of RNA, known as ribosomal RNA (rRNA).  Each amino acid is specified by different codons, a sequence of three base pairs. Transfer RNA (tRNA) matches up each codon with the appropriate amino acid.  These different amino acids are bonded together into a polypeptide chain that can then be folded into the needed proteins.

Figure 13 – The process of translating the genetic code to proteins. mRNA, pictured in blue with multicolored bases, carries the genetic information from DNA. This information is interpreted via codons (three base pairs) into amino acids with the help of tRNA carrying the specified amino acid. The rRNA in the ribosome then helps to link together the amino acids into a polypeptide chain ready to be folded into a protein. What about the tRNA pictured here allows it to correspond a specific codon to a specific amino acid?

The Genetic Code

To summarize, the cellular process of transcription generates messenger RNA (mRNA), a mobile molecular copy of one or more genes with an alphabet of A, C, G, and uracil (U). Translation of the mRNA template converts nucleotide-based genetic information into a protein product. Protein sequences consist of 20 commonly occurring amino acids; therefore, it can be said that the protein alphabet consists of 20 letters. Each amino acid is defined by a three-nucleotide sequence called the triplet codon. The relationship between a nucleotide codon and its corresponding amino acid is called the genetic code.

Given the different numbers of “letters” in the mRNA and protein “alphabets,” combinations of nucleotides corresponded to single amino acids. Using a three-nucleotide code means that there are a total of 64 (4 × 4 × 4) possible combinations; therefore, a given amino acid is encoded by more than one nucleotide triplet.

Figure 14 – The genetic code for translating each nucleotide triplet, or codon, in mRNA into an amino acid or a termination signal in a nascent protein. (credit: OpenStax Concepts of Biology, modification of work by NIH)

Three of the 64 codons terminate protein synthesis and release the polypeptide from the translation machinery. These triplets are called stop codons. Another codon, AUG, also has a special function. In addition to specifying the amino acid methionine, it also serves as the start codon to initiate translation. The genetic code is universal. With a few exceptions, virtually all species use the same genetic code for protein synthesis, which is powerful evidence that all life on Earth shares a common origin.

DNA encodes and stores genes, which describe a unit of genetic information.  Together, all the genes stored in DNA provide a very lengthy instruction manual for all living creatures on Earth. If you took the entire chain of DNA in one human cell and completely stretched it out, it would measure roughly 2 meters, the average height of an NBA player. If you took all of the DNA from all of the cells in a human body and joined them end-to-end, they would cross the diameter of the Solar System twice.

Different segments of DNA are known as chromosomes. An organism’s genome is the complete collection of chromosomes.  Humans have 23 pairs of chromosomes, encoding roughly 25,000 genes using about 3 billion base pairs. A mosaic of the entire human genome was sequenced between 1990 and 2008.  This monumental effort brought together twenty different institutions in six different countries. This remains one of the most impressive collaborative projects in science.

Want to know more: My, What Big Genomes You Have

All the better to encode with…or is it?  Intuitively, it might seem that a larger genome would correspond to more complex organisms.  A larger genome would allow for more genes, meaning more genetic traits, meaning more complexity.  This is wrong.

An example of different genome sizes is given by the table below.  Various species are ordered by increasing genome size as defined by the number of base pairs in the organism’s complete genome.

Species Base Pairs Genes
Virus 170,000 ?
E. Coli 4,600,000 3,200
Fruit Fly 180,000,000 13,600
Chicken 1,000,000,000 23,000
Corn 2,500,000,000 59,000
Human 3,000,000,000 25,000
Lily 100,000,000,000 ?
Grasshopper 180,000,000,000 ?
Amoeba 670,000,000,000 ?

Not only does genome size not scale with perceived organism complexity, it also does not scale with the number of genes.  Humans have longer genomes than chickens, but we lose out to grasshoppers.  Humans also have a longer genome than corn, but less genes.

One reason for this is that most DNA is noncoding DNA, which does not translate directly to genes. Noncoding DNA may instead be used to signal the start of a gene, to help with DNA coiling, and/or potentially carry out several other functions that we have yet to discover.

Remarkably, more than 98% of the human genome is non-coding.  In contrast, only 20% of the DNA in bacteria is noncoding DNA.  The bladderwort plant currently holds the record for most efficient genome with only 3% noncoding DNA.

DNA encodes important instructions for life, but it can become damaged when base pairs or whole segments of DNA are deleted, inverted, duplicated, or moved around. Mutations can be damaging, for example, causing cells to become cancerous. Damage sustained to the phosphate-sugar backbone of DNA is one of the primary causes of mutations.  This type of damage is a common result of exposure to UV radiation, such as from the Sun (never skimp on sunscreen).

However, mutations can also occur naturally, resulting in expressed altered genes that give rise to new characteristics. This is the mechanism for Darwinian evolution: beneficial traits arising from mutation will be preferentially selected when mating and propagated through succeeding generations.

 

Natural mutations arise at measurable rates for different species. This mutation rate allows us to measure the genetic distance between species. This value is obtained by determining the statistical number of mutations required to change one species’ genome into another’s. For example, deer and giraffes are close in genetic difference, the genome of a deer requires relatively few differences to change into the genome of a giraffe when compared to, say, the genome of sunflowers.

Concept Check
If the history of organisms can be traced through their genetic code, is it possible to trace this history all the way back to the beginning of life?

The Last Universal Common Ancestor

The Last Universal Common Ancestor (LUCA) on Earth is a concept, rather than an actual organism. Any universal characteristics of life on Earth are universal either because they are inherited or because they are truly fundamental to life in general. Without a second example of life we are unable to understand how common these features will be on other worlds.

LUCA represents the earliest shared qualities of ancestral life and likely appeared on Earth between 3.5 to 4 Gya and seeded our planet with life. All of the features of life today would have been inherited from LUCA. From what we know about terrestrial biology, this means that LUCA would have been carbon-based, dependent on water, incorporated left-handed amino acids, and used ATP for energy transport. LUCA would have used DNA or RNA to encode genes and translate them into proteins. Significantly, the codons that translate for specific amino acids are the same in every known organism. This code must have been passed down from a common ancestor from which every other species has since evolved. LUCA might not have even been as sophisticated as a single celled organism.

The Tree of Life

The first instance of life was likely very simple and single-celled. Today, life is fantastically varied and complex. We separate the life we see today into three domains based on shared cellular structure and genetic material. The three domains of life, archaea, bacteria, and eukaryotes, are a diversification of the Last Universal Common Ancestor (LUCA).

Figure 15 – The [pb_glossary id="490"]phylogenetic[/pb_glossary] Tree of Life. The Tree of Life shows genetic relationship between all life on Earth. The stem represents the concept of the Last Universal Common Ancestor (LUCA).
  • Bacteria make up the largest domain with both the greatest number of individual species and a biomass that exceeds the combined biomass of all plants and animals. Bacteria were one of the earliest forms of life on our planet and they are found in most habitats. Bacteria can rapidly recombine their genes with other bacteria to allow for genetic innovations, such as resistance to antibiotics.
  • Archaea are genetically distinct from bacteria with their own, separate domain. Archaea are known to generate energy in a variety of ways and have been found in some of the most extreme environments on Earth. Halophiles, for example, exist in extremely salty conditions.
  • Eukaryotes are distinct from prokaryotes in containing a central nucleus enclosed in a membrane and also contain other membrane-bound organelles. Examples of eukaryote organelles include chloroplast, the site of photosynthesis in plants and some algae, and mitochondria, where energy is generated in a cell. Eukaryotes can be uni- or multi-cellular, allowing for larger and more complex organisms.
Figure 16 – A comparison of eukaryotic cells to prokaryotic cells. Note how the nucleoid, or DNA, of the prokaryotic cell is free floating while the DNA of the eukaryotic cell is contained with a nuclear membrane. While both have certain organelles, such as ribosomes, only eukaryotes possess membrane-bound organelles like mitochondria.

Both bacteria and archaea are prokaryotes: single-celled microbes that do not contain membrane-bound organelles. However, the membranes in archaea incorporate a different type of lipid than either bacteria or eukaryotes. Genetic analysis reveals that archaea are closer to eukaryotes in an evolutionary sense than they are to bacteria. Archaea and eukaryotes use many of the same enzymes for DNA translation.

The Diversification of Life

Endosymbiosis

Eukaryotic cells have gained membrane-bound organelles and increased complexity through the process of endosymbiosis. Endosymbiosis is a process whereby primitive organisms benefited by living inside other organisms. Chloroplasts and mitochondria are examples of highly complicated organelles in eukaryotes that have their own membrane. They retained the DNA, messenger RNA, transfer RNA, and ribosomes of their bacterial ancestors before they were symbiotically incorporated into larger eukaryotic cells. The larger cell presumably provided protection and easy access to organic molecules while the chloroplast and mitochondria provided energy to the larger cell. This beneficial relationship led to the creation of larger, more efficient cells.

Want to know more: Mother’s Mitochondria

With sexual reproduction, the offspring ends up with a combination of the mother’s and father’s DNA. However, the DNA found in mitochondria is exclusively the mother’s DNA. When cells are replicated, the mitochondria split themselves as needed and so maintain a self-consistent set of DNA. With mammals, the egg destroys most of the mitochondria in sperm when they merge. In addition, most of the sperm’s mitochondria is positioned in the tail to provide energy and does not make it into the egg.

This preservation of maternal mitochondrial DNA is often used to trance ancestries. Because it is contained only in the mitochondria and is infrequently used, mitochondrial DNA also suffers fewer mutations. It is therefore also helpful in determining the ancestry of different species and how they fit onto the tree of life.

Cyanobacteria

The rise of oxygen likely began with organisms known as cyanobacteria, an early type of photosynthetic bacteria thought to be the first organism to produce oxygen as a byproduct. Photosynthesis is the process by which organisms can harness the energy of the sun to generate energy for their own use. Cyanobacteria are the only know prokaryotes to produce oxygen, and this adaptation brought about the destruction of many other organisms.

Want to know more: Oxidation Reactions and the Free Radicals

Oxidation describes a chemical process in which a molecule, atom, or ion loses an electron. Oxygen is a particularly good oxidizing agent because its nucleus strongly attracts electrons to fill its valence shell. Oxidation often produces free radicals, which are very reactive. They can attack and break apart bonds in other molecules, inciting a chain reaction that is damaging for biochemical reactions.

Antioxidants are compounds that inhibit oxidation and thereby prevent the formation of free radicals. Plants and animals have many natural antioxidant systems in place to guard against this or use antioxidant vitamins such as vitamin A, vitamin C, and vitamin E. A certain amount of antioxidants is required in a well-balanced diet. However, clinical studies have been unable to prove benefits of artificially increasing antioxidant intake or antioxidant supplements.

The rise in atmospheric oxygen was far from smooth and steady (see Figure 16 below). Cyanobacteria slowly became more abundant, but there was still a significant delay in the build up of oxygen. There are several processes that would have hindered the rise in oxygen. Oxygen would have reacted with various chemicals, mainly iron, in the oceans, and these reactions would trap oxygen, prevent it from building up the atmosphere.

Figure 16 – The evolution of oxygen in Earth’s atmosphere. The upper red and lower green lines represent the range of the estimates.  Time is measured in billions of years ago (Ga), and the rise is divided into stages: Stage 1 (3.85–2.45 Ga): Practically no O2 in the atmosphere; Stage 2 (2.45–1.85 Ga): O2 produced, but absorbed in oceans and seabed rock; Stage 3 (1.85–0.85 Ga): O2 starts to gas out of the oceans, but is absorbed by land surfaces and formation of ozone layer; Stages 4 and 5 (0.85 Ga–present): O2 sinks filled, the gas accumulates. Credit: Heinrich D. Holland derivative work: Loudubewe, CC BY-SA 3.0, via Wikimedia Commons.

Oxygen may also have been taken up by microbes in metabolic pathways that generate energy. Organisms that used the oxygen to oxidize ammonia appear to have been plentiful at the time and could have been effective in the reduction of free-floating oxygen. Other organisms produced methane as a byproduct that could have acted as a sink for atmospheric oxygen. However, organisms that are known to produce methane require nickel to carry out the necessary reactions, and concentrations of nickel were dropping. The decrease of nickel would mean less excreted methane, and would allow oxygen to begin accumulating.

The increase of oxygen in the atmosphere was poisonous for anaerobic organisms. Most life before the Great Oxidation Event was anaerobic, so the rise of atmospheric oxygen surely resulted in one of the most significant extinction events in Earth’s history. The fossil record shows a mass extinction of anaerobic life around 2.4-1.6 billion years ago, coincident with the rise of aerobic life.

Concept Check

Why didn’t oxygen build up in the Earth’s atmosphere immediately after cyanobacteria began producing it (around 3.2 billion years ago)? Give two reasons and explain each.

Anaerobic vs Aerobic Life

The rise of atmospheric oxygen coincided with the appearance of far more complicated life forms. There is every reason to believe that the rise in oxygen would have been responsible since aerobic metabolisms are more efficient.

Respiration, in biological terms, describes the process by which organisms convert nutrients into usable energy by forming ATP bonds. Respiration begins with glycolysis, wherein glucose, a sugar, is broken down to form two molecules of a compound called pyruvate along with two molecules of ATP. Without oxygen, organisms have no choice but to undergo anaerobic respiration, or fermentation. Anaerobic respiration of yeast is what makes bread rise and beer bubbly. In this scenario, pyruvate is shuttled down a pathway which produces just two molecules of ATP.

Side Note: A byproduct of anaerobic respiration is a molecule called lactic acid. Lactic acid should be very familiar to any athlete or any student who has been very late to class and had to make a run for it. When our bodies overexert themselves, we begin to use up more oxygen than we can take in. In order to produce the energy needed to keep running, the body switches to anaerobic respiration, which leads to a build up of lactic acid. Lactic acid can damage muscle cells and hinder recovery. The process of producing this lactate is also the cause of next-day muscle soreness.

Concept Check

What are the advantages of aerobic respiration for more complex life?

In the presence of oxygen, aerobic respiration becomes possible. With the help of oxygen, pyruvate can be broken down and enters a more complicated pathway known as the Krebs cycle or the citric acid cycle (note that organisms exhibit an enormous variety of metabolic pathways; the citric acid cycle merely represents one of the more common and well understood pathways). Through the citric acid cycle, organisms can produce from 30–36 ATP from just one molecule of glucose. Though more complicated to assemble, and therefore likely taking longer to evolve, this process can be up to 18-fold more efficient than anaerobic respiration. With more energy, it is possible to carry out more biochemical processes. This may have allowed cells to become increasingly complex and trend towards the more varied, multicellular life we see today.

Want to know more: Viruses

Viruses are even more abundant than bacteria. A virus consists of three functional parts: (1) genetic material, (2) protein coat, and typically (3) an envelope of lipids outside the protein coat. For genetic material, viruses have been discovered to use both DNA and RNA. The protein coat, also known as the capsid, encases and protects this genetic material. The envelope of lipids adds an additional layer of protection.

However, the question of whether viruses can be considered life is hotly debated. Viruses have their own genetic material and are even capable of evolving through natural selection. Viruses survive and replicate by infecting a host cell. After attaching to a cell, the virus injects its DNA into the host cell. Now, the virus is capable of taking over the host cell’s replication mechanisms. In doing so, the virus can now create copies of itself until it has exhausted the cell’s resources. Millions of viruses can be made in this period before the cell dies and viruses escape to infect new host cells. Because viruses require the metabolism of a host cell to produce energy and reproduce, they can not be called self-sustaining. In this way, viruses fail the NASA definition of life.

Despite their differences, evidence of viruses appear wherever life does, suggesting that viruses and life evolved together.  The history of viruses is traced through their DNA or RNA and has given rise to three theories on how viruses came to be.

Regressive evolution theory proposes that viruses were once components of small, parasitic cells. Similar to the idea of endosymbiosis, viruses may have begun as small structures within larger cells that over time became separated. In fact, there exist today some bacteria that, like viruses, can only reproduce in a host cell. As they evolved, these small parasitic structures regressed further from cell-like characteristics, becoming the viruses we see today. However, there is no evidence of types of cells today that could serve as an intermediary between early and present day viruses. Even the smallest cellular parasites fail to really resemble viruses at all.

Another theory is the escaped gene theory, which proposes that viruses got their start as DNA or RNA that escaped from the genome of a larger organism. Surprisingly mobile DNA has recently come to play a large role in biology. Plasmids (Figure below) are circular units of DNA and separate from the genome of an organism. They are most commonly found in bacteria and have been known to move between cells. Scientists have also recently discovered transposons, or “jumping genes,” which are large segments of DNA that can move around within a cell’s genome. While either of these mechanisms could have provided the genetic material for viruses, it remains unclear where the complicated capsids enclosing these genes arose from.

Figure 4 – Plasmids, pictured as small, blue loops. Being much smaller than chromosomal DNA, plasmids can be transcribed more easily and move from one organism to another. Credit:
The pre-cellular origin theory proposes that viruses formed from complex proteins that spontaneously arose even before RNA. Viruses could have evolved alongside the first cells or even before. However, this fails to account for the dependency viruses have on host cells. If viruses need the mechanism of a host cell to survive, how did they manage before cells even evolved?

The exact origin of viruses, much like the origin of life, remains an open question. This vein of research is being pushed forward through analysis of viral and host DNA sequences. What we can say is that genetic comparisons show that the origins of viruses may have predated life splitting into the three different domains. Perhaps the precursors to viruses evolved from the self-replicating molecules that dominated the RNA world. Similar to RNA, viruses are capable of self-assembling in host cells.

Viruses are now well enough understood to be used as tools in biomedical research. In a process called gene therapy, researchers use viruses to inserting genes into specific cells, offering possible treatments for diseases like cystic fibrosis. Some viruses will seek out and destroy cancer cells, while leaving healthy cells alone, allowing for a highly targeted and effective treatments.

Key Concepts and Summary

Two classes of molecules are common to every form of life on Earth: amino acids and nucleic bases. Understanding the fundamental and relatively simple structure of these molecules allows us to see the more complex patterns of biochemistry. Amino acids are the units for building proteins. While there are hundreds of possible amino acid chains, life on Earth uses only a common set of twenty and some amino acids have been detected with radio telescopes in giant molecular clouds – the star-forming regions of space.  The nucleic bases adenine, guanine, thymine and cytosine are components of DNA (and in RNA with a substitute of uracil for thymine). In addition to nucleic bases, the nucleotides in RNA and DNA contain a central sugar and a phosphate group. DNA encodes the genetic information for every organism in a collection of chromosomes.  DNA transcription errors result in mutations that occur naturally, but the mutation rate can increase with exposure to ultraviolet light or carcinogenic chemicals. All life on Earth shares common characteristic: it is carbon-based, dependent on water, has left-handed amino acids, uses ATP for energy storage and transport and has an inherited genetic code for building proteins to carry out processes in the living cell. These shared attributes are distilled into the concept of the Last Universal Common Ancestor at the root of the Tree of Life. There are three phylogenic branches on the Tree of Life: bacteria, archaea, and eukaryotes. The most complex organisms are eukaryotes, which evolved in part by the process of endosymbiosis to incorporate compartmentalized prokaryotic cells. The first cells were anaerobic – using chemical gradients for energy – and mutated into cells that used the energy of the sun for aerobic respiration. Aerobic respiration is 18 times more efficient in producing ATP than anerobic processes. Photosynthesizing cyanobacteria produce oxygen as a byproduct and are thought to be responsible for the rise of an oxygen-rich atmosphere on Earth.

 

Review Questions

Summary Questions

  1. What is an amino acid and how do amino acids form proteins?
  2. What functions do proteins have in living things?
  3. What is a nucleic acid? How do the two main nucleic acids used by life, RNA and DNA differ?
  4. What is the chemical structure of DNA?
  5. How are nucleic acids and genes related?
  6. What causes genetic mutations?
  7. What are the three domains on the tree of life and how do scientists distinguish between them?
  8. What are the main differences between prokaryotes and eukaryotes?
  9. Give an example of endosymbiosis and explain what advantages and disadvantages were gained by organisms that evolved from such situations.
  10. How did cyanobacteria cause one of Earth’s mass extinctions?
  11. What causes aerobic metabolism to be more efficient than anaerobic metabolism?

Exercises

  1. It is fairly easy to extract DNA from a strawberry using household materials. See the step-by-step instructions here. In a formal laboratory setting or on your own, follow the steps to extract DNA and take detailed notes of the challenges or successes in your endeavor. What do you observe about the characteristics of the DNA you extracted?
  2. Four examples of eukaryotic organelles that evolved from endosymbiosis include (1) choloroplasts, (2) mitochondria, (3) nitroplasts, and (4) diazoplasts. Conduct the necessary research to construct a table that compares these four examples in separate rows with three different columns: (1) the kinds of organisms that contains these organelles, (2) the approximate timeframes for when the incorporation of these organelles occurred, and (3) explanations of the function of these organelles.

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Cosmic Origins Copyright © by Debra Fischer; Allyson Sheffield; Joshua Tan; Lily Ling Zhao; and Dawn Erb is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.