Protein Synthesis — From DNA to Protein

Understand protein synthesis with clear diagrams, real-world examples, and exam tips.

CBSE NEET 13 min read

Protein synthesis is the process by which cells build proteins — the molecular machines that run every function in your body. From the enzymes that digest your food to the haemoglobin carrying oxygen in your blood, every protein starts as instructions encoded in DNA.

The central message: DNA → RNA → Protein. This is the central dogma of molecular biology, and protein synthesis is how this information transfer actually happens. We’ll break this down into two major stages — transcription (DNA to RNA) and translation (RNA to protein).

Understanding protein synthesis deeply is non-negotiable for NEET. It appears every year, often as complex MCQs testing the sequence of events, the role of different RNA types, and molecular details like codons, anticodons, and the peptidyl transferase reaction.

Key Terms and Definitions

Gene: A specific segment of DNA that contains the instructions to make one protein (or functional RNA). Humans have about 20,000–25,000 protein-coding genes.

Transcription: The process of copying a portion of DNA into messenger RNA (mRNA). Happens in the nucleus (in eukaryotes).

Translation: The process of reading the mRNA sequence and assembling the corresponding amino acid chain at ribosomes. Happens in the cytoplasm.

Codon: A sequence of three nucleotides on mRNA that specifies one amino acid (or a start/stop signal). There are 64 possible codons.

Anticodon: The complementary three-nucleotide sequence on tRNA that pairs with a codon on mRNA.

Ribosome: The molecular machine where translation occurs. Made of rRNA + proteins, with two subunits (large and small).

Template strand: The DNA strand that is read by RNA polymerase during transcription (read 3’→5’, RNA is synthesized 5’→3’).

Coding strand: The non-template DNA strand with the same sequence as the mRNA (except T is replaced by U).

Stage 1: Transcription — Making mRNA from DNA

What happens

RNA polymerase binds to a specific DNA sequence called the promoter and unwinds the double helix. It reads the template strand (3’→5’) and synthesizes a complementary mRNA strand (5’→3’), using ribonucleotides (A, U, G, C — note: uracil replaces thymine).

When RNA polymerase reaches a terminator sequence, transcription stops and the mRNA is released.

In eukaryotes: mRNA processing

The initial transcript (pre-mRNA) must be processed before it can be translated:

  1. 5’ capping: A modified guanosine cap is added to the 5’ end. Protects mRNA and helps ribosome binding.
  2. 3’ poly-A tail: ~200 adenine nucleotides added to the 3’ end. Increases mRNA stability and aids nuclear export.
  3. Splicing: Non-coding sequences called introns are removed; exons are joined together. Done by a complex called the spliceosome.

NEET frequently asks about introns and exons. Remember: Introns are In the way (removed), Exons are EXpressed (kept). In prokaryotes, there are no introns — this is a key difference tested in comparative biology questions.

In prokaryotes

Prokaryotes have no nucleus, so transcription and translation happen simultaneously — ribosomes begin translating the mRNA even before transcription is complete. This is called coupled transcription-translation and is a classic topic in NEET.

DNA template strand (3’→5’) → RNA polymerase → mRNA (5’→3’)

Base pairing rules:

  • A (DNA) → U (mRNA)
  • T (DNA) → A (mRNA)
  • G (DNA) → C (mRNA)
  • C (DNA) → G (mRNA)

Stage 2: Translation — Making Protein from mRNA

The players

Three types of RNA work together:

  • mRNA: carries the genetic message (sequence of codons)
  • tRNA: adapter molecule; each tRNA has an anticodon and carries the corresponding amino acid
  • rRNA: structural and catalytic component of ribosomes

The genetic code

The genetic code is the set of rules mapping codons to amino acids. Key properties:

  • Universal: nearly all organisms use the same code
  • Degenerate (redundant): most amino acids are coded by more than one codon (e.g., leucine has 6 codons)
  • Non-overlapping: each nucleotide belongs to only one codon
  • Non-punctuated: no “comma” between codons — reading frame is continuous

AUG is the start codon — it codes for methionine and sets the reading frame. UAA, UAG, UGA are stop codons — they don’t code for any amino acid.

The steps of translation

Initiation: The small ribosomal subunit binds the mRNA at the 5’ cap. It scans until it finds the start codon AUG. The initiator tRNA (carrying methionine) pairs with AUG at the P site. The large subunit joins, forming the complete ribosome. The A site is now empty and ready.

Elongation: This is the repeating cycle:

  1. A charged tRNA enters the A site, its anticodon pairs with the mRNA codon
  2. Peptidyl transferase (an rRNA enzyme in the large subunit) catalyzes the peptide bond — transfers the growing chain from the P site tRNA to the A site tRNA
  3. Translocation: ribosome moves 3 nucleotides along mRNA (5’→3’). The A site tRNA with the growing chain shifts to P site; the empty tRNA at P site shifts to E site (exit) and leaves.

Termination: When a stop codon (UAA, UAG, or UGA) enters the A site, there is no tRNA with a matching anticodon. Instead, release factors bind and trigger hydrolysis of the peptide chain. The ribosome dissociates into its two subunits, releasing the completed polypeptide.

A site = Aminoacyl site (incoming tRNA with new amino acid) P site = Peptidyl site (tRNA carrying growing chain) E site = Exit site (empty tRNA leaving)

Reading direction: mRNA moves 5’→3’ through the ribosome

Post-translational modifications

The raw polypeptide chain often needs further modification:

  • Folding: assisted by molecular chaperones
  • Cleavage: signal peptides removed
  • Chemical modifications: phosphorylation, glycosylation, etc.
  • Quaternary structure: multiple subunits associate (e.g., haemoglobin = 4 subunits)

Solved Examples

Example 1 (CBSE Level)

Q: If a DNA template strand has the sequence 3’-TACGAATTCG-5’, what will be the mRNA sequence and the first amino acid coded?

Solution:

  • mRNA (complementary, 5’→3’): 5’-AUGCUUAGC-3’
  • First codon: AUG → Methionine (start codon)
  • Second codon: CUU → Leucine
  • Third codon: AGC → Serine

Example 2 (NEET Level)

Q: A mutation changes the codon UCG to UCA. What effect does this have on the protein?

Solution: Both UCG and UCA code for Serine. This is a silent mutation (synonymous substitution) — the amino acid sequence is unchanged because the genetic code is degenerate. The protein remains the same. This is a common NEET MCQ testing understanding of code degeneracy.

Example 3 (NEET Advanced)

Q: In a cell-free translation system, if ribosomes are treated with an antibiotic that specifically inhibits peptidyl transferase activity, what would happen?

Solution: Peptidyl transferase catalyzes peptide bond formation (the elongation step). Inhibiting it would block polypeptide elongation — the first amino acid (methionine) would be attached to the initiator tRNA, initiation would be complete, but no peptide bond would form. The ribosome would stall at the first elongation step. This is how antibiotics like chloramphenicol work (they inhibit the 50S subunit’s peptidyl transferase center).

Exam-Specific Tips

NEET Weightage: Protein synthesis (Molecular Basis of Inheritance chapter) typically contributes 3-5 questions per year. This is one of the highest-weightage chapters in Class 12 Biology. Topics most frequently tested: role of AUG, differences between prokaryotic and eukaryotic translation, mRNA processing in eukaryotes, the wobble hypothesis, and inhibitors of translation.

CBSE Board Exam: For 5-mark questions, always draw the ribosome with A, P, and E sites labeled. Describe initiation, elongation, and termination as three distinct stages. The NCERT diagram of polysomes (multiple ribosomes on one mRNA) is worth memorizing.

For NEET MCQs: The most confusion occurs around:

  1. Whether it’s the template or coding strand that matches mRNA sequence
  2. Direction of synthesis (always 5’→3’ for mRNA)
  3. Where transcription vs translation occurs in eukaryotes

Common Mistakes to Avoid

Mistake 1: Confusing template and coding strands. The mRNA sequence matches the coding strand (except U instead of T). The template strand is the mirror image. When given a DNA sequence, identify which strand is being read before writing the mRNA.

Mistake 2: Saying RNA polymerase “unwinds” the DNA and doesn’t need a primer. True — RNA polymerase doesn’t need a primer (unlike DNA polymerase). But it does need to recognize a promoter sequence. Many students write that RNA polymerase works exactly like DNA polymerase.

Mistake 3: Thinking the P site holds the incoming tRNA. It’s the opposite — the A (aminoacyl) site receives the incoming charged tRNA; the P (peptidyl) site holds the growing chain.

Mistake 4: Writing that stop codons are recognized by tRNA. They are not — stop codons are recognized by release factors (proteins), not by tRNA. No tRNA has an anticodon complementary to UAA, UAG, or UGA.

Mistake 5: Confusing prokaryotic and eukaryotic ribosomes. Prokaryotes: 70S (50S + 30S subunits). Eukaryotes: 80S (60S + 40S subunits). The ‘S’ stands for Svedberg units (measure of sedimentation rate, not additive). This is tested almost every year in NEET.

Practice Questions

Q1: What is the anticodon for the mRNA codon 5’-GUC-3’?

The anticodon on tRNA is complementary and antiparallel to the codon. For codon 5’-GUC-3’, the anticodon is 3’-CAG-5’, or written 5’→3’: GAC.

Q2: A gene has 900 base pairs. How many amino acids will the final protein contain (approximately)?

900 base pairs = 900 nucleotides on each strand. The mRNA will be ~900 nucleotides long. Each codon = 3 nucleotides. So maximum codons = 300. But we subtract 1 for the stop codon (which doesn’t code for an amino acid). Also, in eukaryotes, introns are removed, so the actual protein is likely shorter. For the purpose of this calculation: approximately 299 amino acids (300 codons − 1 stop codon). If the question specifies it’s a prokaryotic gene, the answer is 299.

Q3: Why does mRNA need a 5’ cap and poly-A tail in eukaryotes but not in prokaryotes?

In eukaryotes, mRNA must travel from the nucleus to the cytoplasm after transcription. The 5’ cap and poly-A tail protect the mRNA from degradation by exonucleases during this journey and also help in ribosome recognition. In prokaryotes, there is no nucleus — transcription and translation are coupled, so mRNA is immediately translated without needing protective modifications.

Q4: If a single base substitution converts AUG to GUG in the middle of an mRNA, what happens?

AUG (methionine) → GUG (valine). This is a missense mutation — the amino acid at that position changes from methionine to valine. Whether this affects protein function depends on the location: if it’s in the active site of an enzyme, the protein may lose function; if it’s in a less critical region, the protein may still work (conservative mutation if the chemical properties are similar). Unlike a nonsense mutation, protein synthesis doesn’t stop prematurely.

Q5: What is the role of aminoacyl-tRNA synthetase?

Aminoacyl-tRNA synthetase (also called “charging enzyme”) catalyzes the attachment of the correct amino acid to its corresponding tRNA. This is called charging the tRNA. There is one aminoacyl-tRNA synthetase for each of the 20 amino acids. This step requires ATP energy and is critical — if the wrong amino acid is attached to a tRNA, it will be incorporated into the protein incorrectly. This is the real quality control step in translation.

Q6: What happens at polysomes?

A polysome (polyribosome) is a structure where multiple ribosomes are simultaneously translating the same mRNA molecule. Once the first ribosome moves forward a few codons from AUG, a second ribosome can attach and begin translation. This allows one mRNA molecule to be translated by 10–100 ribosomes simultaneously, massively increasing protein production efficiency. Polysomes are visible in electron micrographs as “beads on a string.”

Q7: Which antibiotic inhibits 50S subunit and what does it block?

Chloramphenicol inhibits the 50S subunit of prokaryotic ribosomes by blocking peptidyl transferase activity. This halts peptide bond formation and stops translation elongation. It’s selective for prokaryotes (70S ribosomes) and doesn’t affect eukaryotic ribosomes (80S), which is why it can be used as an antibiotic. However, mitochondria have 70S ribosomes, which is why chloramphenicol can have mitochondrial side effects at high doses.

Q8: Distinguish between transcription and replication in terms of the enzyme used and primer requirement.

Replication: Enzyme = DNA polymerase III (prokaryotes); requires a primer (short RNA sequence) to begin synthesis. Synthesizes DNA using DNA as template.

Transcription: Enzyme = RNA polymerase; does not require a primer. Can start synthesis de novo (from scratch). Synthesizes RNA using DNA as template.

Both proceed 5’→3’ for the new strand and read the template 3’→5’.

FAQs

Q: What is the difference between the coding strand and the template strand?

The template strand is the one actually read by RNA polymerase during transcription. The coding strand (also called sense strand or non-template strand) has the same sequence as the mRNA (with T instead of U). In your NCERT textbook, the coding strand is often written on top and the template strand below — the mRNA sequence matches the coding strand.

Q: Why is the genetic code described as degenerate?

Degenerate means multiple codons specify the same amino acid. For example, UUU and UUC both code for phenylalanine. This redundancy is protective — a mutation in the third base of a codon often doesn’t change the amino acid (third base wobble). Only methionine (AUG) and tryptophan (UGG) have single codons.

Q: What is the wobble hypothesis?

Proposed by Francis Crick, the wobble hypothesis states that the pairing between the third position of a codon and the first position of an anticodon is less strict (more flexible) than at the other positions. A single tRNA can recognize multiple codons that differ only in the third base. This reduces the number of tRNA molecules needed and explains how 61 sense codons can be read by fewer than 61 distinct tRNAs.

Q: Does translation occur in the nucleus?

In eukaryotes, translation occurs in the cytoplasm — either on free ribosomes (for proteins that stay in the cytoplasm) or on ribosomes attached to the rough endoplasmic reticulum (for proteins destined for secretion or membrane insertion). Transcription occurs in the nucleus.

Q: What is a reading frame and why does it matter?

A reading frame is the way nucleotides are grouped into triplet codons starting from a specific point. Each mRNA has three possible reading frames, but only one is the correct one (established by the AUG start codon). A frameshift mutation (insertion or deletion of 1-2 nucleotides) shifts the reading frame, completely changing all downstream codons — often disastrous for protein function.

Q: What is the significance of AUG?

AUG is special for two reasons. First, it’s the universal start codon — translation begins at AUG. Second, it codes for methionine. So the first amino acid in every newly synthesized protein is methionine (though it’s often removed later by post-translational processing). In prokaryotes, the initiating methionine is a modified form called N-formylmethionine (fMet).

Practice Questions