Viewing DNA, RNA & Proteins in Jalview School Resources

4 easy-to-run web-based bioinformatic projects that are aimed at 15-18 year old secondary school students who are studying biology at Higher, Advanced Higher or A level.
  Project 1: single click exercise to view DNA & RNA sequences and 3D structures.
  Project 2: single click exercise to view a human myoglobin protein sequence and its 3D structure, followed by an exercise that uses a similarity tree to compare the myoglobin sequences from different animals.
  Project 3: single click exercises to view the sequences and 3D structures of proteins with a range of different biological functions.
  Project 4: exercise to view the exons and introns in the HBB gene and identify the genetic mutation on DNA that is linked to sickle cell anaemia and view its location on the 3D structure of haemoglobin protein.

Requirement: Computer with a web browser and internet access. Since JalviewJS opens inside a web browser, no software has to be downloaded.

Open the Workbook

Click the link to view the Workbook in an adjacent window (or download depending on the browser).
Next follow the instructions for each exercise.

Links to Run the Exercises

Clicking the links below will launch JalviewJS in an adjacent web browser tab and open files as appropriate.
Note: It may take a few seconds to load the sequences.

Project 1-Exercise 1: Click to  view DNA fragment
Project 1-Exercise 2: Click to  view the coding sequence of DNA & codons alongside the protein product
Project 1-Exercise 3: Click to  view an RNA molecule
Project 2-Exercise 4: Click to  view Human Myoglobin protein sequence and 3D structure
Project 2-Exercise 5: Click to view Myoglobin protein sequences from different species and create a Tree
Project 3-Exercise 6: Click to  view Haemoglobin protein (transport)
Project 2-Exercise 1: Click to  view Collagen protein (structure)
Project 2-Exercise 1: Click to  view Amylase protein (enzyme)
Project 2-Exercise 1: Click to  view Antibody protein (defence)
Project 2-Exercise 1: Click to  view Insulin protein (signalling)
Project 2-Exercise 1: Click to  view Ferritin protein (storage)
Project 2-Exercise 1: Click to  view Myoglobin protein (transport)
Project 4-Exercise 7: Click to view Exons & Introns in the DNA of the HBB  gene
Project 4-Exercise 8: Click to view the coding DNA of the HBB gene & its protein product

Advanced Exercise 1: Click to open JalviewJS desktop window in an adjacent browser window
Advanced Exercise 2: Click to view the coding DNA of the HBB gene & its protein product - reading files from the public biological databases (EMBLCDS id CAG46711)

Jalview School Videos

(i) Bioinformatics & Big Data - considers why we need bioinformatics software such as Jalview

(ii) Introducing the Jalview School Workbook Projects

(iii) What is Jalview?

Jalview is one of the most widely used applications for visualising and analysing DNA, RNA and protein alignments. It is free-to-use computer software with an interactive multi-window interface. This allows scientists to read files directly from public biological databases and view sequences, 3D structures and evolutionary trees. Jalview has a number of analysis tools for aligning sequences, measuring similarities, exploring features and comparing structures. Jalview has been developed in Prof Geoff Barton's group in School of Life Sciences the University of Dundee, with the funding support of the BBSRC and the Wellcome Trust.

Frequently Asked Questions - I’m having trouble viewing the 3D structure

Loading the 3D structure requires opening a context menu. The position of the mouse is critical when opening context menus.
Place the mouse over the name of the sequences (not the sequences themselves).
Then right click the mouse to open the context menu. This manoeuvre is demonstrated in the Jalview School Video above.


The RCSB PDB Newsletter article featuring the Jalview School Workbook

Free Public Biological Databases

(i) UniProt is a database of protein sequences (
(ii) Protein Data Bank (PDB) is a database of crystallographic, three-dimensional structural data of large biological molecules (
(iii) Ensembl is a genomic database (
(iv) EMBL (CDS) data originates from the European Nucleotide Archive (ENA) database of annotated DNA and RNA sequences (
(v) PFAM is a database of protein families is a database of protein families (
(vi) RFAM is a database containing information about non-coding RNA families and other structured RNA elements (

Database Identifiers

PDB IDs used in the workbook

PDB ID Molecule Function Link
3BSE DNA fragment


3nmm Haemoglobon Transport

1smd Amylase Enzyme

1cag Collagen Structure

1igt Antibody Defence

5xb1 Ferritin Storage

6trz Insulin Signalling

3rgk Myoglobin Transport

Uniprot Myoglobin IDs used in the workbook

Uniprot ID


P02144 Human
P02147 Gorilla
P02185 Whale
P02192 Cow
P04247 Mouse
P68276 Dolphin
P02197 Chicken
P63113 Dog
P02190 Sheep
P02187 Elephant

P02144; P02147; P02185; P02192; P04247; P68276; P02197; P85077; P63113; P02190; P02187

EmblCDs IDs used in the workbook

CAG46747 EmblCDS ID for MB gene that codes for myoglobin protein (P02144)
CAG46711 EmblCDS ID for HBB gene that codes for Hemoglobin subunit beta protein (P68871)




The workbook was produced by Suzanne Duce with help from Mungo Carstairs, Benedict Soares, Bob Hanson, Dmitry Finkelbergs, Charlotte Campbell, Jim Procter and Geoff Barton.