Visualising DNA, RNA & Proteins Jalview School Workbook Resources

Visualising DNA, RNA and proteins workbook is aimed at 16-18 year old secondary school students studying Higher, Advanced Higher or A level biology.
The easy-to-use design requires only a desktop or laptop computer with a web browser and internet access.
The workbook contains 4 practical web-based bioinformatic projects:
  Project 1 views DNA and RNA sequences and their 3D structures.
  Project 2 views protein sequences and their 3D structures.
  Project 3 compares the sequence of human myoglobin protein with myoglobin sequences from other animals.
  Project 4 views exons and introns in the human haemoglobin subunit beta (HBB) gene. This is the gene involved in sickle cell anaemia, an inherited red blood cell disorder which affects millions of people throughout the world. The genetic mutations linked to this disease is identified by viewing the coding DNA and its protein product in a split-screen viewer.

Getting started:
(i) Click on the ‘View the School Workbook’ link in Section 1 to open the booklet in an adjacent window, or download the pdf depending on the browser.
(ii) Follow the instructions in the workbook, the links in Section 2 start each exercise by launching JalviewJS and loading the sequences.
If you get stuck, the videos in section (b) may help.

(1) Click to View the School Workbook in an adjacent web browser tab

Recent update: 12th March 2021

(2) Follow the Instructions in the School Workbook

Use the links below to open JalviewJS in the web browser and view the molecules.
Note it may take a few seconds to open JalviewJS and load the sequences.

Project 1-Exercise 1: Click to  View DNA fragment (3BSE) in JalviewJS
Project 1-Exercise 2: Click to  View Codons in JalviewJS with CAG46747 from EMBLCDS database
Project 1-Exercise 3: Click to  View RNA (2GIS) in JalviewJS
Project 2-Exercise 4: Click to  View Myoglobin protein (3rgk - transport)
Project 2-Exercise 1: Click to  View Collagen protein (1cag - structure)
Project 2-Exercise 1: Click to  View Amylase protein (1smd - enzyme)
Project 2-Exercise 1: Click to  View Antibody protein (1igt - storage)
Project 2-Exercise 1: Click to  View Insulin protein (1trz - signalling)
Project 2-Exercise 1: Click to  View Ferritin protein (5xb1 - storage)
Project 3-Exercise 5: Click to View Myoglobin from different species and create a Tree
Project 4-Exercise 6: Click to View Exons & Introns
Project 4-Exercise 7: Click to View the coding DNA & its protein product with CAG46711 from EMBLCDS database

(3) Frequently Asked Questions

I am having trouble viewing the 3D structure in Project 2?
Loading the 3D structure requires opening a context menu, with context menus the position of the mouse is significant. Place the mouse over the names of the sequences and then right click the mouse, and the required context menu should menu appears.
This manoeuvre is demonstrated in the Jalview School Video 3 (


(A) School Workbook Videos

School Video 1: Bioinformatics and Big Data - 1 min 45 sec background video that considers why we need bioinformatics software such as Jalview

School Video 2: Introducing Jalview School Workbook - 2 min introduction video about Jalview and the 4 school workbook projects

School Video 3: HELP running the Jalview School Workbook Projects - 5 min FAQ video provides background information to get up and running

(B) Read the RCSB PDB Newsletter article about the Jalview School Workbook

(C) What is Jalview?

Jalview is one of the most widely used applications for visualising and analysing DNA, RNA and protein multiple sequence alignments. It is free-to-use computer software with an interactive multi-window interface. This allows researchers to read files directly from public biological databases and view sequences, 3D structures and evolutionary trees. Jalview has a number of analysis tools for aligning sequences, measuring similarities, exploring features and comparing structures. Jalview has been developed in Prof Geoff Barton's group in School of Life Sciences the University of Dundee, with the funding support of the BBSRC and the Wellcome Trust.

Click to view the Getting Started using Jalview videos.
Click to view all the Jalview training videos.
These videos are located on the Jalview Online Training YouTube channel.

(D) Database Identifiers and Links

PDB IDs used in Project 1 & 2

PDB ID Molecule Function Link
3DSE DNA fragment


1smd Amylase Enzyme

1cag Collagen Structure

1igt Antibody Defence

5xb1 Ferritin Storage

6bcx Insulin Signalling

1mbn Myoglobin Transport

Uniprot Myoglobin IDs used in Project 3

Uniprot ID


P02144 Human
P02147 Gorilla
P02185 Whale
P02192 Cow
P04247 Mouse
P68276 Dolphin
P02197 Chicken
P63113 Dog
P02190 Sheep
P02187 Elephant

P02144; P02147; P02185; P02192; P04247; P68276; P02197; P85077; P63113; P02190; P02187

DNA Identifier Codes used in project 3 & 4

CAG46747 EmblCDS ID for MB gene that codes for myoglobin protein (P02144)
CAG46711 EmblCDS ID for HBB gene that codes for Hemoglobin subunit beta protein (P68871)

Other links

Free Public Biological Databases


(i) UniProt is a database of protein sequences (
(ii) Protein Data Bank (PDB) is a database of crystallographic, three-dimensional structural data of large biological molecules (
(iii) Ensembl is a genomic database (
(iv) EMBL (CDS) data originates from the European Nucleotide Archive (ENA) database of annotated DNA and RNA sequences (
(v) PFAM is a database of protein families is a database of protein families (
(vi) RFAM is a database containing information about non-coding RNA families and other structured RNA elements (

(E) Other Resources

Project 1 Walkthrough Video (from 2020)


Project 3 & 4 Walkthrough Video (from 2020)

Blackboard Collaborate Workshop Instructions


(F) Acknowledgements

The workbook was produced by Suzanne Duce with help from Mungo Carstairs, Benedict Soares, Bob Hanson, Dmitry Finkelbergs, Charlotte Campbell, Jim Procter and Geoff Barton.