Alignment File Formats

Jalview understands a wide range of sequence alignment formats. In order to determine which format has been used for an alignment, Jalview tries to detect some text or formatting unique to one of the formats:

Format Unique File Feature File Extension
FASTA >SequenceName
THISISASEQUENCE
.fa, .fasta
MSF !! AA_MULTIPLE_ALIGNMENT 1.0
..
//
.msf
CLUSTALW CLUSTAL .aln
PILEUP PileUp
PIR >P1; .pir
BLC >Seq1
>Seq2
.blc
PFAM SequenceName THISISASEQUENCE .pfam
Stockholm # STOCKHOLM VersionNumber
...
//
.stk, .sto
Phylip Line starts with two numbers separated by white space
...
//
.phy
EMBL Line starts with ID, followed by a space, and is followed by a 7 character identifier terminated with a semicolon. .txt
GenBank Line starts with LOCUS .gb,.gbk
JSON Data starts with '{'
Data ends with '}'

See BioJSON for more infomation about the Jalview JSON format
.json

The file extensions are used to associate Jalview alignment icons with alignment files: