What does a valid mutation site look like? (original) (raw)

The general format of a mutation code is:gene:[ref]coordinates[alt] where gene is a gene code (or nuc for the genomic nucleotide sequence), ref is the nucleotide or amino acids in the reference, alt is the specific nucleotide or amino acid for the mutatant. Either of ref or alt can be missing if no specific state is required.

Rules can either specify [min|max]_[ref|alt|ambig|oth] OR the call required at a mutation e.g. "N:S235F": (not )[ref|alt|ambig|oth]

Valid Mutation Definitions

The following are valid ways to describe variants of each type. We prefer the definition at the top of each list, but provide alternatives for backwards compatibility.

SNP:

Amino acid mutation:

Deletion:

Insertion (currently parsed but not typed):

Examples

"nuc:C3037T"
"snp:A27259C"
"orf1ab:P4715L"
"n:RG203KR"
"del:28362:9"
"orf1ab:SGF3675-"