CE PROGRAMME PERMET L'ACCES PAR MNEMONIQUE AUX SEQUENCES, COMPOSITIONS, DES PROTEINES CONTENUES DANS LA BANQUE NBRF P R O T E I N S E Q U E N C E D A T A B A S E of the Protein Identification Resource (PIR) Supported by the Division of Research Resources of the NIH Release 2.1, August 1984 2784 sequences, 557759 amino acids W.C. Barker, L.T. Hunt, B.C. Orcutt, D.G. George, L.S. Yeh, H.R. Chen, M.C. Blomquist, G.C. Johnson, E.I. Seibel-Ross, and R.S. Ledley ------------------------- Table 1 One- and Three-letter Amino Acid Abbreviations A Ala Alanine C Cys Cysteine D Asp Aspartic acid E Glu Glutamic acid F Phe Phenylalanine G Gly Glycine H His Histidine I Ile Isoleucine K Lys Lysine L Leu Leucine M Met Methionine N Asn Asparagine P Pro Proline Q Gln Glutamine R Arg Arginine S Ser Serine T Thr Threonine V Val Valine W Trp Tryptophan Y Tyr Tyrosine B Asx Asp or Asn, not distinguished Z Glx Glu or Gln, not distinguished X X Undetermined or atypical amino acid These abbreviations conform to those suggested by the IUPAC-IUB Commission on Biochemical Nomenclature, J. Biol. Chem. 243, 3557-3559, 1968. Table 2 Punctuation in Protein Sequences A blank between two adjacent amino acids indicates that they are connected, as determined experimentally. () Encloses a region, the composition but not the complete sequence of which has been determined experimentally, or encloses a single residue that has been tentatively identified. = Indicates )(, the juxtaposition of two regions of indeterminate sequence, while preserving proper spacing between residues. / Indicates that the adjacent amino acids are from different peptides, not necessarily connected. When the amino end of a protein has not been determined, / precedes the first residue. When the carboxyl end has not been determined, / follows the last residue. When )/, /(, or )/( are needed, only / is used. . Outside of parentheses, indicates the ends of sequenced fragments. The relative order of these fragments was not determined experimentally but is clear from homology or other indirect evidence. . Within parentheses, indicates that the amino acid to its left has been placed with at least 90% confidence by homology with known sequences. , Indicates that the amino acid to its left could not be positioned with confidence by homology. If the structure of related proteins is not known, the position of the amino acids within parentheses is arbitrary. -------------------------------------