Main Content

aa2int

Convert amino acid sequence from letter to integer representation

    Description

    SeqInt = aa2int(SeqChar) converts a character vector or string containing single-letter codes specifying an amino acid sequence to a row vector of integers specifying the same amino acid sequence. For valid letter codes, see Mapping Amino Acid Letter Codes to Integers.

    example

    SeqInt = aa2int(SeqChar,Unknown=unknownAA) specifies the number used to represent an unknown amino acid.

    Examples

    collapse all

    Create a random amino acid sequence.

    SeqChar = randseq(20,Alphabet="amino")
    SeqChar = 
    'TYNYMRQLVVDVVITNHYSV'
    

    Convert the sequence from letter to integer representation.

    SeqInt = aa2int(SeqChar)
    SeqInt = 1×20 uint8 row vector
    
       17   19    3   19   13    2    6   11   20   20    4   20   20   10   17    3    9   19   16   20
    
    

    Input Arguments

    collapse all

    Amino acid sequence, specified as one of the following:

    • Character vector or string scalar containing single-letter codes specifying an amino acid sequence. For valid letter codes, see . Unknown characters are mapped to 0. Integers are arbitrarily assigned to IUB/IUPAC letters.

    • MATLAB® structure containing a Sequence field that contains an amino acid sequence, such as the output returned by fastaread, getgenpept, genpeptread, getpdb, and pdbread.

    Number representing an unknown amino acid character, specified as a numeric scalar.

    More About

    collapse all

    Version History

    Introduced before R2006a

    See Also

    Functions