why nucleotides is more than 4?
3 vues (au cours des 30 derniers jours)
Afficher commentaires plus anciens
hi as I know the no. of nucleotides is 4 letters. why in matlab consider it 17 letters as in table here:
thanks
0 commentaires
Réponse acceptée
Walter Roberson
le 16 Nov 2011
The table there looks pretty straight-forward to me: http://www.mathworks.com/help/toolbox/bioinfo/ref/int2nt.html#bp_rekb-1 . It has codes for situations in which particular sets of nucleotides are known to be present or known to be absent.
Besides, the number of known nucleotides is not 4: it is currently 8. The 7th and 8th were announced in July 2011, with the 5th and 6th having been announced in April 2005.
Plus de réponses (1)
Lucio Cetto
le 19 Nov 2011
Ambiguous nucleotide symbols are used to characterize sequences that can have variations. It was introduced in the 80's and they are useful nowadays in certain cases, for example describing restriction enzymes. (e.g. http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html). In my personal opinion I think that there are other situations in which we have better options, such as sequence motifs, sequence profiles and the more elaborated profile HMMs. If you plan to convert to aa, Matlab can actually use also ambiguous aa codes when possible, although this is no longer a standard practice; most people now uses only ACGT.
0 commentaires
Voir également
Catégories
En savoir plus sur Genomics and Next Generation Sequencing dans Help Center et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!