I have to translate the complement of a DNA sequence into amino acids
TTTCAATACTAGCATGACCAAAGTGGGAACCCCCTTACGTAGCATGACCCATATATATATATATA
TATATATATATATATGGGTCATGCTACGTAAGGGGGTTCCCACTTTGGTCATGCTAGTATTGAAA
+1 TyrIleTyrIleTyrGlySerCysTyrValArgGlyPheProLeuTrpSerCysStpTyrStp
+2 IleTyrIleTyrMetGlyHisAlaThrOc*GlyGlySerHisPheGlyHisAlaSerIleglu
+3 TyrIleTyrIleTrpValMetLeuArgLysGlyValProThrLeuValMetLeuValLeuLys
- The fist sequence is the normal sequence,
- The second one is the complementary sequence,
- The one with +1 is the amino acid sequence corresponding to my complementary sequence
- The one with +2 is the amino acid sequence corresponding to my complementary sequence starting at the second base
- The one with +3 is the amino acid sequence corresponding to my complementary sequence beginning with the third base
i have tried the next code to get my results, but so i get just a complementair seq. without split.
seq = "CCGGAAGAGCTTACTTAG"
basecomplement = {'A': 'T', 'C': 'G', 'G': 'C', 'T': 'A'}
def translate(seq):
x = 0
aaseq = []
while True:
try:
aaseq.append(basecomplement[seq[x:x+1]])
x += 1
except (IndexError, KeyError):
break
return aaseq
for frame in range(1):
#print(translate(seq[frame:]))
rseqn= (''.join(item.split('|')[0] for item in translate(seq[frame:])))
rseqn = list(rseqn)
rseqn.reverse()
print( rseqn)
can someone help me to get my results ??
Use:
this produces the correct complementary (reversed) secuence:
Note that you do not need the for loop (the current one in fact is doing nothing) to determine DNA or RNA complementary sequences, as this is independent on translation frame.
Having said that, however, I must stress that ALL your code can be simplified in four lines if you start using BioPython for your Bioinformatic tasks: