DEHA2G12166g
some similarities with uniprot|P00431 Saccharomyces cerevisiae YKR066c CCP1 mitochondrial cytochrome c peroxidase
Element type: CDS
Element length: 1965 nucleotides, on sense strand of Deha2G: 1003647..1005611.
Other names:
DEHA-CDS1329.1
DEHA-IPF3862
DEHA0G12925g
Coding sequence: 655 codons.
Database cross references:
EMBL: CR382139
GenomeReviews: CR382139_GR
Homologs and Orthologs
Homologs in protein family: GL3C4845
Orthologs: strict determination not possible; homologs must be refined manually
Protein DEHA2G12166p 
some similarities with uniprot|P00431 Saccharomyces cerevisiae YKR066c CCP1 mitochondrial cytochrome c peroxidase; RecName: Full=Putative heme-binding peroxidase; EC=1.11.1.-;
Protein domain map
Sequence data 
>DEHA2G12166g.nt ATGGACGGTAATTTTAGGAGAATGTCTGGTGAAAAAGAACCACATAGGCCACAGCCGACG GCCCTTGGGCGCTTCGAATTGATTATCAAGTTTGTATTTAGATGTGCTATTTTGCCATGG TTGCTATGGGAATTGGGGTTTCGGTCTTATTATAAGACTGTTCTTGGAGTATTTGATATG ATATTCAAGGTATTACTGTTATTGATTGCTCCGGAAAATGGTGGATTGTTTAATGGAATC AGTGGGGATAATGGGCGGTACGGTGTAAATAGTGCCGGTGGGGTTGACATAGATGGATTG TTTGACGACGTTCTCAGGGAAGGAAAAGTTAATGTAGAAAAGAAGTCTGATACATTTATT CTGAGGATCAATTCCAGGTACCGGTGGGTGATAGACGGGATTGAAAGCCAGCAAATATGG CAGTCGACCCGGGAAAATGTAGAAGAATTATACTTTATATTCGAGAAAGCGGATTACGTA TTCGATAGATATGTCAAACACTCGCCGTTTTGTGTCTTGAACAAGTTGAACCCCAATGGA AAGAAGTCCCACTATGAATATGACGGCATTCCCTTAAGTAAGTTCGAGGGCTTTGGGAGC TACCTTCTGTACAAACGCTTAGGAACTAAAACCCTCACAACCACAAAGCCGCATTTACAT CCAATACTTTCTTCTACAATGACTGCAATACAGAAACCAGTGGTAGCAAAACGCGAAGCT CCAAAAGCAGAAGTAAATCCCACCGTATCCAGACTGACCCAAACTGAAACTATCAAGCCC ACTAAGGAAAGGACTGTGAGCGTTTTTAGTCCCCCGGTGTTTAATTTTGCAGCTAACTCT TTTGCTCAGTCGGTATCCAACGAATACAAGCATGCAAGTCCCAAACGTATCAAATTAGAT AATTCAAGAGTTCAAGTCCCCAGTATAGTCAAGCCGAAGCAAGACAAACCGAGACCGCCT GCAATCGTCACCAAGCCTCGGGTCATTAATATTGAATTTCCAAACAAACAGAAGTCTGGA TTTAAACTTTTAATTAGGCCTAAACATGAACCGAGTATTAAAAAGCAGAAGCAAGGTATT GAGGTCTTATCCACCTCAAACACCAAGAGAATCACGAAATCTATATCAGTTGATGATGTA GAATACGTAGAGAAAGTTAAGCATGCAATCAAACAAGTATTACCCAAGCCCGATTATGAC GATGGGTCCTTGGGTCCTGTAATTTTGCGACTCGCATGGCATTGTTGCGCTACTTACAAT AAATTCACTGGTAATGGTGGTTCGAATGGTTCAACTATGAGATTTGTTCCTGAAATTACT GATGATGGCAACTCTGGTCTTGACATTGCACGTTCTGCACTCGAACCTATAAAACAAAAA TTCCCTGATATCACCTACTCGGATTTATGGACTCTAGCTGGTAAAATTTCTATTCAAGAA ATGGGGGGTCCGAAGATACCATGGAGATGCGGTAGAGTTGATTGCATTGACGATAGATAT GTCCCACCCAACGGCAGGTTACCATTCGCATACAAAAATGCCAACCATATTCGGGAAACA TTCGGTAGAATGGGGTTCAATGATAGAGAAACCGTCCTGTTATTGGGTGCACATGGTTTG GGAAGATGTCACAAGAGGTTCAGCGGATGGGAAGGAAAATGGACCGAAAACCCCACTTCG TTCTCTAACGACTTCTATAAGGTGTTGTTAGATGAAGAATGGAGTCTAGGAACGGTGCCG GAAACCGGAAAAGAGCAGTATTATAACAAAGACAAATCCCTAATCATGCTAAATACCGAC ATTGAGCTAATTAGAGATCCTCACTTCCTACATTTTGTTAAGCTATATAGTCAACACCAA GCGACATTCTTTCAAGATTTTGCCAACGCCTTTGGAAAGCTCTTAGAGTTGGGTATAGAA 
AGAGATTCCAACGGTAACGTCTTACCCAAGAATGAGTTCTATTGA
>DEHA2G12166g.cds ATGGACGGTAATTTTAGGAGAATGTCTGGTGAAAAAGAACCACATAGGCCACAGCCGACG GCCCTTGGGCGCTTCGAATTGATTATCAAGTTTGTATTTAGATGTGCTATTTTGCCATGG TTGCTATGGGAATTGGGGTTTCGGTCTTATTATAAGACTGTTCTTGGAGTATTTGATATG ATATTCAAGGTATTACTGTTATTGATTGCTCCGGAAAATGGTGGATTGTTTAATGGAATC AGTGGGGATAATGGGCGGTACGGTGTAAATAGTGCCGGTGGGGTTGACATAGATGGATTG TTTGACGACGTTCTCAGGGAAGGAAAAGTTAATGTAGAAAAGAAGTCTGATACATTTATT CTGAGGATCAATTCCAGGTACCGGTGGGTGATAGACGGGATTGAAAGCCAGCAAATATGG CAGTCGACCCGGGAAAATGTAGAAGAATTATACTTTATATTCGAGAAAGCGGATTACGTA TTCGATAGATATGTCAAACACTCGCCGTTTTGTGTCTTGAACAAGTTGAACCCCAATGGA AAGAAGTCCCACTATGAATATGACGGCATTCCCTTAAGTAAGTTCGAGGGCTTTGGGAGC TACCTTCTGTACAAACGCTTAGGAACTAAAACCCTCACAACCACAAAGCCGCATTTACAT CCAATACTTTCTTCTACAATGACTGCAATACAGAAACCAGTGGTAGCAAAACGCGAAGCT CCAAAAGCAGAAGTAAATCCCACCGTATCCAGACTGACCCAAACTGAAACTATCAAGCCC ACTAAGGAAAGGACTGTGAGCGTTTTTAGTCCCCCGGTGTTTAATTTTGCAGCTAACTCT TTTGCTCAGTCGGTATCCAACGAATACAAGCATGCAAGTCCCAAACGTATCAAATTAGAT AATTCAAGAGTTCAAGTCCCCAGTATAGTCAAGCCGAAGCAAGACAAACCGAGACCGCCT GCAATCGTCACCAAGCCTCGGGTCATTAATATTGAATTTCCAAACAAACAGAAGTCTGGA TTTAAACTTTTAATTAGGCCTAAACATGAACCGAGTATTAAAAAGCAGAAGCAAGGTATT GAGGTCTTATCCACCTCAAACACCAAGAGAATCACGAAATCTATATCAGTTGATGATGTA GAATACGTAGAGAAAGTTAAGCATGCAATCAAACAAGTATTACCCAAGCCCGATTATGAC GATGGGTCCTTGGGTCCTGTAATTTTGCGACTCGCATGGCATTGTTGCGCTACTTACAAT AAATTCACTGGTAATGGTGGTTCGAATGGTTCAACTATGAGATTTGTTCCTGAAATTACT GATGATGGCAACTCTGGTCTTGACATTGCACGTTCTGCACTCGAACCTATAAAACAAAAA TTCCCTGATATCACCTACTCGGATTTATGGACTCTAGCTGGTAAAATTTCTATTCAAGAA ATGGGGGGTCCGAAGATACCATGGAGATGCGGTAGAGTTGATTGCATTGACGATAGATAT GTCCCACCCAACGGCAGGTTACCATTCGCATACAAAAATGCCAACCATATTCGGGAAACA TTCGGTAGAATGGGGTTCAATGATAGAGAAACCGTCCTGTTATTGGGTGCACATGGTTTG GGAAGATGTCACAAGAGGTTCAGCGGATGGGAAGGAAAATGGACCGAAAACCCCACTTCG TTCTCTAACGACTTCTATAAGGTGTTGTTAGATGAAGAATGGAGTCTAGGAACGGTGCCG GAAACCGGAAAAGAGCAGTATTATAACAAAGACAAATCCCTAATCATGCTAAATACCGAC ATTGAGCTAATTAGAGATCCTCACTTCCTACATTTTGTTAAGCTATATAGTCAACACCAA GCGACATTCTTTCAAGATTTTGCCAACGCCTTTGGAAAGCTCTTAGAGTTGGGTATAGAA 
AGAGATTCCAACGGTAACGTCTTACCCAAGAATGAGTTCTATTGA
>DEHA2G12166g.aa MDGNFRRMSGEKEPHRPQPTALGRFELIIKFVFRCAILPWLLWELGFRSYYKTVLGVFDM IFKVLLLLIAPENGGLFNGISGDNGRYGVNSAGGVDIDGLFDDVLREGKVNVEKKSDTFI LRINSRYRWVIDGIESQQIWQSTRENVEELYFIFEKADYVFDRYVKHSPFCVLNKLNPNG KKSHYEYDGIPLSKFEGFGSYLLYKRLGTKTLTTTKPHLHPILSSTMTAIQKPVVAKREA PKAEVNPTVSRLTQTETIKPTKERTVSVFSPPVFNFAANSFAQSVSNEYKHASPKRIKLD NSRVQVPSIVKPKQDKPRPPAIVTKPRVINIEFPNKQKSGFKLLIRPKHEPSIKKQKQGI EVLSTSNTKRITKSISVDDVEYVEKVKHAIKQVLPKPDYDDGSLGPVILRLAWHCCATYN KFTGNGGSNGSTMRFVPEITDDGNSGLDIARSALEPIKQKFPDITYSDLWTLAGKISIQE MGGPKIPWRCGRVDCIDDRYVPPNGRLPFAYKNANHIRETFGRMGFNDRETVLLLGAHGL GRCHKRFSGWEGKWTENPTSFSNDFYKVLLDEEWSLGTVPETGKEQYYNKDKSLIMLNTD IELIRDPHFLHFVKLYSQHQATFFQDFANAFGKLLELGIERDSNGNVLPKNEFY*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
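As a quick consistency check of the note above, the following minimal sketch recomputes the lengths from the coordinates given in this record (Deha2G: 1003647..1005611). It assumes the coordinates are 1-based and inclusive, which matches the stated element length; the variable names are illustrative only.

```python
# Coordinates taken from this record; assumed 1-based and inclusive.
start, end = 1003647, 1005611

# Element length in nucleotides.
element_length = end - start + 1

# Number of codons, including the stop codon; remainder should be 0
# for an uninterrupted coding sequence.
codons, remainder = divmod(element_length, 3)

# The stop codon is counted as a codon but is not translated,
# so the protein is one residue shorter than the codon count.
protein_length = codons - 1

print(element_length, codons, remainder, protein_length)
# prints: 1965 655 0 654
```

The result agrees with the values listed for this element: 1965 nucleotides, 655 codons, and a 654-residue translation followed by the stop symbol (`*`).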
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/DEHA/DEHA2G12166p