CAGL0K08184g
highly similar to uniprot|P00431 Saccharomyces cerevisiae YKR066c CCP1 Mitochondrial cytochrome-c peroxidase
Element type: CDS
Element length: 1074 nucleotides,
on sense strand of
Cagl0K: 813815..814888.
Other names:
CAGL-CDS3101.1
CAGL-IPF3127
Coding sequence: 358 codons.
Element length: 1074 nucleotides,
on sense strand of
Cagl0K: 813815..814888.
Other names:
CAGL-CDS3101.1
CAGL-IPF3127
Coding sequence: 358 codons.
Database cross references:
EMBL: CR380957
GeneID: 2890345
HOGENOM: Q6FMG7
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380957
GeneID: 2890345
HOGENOM: Q6FMG7
Homologs and Orthologs
Homologs in protein families: GL3C0873 GL3C0873.N2Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR002016
InterPro: IPR002207
KEGG: cgr:CAGL0K08184g
PRINTS: PR00458
PRINTS: PR00459
PROSITE: PS00435
PROSITE: PS00436
PROSITE: PS50873
PeroxiBase: 2366
Pfam: PF00141
RefSeq: XP_448577.1
SMR: Q6FMG7
UniProtKB/Swiss-Prot: Q6FMG7
UniprotKB: CCPR_CANGA
InterPro: IPR002016
InterPro: IPR002207
KEGG: cgr:CAGL0K08184g
PRINTS: PR00458
PRINTS: PR00459
PROSITE: PS00435
PROSITE: PS00436
PROSITE: PS50873
PeroxiBase: 2366
Pfam: PF00141
RefSeq: XP_448577.1
SMR: Q6FMG7
UniProtKB/Swiss-Prot: Q6FMG7
UniprotKB: CCPR_CANGA
Sequence data 
>CAGL0K08184g.nt TAACCAGAGTAGGGTCGATGAATTGATTAGCATGAGGTGCACACACTAGGATAGTGGGTA CTCCTCGAGGAGGCACATTGTAACCACCACGGACTTTGATCTCTCTGAAGAATGTAGTTA TCACAGCGTTGAAAAGGAACACAATCAGGTCATACAAGAACGTATGAATATTATACGTAT AGCCATTGTAGTCATTAATATACTTATGCCCAACCTCAGGGACATGGGCAGGCATTGCGA ACTCAAACTATTATAGGGTGTAACAAACAAATAATAATAATTTCTGACAATCAAACTCTA TATTTCGAGCTTCTCAAGTTTATACTATTTCAATGGTAAATGAGTGTCAAACTGCCCGGT TAAGGGAAAAATACGATGTAATTTGGATTTAGTTGCAATCGATGACTTTCAAGGAAAGAC AAACTCAACGCTGTACCTTAACGGAGATGCCAAGTCGATGAATCAGAATCACTATACAAT GCTATGGAACTATAGGACGGACATATCGGGAGGCTCGAAAGTAACAAAGGGTCCATATTA GAAGTTGTCCTTTGTGTGAATAATCAACCAGGTCATTGATGGCAAGGTTCCATTAGAGGG AAGAGGAGAGGCATTATCTAACTGTTACTAAGGAACCACTGAGGCAGCAGTAATCGATAG GAGTTAGTATTTGATGATTACCTGGTGGAGAAAGTCACTATCGTGATCCAAGAGTACACT AACAGTACACAGGCAGAGAAAGAGGAGTAACCTTTGGGGAGAGCCAGATAGGTTCTAGTT CAATGATGGGTACGCACTAAAATTAGTAAGCAGTGACCACAATCCTTCGAGAAACTCTTT GGATACCTAGGATGAGTAAAACTTGGTATCCGAGAGAACGTATATAAAGCTTTTAAGGTT ATGTTATGGGGAAGTCTCTATCAGGATAAGTTTGTCTACATATTTACTAACATTGAGAAA ACTTTGTTTCATCAACAGATCAAACTTAGCTAAGGAAAAGATGTCTGCTACTGCATTGAG AATTGCTCCAATTGCCTCCAGAACATTCCAAAGAAGGTTGGGCTACTTGCTGGCTGGTGT TGCTACCGGTGCTGCTGCTACTGTTGCTTATAAGGCTCAGAAGAACAACAACTACTACAA ATACAACAACAACAATAACAACAACAGTGGCTTCAAGGCTGGTGCACTAGCTGCTGCTGC TGGTGTTGTTCACTTAGCTCATGAAGAAGACAAGAAGACTGCTGACTACCAGAAAGTCTA CAATCTGATTGCAGAGAGATTGAGAGACGACGATGAATATGACAACTATATTGGTTACGG TCCAGTATTGGTCAGATTAGCTTGGCACTCTTCTGGTACCTGGGACAAGAACGACAACAC TGGTGGTTCTTATGGTGGTACTTACAGATACAAGAAGGAAAGCCAGGATCCATCCAATGC TGGTCTAGAGAACGCAGCCAAGTTCTTGGAACCAGTAAAGAAGCAATTCCCATGGATCTC TTATGGTGACTTGTATACCTTGGGTGGTGTTGTAGGTATTCAAGAGCTTCAAGGTCCTAA GATCCCATGGAGATCAGGTAGAACCGATCTGCCAGAAGATATGACCCCAGACAATGGTAG ATTACCAGATGGTGACAAGGACGCAAACTACGTTAGAAACTTCTACAAGAGATTGGATTT TAACGACAGAGAAGTTGTCGCTCTATTGGGTGCACATGCTTTGGGTAAGACACATTTGAA GAACTCTGGTTTTGAAGGCCCATGGGGTGCTGCCAACAACATCTTCACTAATGAATTCTA CTTGAACTTGTTGAACGAGGACTGGAAACTAGAAAAGAATGATGCCGGTAACTTGCAATA TAACTCTCCAAAGGGTTACATGATGTTGCCAACCGATTACGCTTTGATCCAAGACTCAAA CTACTTGAAGATTGTCAAGGAGTACGCTGCTGACCAAGATGCTTTCTTCAGAGACTTCTC TAAGGCCTTTGCTGCTTTGTTAGAGAGAGGTATTGATTTCCCAAAGAACCAACCAGTTCA CATCTTCAAGACTTTAGATGAACAAGGATTGTAAATACGTACAGGCTTGTGAACTGACTA TTTATAACTTTTGAATTGCAAGATATTCAAAAAGTCTATGTATTTATTTTTTTTTCTTTT CTCTTATTATTTATTTATAACTCGCCATATGATGTGACAATCATTCATATTCCACAAAAC ATCCTATTTATCCTATTTAACTGAAACAGAAGATGCTCTTGGAAGTAAATGTGTGTGATT ATACCTGAAAATGGTACTACCTATACTGCTATTTTCTCGTATTTAAGTGCGATTCCCATT GTTCGGTGGTGGCCGATGACTTTCGCACTTTTGA
>CAGL0K08184g.cds ATGTCTGCTACTGCATTGAGAATTGCTCCAATTGCCTCCAGAACATTCCAAAGAAGGTTG GGCTACTTGCTGGCTGGTGTTGCTACCGGTGCTGCTGCTACTGTTGCTTATAAGGCTCAG AAGAACAACAACTACTACAAATACAACAACAACAATAACAACAACAGTGGCTTCAAGGCT GGTGCACTAGCTGCTGCTGCTGGTGTTGTTCACTTAGCTCATGAAGAAGACAAGAAGACT GCTGACTACCAGAAAGTCTACAATCTGATTGCAGAGAGATTGAGAGACGACGATGAATAT GACAACTATATTGGTTACGGTCCAGTATTGGTCAGATTAGCTTGGCACTCTTCTGGTACC TGGGACAAGAACGACAACACTGGTGGTTCTTATGGTGGTACTTACAGATACAAGAAGGAA AGCCAGGATCCATCCAATGCTGGTCTAGAGAACGCAGCCAAGTTCTTGGAACCAGTAAAG AAGCAATTCCCATGGATCTCTTATGGTGACTTGTATACCTTGGGTGGTGTTGTAGGTATT CAAGAGCTTCAAGGTCCTAAGATCCCATGGAGATCAGGTAGAACCGATCTGCCAGAAGAT ATGACCCCAGACAATGGTAGATTACCAGATGGTGACAAGGACGCAAACTACGTTAGAAAC TTCTACAAGAGATTGGATTTTAACGACAGAGAAGTTGTCGCTCTATTGGGTGCACATGCT TTGGGTAAGACACATTTGAAGAACTCTGGTTTTGAAGGCCCATGGGGTGCTGCCAACAAC ATCTTCACTAATGAATTCTACTTGAACTTGTTGAACGAGGACTGGAAACTAGAAAAGAAT GATGCCGGTAACTTGCAATATAACTCTCCAAAGGGTTACATGATGTTGCCAACCGATTAC GCTTTGATCCAAGACTCAAACTACTTGAAGATTGTCAAGGAGTACGCTGCTGACCAAGAT GCTTTCTTCAGAGACTTCTCTAAGGCCTTTGCTGCTTTGTTAGAGAGAGGTATTGATTTC CCAAAGAACCAACCAGTTCACATCTTCAAGACTTTAGATGAACAAGGATTGTAA
>CAGL0K08184g.aa MSATALRIAPIASRTFQRRLGYLLAGVATGAAATVAYKAQKNNNYYKYNNNNNNNSGFKA GALAAAAGVVHLAHEEDKKTADYQKVYNLIAERLRDDDEYDNYIGYGPVLVRLAWHSSGT WDKNDNTGGSYGGTYRYKKESQDPSNAGLENAAKFLEPVKKQFPWISYGDLYTLGGVVGI QELQGPKIPWRSGRTDLPEDMTPDNGRLPDGDKDANYVRNFYKRLDFNDREVVALLGAHA LGKTHLKNSGFEGPWGAANNIFTNEFYLNLLNEDWKLEKNDAGNLQYNSPKGYMMLPTDY ALIQDSNYLKIVKEYAADQDAFFRDFSKAFAALLERGIDFPKNQPVHIFKTLDEQGL*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0K08184p