Element type: CDS
Element length: 3381 nucleotides,
on sense strand of
Cagl0H: 604422..607802.
Other names:
CAGL-CDS0292.1
CAGL-IPF334
Coding sequence: 1127 codons.
Element length: 3381 nucleotides,
on sense strand of
Cagl0H: 604422..607802.
Other names:
CAGL-CDS0292.1
CAGL-IPF334
Coding sequence: 1127 codons.
Database cross references:
EMBL: CR380954
GeneID: 2888691
GenomeReviews: CR380954_GR
HOGENOM: HBG398150
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380954
GeneID: 2888691
GenomeReviews: CR380954_GR
HOGENOM: HBG398150
Homologs and Orthologs
Homologs in protein family: GL3M4588Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0H06193p 
similar to uniprot|P31380 Saccharomyces cerevisiae YAL019w FUN30; SubName: Full=Similar to uniprot|P31380 Saccharomyces cerevisiae YAL019w FUN30;
Protein domain map
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR003892
InterPro: IPR014001
InterPro: IPR014021
KEGG: cgr:CAGL0H06193g
PROSITE: PS51140
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_447064.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6FRT0
UniProtKB: Q6FRT0_CANGA
Phylogeny
PhylomeDB:CAGL0H06193g
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR003892
InterPro: IPR014001
InterPro: IPR014021
KEGG: cgr:CAGL0H06193g
PROSITE: PS51140
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_447064.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6FRT0
UniProtKB: Q6FRT0_CANGA
Phylogeny 
PhylomeDB:CAGL0H06193gSequence data 
>CAGL0H06193g.nt ATGTCCGGAAACGAAGGCACTGAGGGCCAGCCGGGCAACAATCTATCTCCTGTTCAGTAC AAAACTGTATCTTCTTCTCCATTGAAACCCGAAGCGAGTACTCAGGATGCACAGAAATTA CGCGAACAGTTTGTTTTCAAGCCCAATAATAATCTCTCTAGCGCCATCACGAGTACAGGT CAAACAGCGTCATCTGCCACAAGTGGTACCGCAGCTCCTGTAAATCCAAATGCATTTGAA TCGTTGGTGGCTGAATTTCCAGACTTTTCGCAGACGCTAGTGCAAGCAGTCTACAAATCT AACTCTTTTGATTTGTCTCTTGCTAGAGAAAGGCTTACAAGAATTCGGAACCAGAGAAAG AACTGGGTCAGCTCAAAGAATTTAAACTCAAATGTTGCTCGTGGAAGCAGGGCTGCTAGT ACTGGTACGGGGAGATTAAGTGGCATTAATAATACATCTGACTCTTCAAAAATTGTGTTG GAAAAACAGAAGAAATCTATCTTCGATAGATATTCGAGCGCTGTAAATAGTAATGCACAG AACAGTGGAATCAACTCTGAGCTGTTTGAAAAAATGCAACGTAACGGAAGTCATAAGAAG AGAAAGCTTGTCAGAGGCGATAAATTGGCTGGTGATGAGAGTTATGTAGATAGCTCAAAT TCTCAGTTAGCTAAAGCAAAGCAACAGCTACTGAAAAGTCGTAATAAAAAGTTCAATCTT TCTGATGATGAAGAGGAAGTAGGCAATGATAATGGAGACTTTTCAGACGCTAGTGGAGAT GAATACGAGGAGCATACTCCTGTGGCTAATATTGATGATCAGATCTTGGAATTTTTGAAT AACTCTGAAATAGATGATATTATGGATTTGGGTGAAGTTTCTTATGAAAGAGCTAAAATA ATTCAATCCGCAAGACCCTTCACTTCCTTGTATTCGTTTTCTCAACAAGAATTTTTGACA GAAGAAGAGAAGGAAAAGCAGGCTAAACTTCTAGCTCAGCCTAAGAAGAGGGGTAGAAGA GGAGCTGCCCAAAGAAAGGAAGGTGAGAAAGTTCTAGATAGAGTTTCACAAGCTATGAGA GGTTATAATGCCATTGAATCTGTCATCAAGACATGTTCACAATATGGTAAACTAGTCTCC TCTCAAATAAAAAAATGGGGTGTCAAAGTAGACGCGGAAAATGGTGATGATGGAGAGCTT GATTTTATTAATGTTGGTTCTGAAACTGAAGGTGACGCCATTGTTGAGGATTTTAGCTCG TCCGCTCAAACTCCAGCACCCGAAATGTCAGATAGCGAAGGCTCAGAAGAAATTAAGCTT GAAGATGTCAATAAGAAACTCGAGAGCACTACTGATAATAATAACAAGGACGATGGAGCC ACACTTGAAACTGCTTCAGAACCTAAGCTTGACACTATGGAAGAGAAGGATAAAACCACT GACACTTCTGTTGAAAAAACTGAAGAAAATACGGCTAATGTCAAGGTAGAAGACACAGAC AAAGCAAATGACGATCCAGATAGCGATTTTCAGGCTGATGAAGAAATGGAATTTATTGAT GAGGATGAGGAAGATGAGGAATACGATGAAGAACTTCCGGTAACAAGAAGGACTAGGAAT ACTGGCGGGAATTTCAGAAAGGAAGTTGTGAAAAAGAATAGTGTTGTTAAATTTTTCAGA GGGAGACCAAAACTTCTAAACCCTGATGTTTCATTGAAAGACTATCAACAGACTGGTATC AACTGGCTAAACTTATTATATCACAATCAAATTTCTTGTATATTGGCTGATGATATGGGT CTGGGAAAGACATGTCAAGTCATATCTTTCCTAGCTTATTTGAAACAAATCGGTCAGCCA AGTCCGCATTTGATTGTGGTTCCTTCTTCGACGTTAGAAAACTGGTTGAGAGAATTCCAA AAATTTTGTCCTTCATTAAAAATTGAGCCATATTACGGTACTCAACAAGAAAGAGCTGAT CTTCGAGAGATATTAGAGCGTAATGATGGGAAATATGATGTTATTGTCACAACATATAAC CTTGCTGCAGGTAACAAGTACGATGTATCATTTTTGAAGACCAGAAATTTCAATGTAGTT GTATATGATGAAGGCCATATGTTAAAGAACTCAATGTCTGAGAGATTTAATAAGCTGATG AGAATTCATGCTAATTTCAGGCTGCTTCTAACGGGTACACCATTGCAAAATAACTTAAAG GAGCTAATGTCTTTGTTAGAGTTTATTATGCCGAATCTTTTTGTATCCAAAAAGGAGTCC TTAGCTGCAGTTTTCAAACAAAGGGCGAAGACATCAGATGATAACAAGGGACACAACCCT CTGTTGGCTCAACAAGCAATCACCAGAGCGAAGACCATGATGAAACCATTTATTTTACGT AGAAGGAAGGATCAAGTGTTGAAGCACTTGCCTGCAAAGCATGTCCGTACCAGTTACTGT GCAATGAATGACACTCAGAGGGAGATATACAACCGTGAAGTGAAGCTTGTAATGGAGCAC AAGCAAATGATCAGGGATGGCACATTACCTGAAGACAAGAAAGAGCGTAGTAAGATTGAG AACAACAGCTCAAAGAATTTGATCATGTCTCTAAGAAAAGCATCTATTCATCCATTATTG TTCCGTCACATCTATGATGACGCGAAGATAGACAAGATGTGTGATGCGATTTTGGATGAG CCAGCTTACGCGGAGAATGGTAATAAGGAGTATATCAGGGAGGATATGAGTTTTATGACT GATTTTGAGTTGCACAGGCTATGTTGCAACTTCCCTAACACACTAGGTGATTATCAGCTG AAAAACGATGAATGGATGAATAGTGGTAAAGTGGATGCTCTAAAGAAGTTGCTAGATGAT ATCATCAACAAGAAACGTGAGAAAGTGTTGATCTTCACACTGTTCACGCAGGTGTTAGAT ATCCTGGAGAAAGTATTGAGCACCTTGAACTATAAATTTCTGAGACTCGATGGGTCCACC CAGGTGAACGACAGACAAACGATGATCGACAAGTTCTACGACGATAACACCATCCCCATC TTCATGCTATCTACCAGAGCAGGTGGGTTTGGTATCAATCTGGTGTGTGCGAACCATGTG ATCATCTTCGACCAAAGTTTCAACCCACACGACGATAGACAGGCGGCCGACAGAGCGCAC CGTGTGGGCCAGACCAAGGAAGTCACAGTGACCACGCTGATTACCAAGGACAGCATAGAG GAGAAGATCTTCCAGCTGGCGAAGACCAAGCTGGCGCTGGACAGTCAAGTCAGCAGCAGC GAGGACCAAAGCGATCTGATAGACAACAAGGTCAGCGATCTCCTGGAAGACATCATATAC ACAGAAGCTCAAAAGAAGTAA
>CAGL0H06193g.cds ATGTCCGGAAACGAAGGCACTGAGGGCCAGCCGGGCAACAATCTATCTCCTGTTCAGTAC AAAACTGTATCTTCTTCTCCATTGAAACCCGAAGCGAGTACTCAGGATGCACAGAAATTA CGCGAACAGTTTGTTTTCAAGCCCAATAATAATCTCTCTAGCGCCATCACGAGTACAGGT CAAACAGCGTCATCTGCCACAAGTGGTACCGCAGCTCCTGTAAATCCAAATGCATTTGAA TCGTTGGTGGCTGAATTTCCAGACTTTTCGCAGACGCTAGTGCAAGCAGTCTACAAATCT AACTCTTTTGATTTGTCTCTTGCTAGAGAAAGGCTTACAAGAATTCGGAACCAGAGAAAG AACTGGGTCAGCTCAAAGAATTTAAACTCAAATGTTGCTCGTGGAAGCAGGGCTGCTAGT ACTGGTACGGGGAGATTAAGTGGCATTAATAATACATCTGACTCTTCAAAAATTGTGTTG GAAAAACAGAAGAAATCTATCTTCGATAGATATTCGAGCGCTGTAAATAGTAATGCACAG AACAGTGGAATCAACTCTGAGCTGTTTGAAAAAATGCAACGTAACGGAAGTCATAAGAAG AGAAAGCTTGTCAGAGGCGATAAATTGGCTGGTGATGAGAGTTATGTAGATAGCTCAAAT TCTCAGTTAGCTAAAGCAAAGCAACAGCTACTGAAAAGTCGTAATAAAAAGTTCAATCTT TCTGATGATGAAGAGGAAGTAGGCAATGATAATGGAGACTTTTCAGACGCTAGTGGAGAT GAATACGAGGAGCATACTCCTGTGGCTAATATTGATGATCAGATCTTGGAATTTTTGAAT AACTCTGAAATAGATGATATTATGGATTTGGGTGAAGTTTCTTATGAAAGAGCTAAAATA ATTCAATCCGCAAGACCCTTCACTTCCTTGTATTCGTTTTCTCAACAAGAATTTTTGACA GAAGAAGAGAAGGAAAAGCAGGCTAAACTTCTAGCTCAGCCTAAGAAGAGGGGTAGAAGA GGAGCTGCCCAAAGAAAGGAAGGTGAGAAAGTTCTAGATAGAGTTTCACAAGCTATGAGA GGTTATAATGCCATTGAATCTGTCATCAAGACATGTTCACAATATGGTAAACTAGTCTCC TCTCAAATAAAAAAATGGGGTGTCAAAGTAGACGCGGAAAATGGTGATGATGGAGAGCTT GATTTTATTAATGTTGGTTCTGAAACTGAAGGTGACGCCATTGTTGAGGATTTTAGCTCG TCCGCTCAAACTCCAGCACCCGAAATGTCAGATAGCGAAGGCTCAGAAGAAATTAAGCTT GAAGATGTCAATAAGAAACTCGAGAGCACTACTGATAATAATAACAAGGACGATGGAGCC ACACTTGAAACTGCTTCAGAACCTAAGCTTGACACTATGGAAGAGAAGGATAAAACCACT GACACTTCTGTTGAAAAAACTGAAGAAAATACGGCTAATGTCAAGGTAGAAGACACAGAC AAAGCAAATGACGATCCAGATAGCGATTTTCAGGCTGATGAAGAAATGGAATTTATTGAT GAGGATGAGGAAGATGAGGAATACGATGAAGAACTTCCGGTAACAAGAAGGACTAGGAAT ACTGGCGGGAATTTCAGAAAGGAAGTTGTGAAAAAGAATAGTGTTGTTAAATTTTTCAGA GGGAGACCAAAACTTCTAAACCCTGATGTTTCATTGAAAGACTATCAACAGACTGGTATC AACTGGCTAAACTTATTATATCACAATCAAATTTCTTGTATATTGGCTGATGATATGGGT CTGGGAAAGACATGTCAAGTCATATCTTTCCTAGCTTATTTGAAACAAATCGGTCAGCCA AGTCCGCATTTGATTGTGGTTCCTTCTTCGACGTTAGAAAACTGGTTGAGAGAATTCCAA AAATTTTGTCCTTCATTAAAAATTGAGCCATATTACGGTACTCAACAAGAAAGAGCTGAT CTTCGAGAGATATTAGAGCGTAATGATGGGAAATATGATGTTATTGTCACAACATATAAC CTTGCTGCAGGTAACAAGTACGATGTATCATTTTTGAAGACCAGAAATTTCAATGTAGTT GTATATGATGAAGGCCATATGTTAAAGAACTCAATGTCTGAGAGATTTAATAAGCTGATG AGAATTCATGCTAATTTCAGGCTGCTTCTAACGGGTACACCATTGCAAAATAACTTAAAG GAGCTAATGTCTTTGTTAGAGTTTATTATGCCGAATCTTTTTGTATCCAAAAAGGAGTCC TTAGCTGCAGTTTTCAAACAAAGGGCGAAGACATCAGATGATAACAAGGGACACAACCCT CTGTTGGCTCAACAAGCAATCACCAGAGCGAAGACCATGATGAAACCATTTATTTTACGT AGAAGGAAGGATCAAGTGTTGAAGCACTTGCCTGCAAAGCATGTCCGTACCAGTTACTGT GCAATGAATGACACTCAGAGGGAGATATACAACCGTGAAGTGAAGCTTGTAATGGAGCAC AAGCAAATGATCAGGGATGGCACATTACCTGAAGACAAGAAAGAGCGTAGTAAGATTGAG AACAACAGCTCAAAGAATTTGATCATGTCTCTAAGAAAAGCATCTATTCATCCATTATTG TTCCGTCACATCTATGATGACGCGAAGATAGACAAGATGTGTGATGCGATTTTGGATGAG CCAGCTTACGCGGAGAATGGTAATAAGGAGTATATCAGGGAGGATATGAGTTTTATGACT GATTTTGAGTTGCACAGGCTATGTTGCAACTTCCCTAACACACTAGGTGATTATCAGCTG AAAAACGATGAATGGATGAATAGTGGTAAAGTGGATGCTCTAAAGAAGTTGCTAGATGAT ATCATCAACAAGAAACGTGAGAAAGTGTTGATCTTCACACTGTTCACGCAGGTGTTAGAT ATCCTGGAGAAAGTATTGAGCACCTTGAACTATAAATTTCTGAGACTCGATGGGTCCACC CAGGTGAACGACAGACAAACGATGATCGACAAGTTCTACGACGATAACACCATCCCCATC TTCATGCTATCTACCAGAGCAGGTGGGTTTGGTATCAATCTGGTGTGTGCGAACCATGTG ATCATCTTCGACCAAAGTTTCAACCCACACGACGATAGACAGGCGGCCGACAGAGCGCAC CGTGTGGGCCAGACCAAGGAAGTCACAGTGACCACGCTGATTACCAAGGACAGCATAGAG GAGAAGATCTTCCAGCTGGCGAAGACCAAGCTGGCGCTGGACAGTCAAGTCAGCAGCAGC GAGGACCAAAGCGATCTGATAGACAACAAGGTCAGCGATCTCCTGGAAGACATCATATAC ACAGAAGCTCAAAAGAAGTAA
>CAGL0H06193g.aa MSGNEGTEGQPGNNLSPVQYKTVSSSPLKPEASTQDAQKLREQFVFKPNNNLSSAITSTG QTASSATSGTAAPVNPNAFESLVAEFPDFSQTLVQAVYKSNSFDLSLARERLTRIRNQRK NWVSSKNLNSNVARGSRAASTGTGRLSGINNTSDSSKIVLEKQKKSIFDRYSSAVNSNAQ NSGINSELFEKMQRNGSHKKRKLVRGDKLAGDESYVDSSNSQLAKAKQQLLKSRNKKFNL SDDEEEVGNDNGDFSDASGDEYEEHTPVANIDDQILEFLNNSEIDDIMDLGEVSYERAKI IQSARPFTSLYSFSQQEFLTEEEKEKQAKLLAQPKKRGRRGAAQRKEGEKVLDRVSQAMR GYNAIESVIKTCSQYGKLVSSQIKKWGVKVDAENGDDGELDFINVGSETEGDAIVEDFSS SAQTPAPEMSDSEGSEEIKLEDVNKKLESTTDNNNKDDGATLETASEPKLDTMEEKDKTT DTSVEKTEENTANVKVEDTDKANDDPDSDFQADEEMEFIDEDEEDEEYDEELPVTRRTRN TGGNFRKEVVKKNSVVKFFRGRPKLLNPDVSLKDYQQTGINWLNLLYHNQISCILADDMG LGKTCQVISFLAYLKQIGQPSPHLIVVPSSTLENWLREFQKFCPSLKIEPYYGTQQERAD LREILERNDGKYDVIVTTYNLAAGNKYDVSFLKTRNFNVVVYDEGHMLKNSMSERFNKLM RIHANFRLLLTGTPLQNNLKELMSLLEFIMPNLFVSKKESLAAVFKQRAKTSDDNKGHNP LLAQQAITRAKTMMKPFILRRRKDQVLKHLPAKHVRTSYCAMNDTQREIYNREVKLVMEH KQMIRDGTLPEDKKERSKIENNSSKNLIMSLRKASIHPLLFRHIYDDAKIDKMCDAILDE PAYAENGNKEYIREDMSFMTDFELHRLCCNFPNTLGDYQLKNDEWMNSGKVDALKKLLDD IINKKREKVLIFTLFTQVLDILEKVLSTLNYKFLRLDGSTQVNDRQTMIDKFYDDNTIPI FMLSTRAGGFGINLVCANHVIIFDQSFNPHDDRQAADRAHRVGQTKEVTVTTLITKDSIE EKIFQLAKTKLALDSQVSSSEDQSDLIDNKVSDLLEDIIYTEAQKK*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0H06193g