CAGL0H06193g


similar to uniprot|P31380 Saccharomyces cerevisiae YAL019w FUN30

Genomic environment map

Element type: CDS
Element length: 3381 nucleotides, on sense strand of Cagl0H: 604422..607802.
Other names:
CAGL-CDS0292.1
CAGL-IPF334
Coding sequence: 1127 codons.
Database cross references:
EMBL: CR380954
GeneID: 2888691
GenomeReviews: CR380954_GR
HOGENOM: HBG398150

Computed results  

None available yet


Protein CAGL0H06193p  


similar to uniprot|P31380 Saccharomyces cerevisiae YAL019w FUN30; SubName: Full=Similar to uniprot|P31380 Saccharomyces cerevisiae YAL019w FUN30;

Protein domain map

Protein length: 1126 amino acids
Protein family: GL3M4588
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR003892
InterPro: IPR014001
InterPro: IPR014021
KEGG: cgr:CAGL0H06193g
PROSITE: PS51140
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_447064.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6FRT0
UniProtKB: Q6FRT0_CANGA

Phylogeny  

PhylomeDB:CAGL0H06193g

Computed results for CAGL0H06193p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>CAGL0H06193g.nt
ATGTCCGGAAACGAAGGCACTGAGGGCCAGCCGGGCAACAATCTATCTCCTGTTCAGTAC
AAAACTGTATCTTCTTCTCCATTGAAACCCGAAGCGAGTACTCAGGATGCACAGAAATTA
CGCGAACAGTTTGTTTTCAAGCCCAATAATAATCTCTCTAGCGCCATCACGAGTACAGGT
CAAACAGCGTCATCTGCCACAAGTGGTACCGCAGCTCCTGTAAATCCAAATGCATTTGAA
TCGTTGGTGGCTGAATTTCCAGACTTTTCGCAGACGCTAGTGCAAGCAGTCTACAAATCT
AACTCTTTTGATTTGTCTCTTGCTAGAGAAAGGCTTACAAGAATTCGGAACCAGAGAAAG
AACTGGGTCAGCTCAAAGAATTTAAACTCAAATGTTGCTCGTGGAAGCAGGGCTGCTAGT
ACTGGTACGGGGAGATTAAGTGGCATTAATAATACATCTGACTCTTCAAAAATTGTGTTG
GAAAAACAGAAGAAATCTATCTTCGATAGATATTCGAGCGCTGTAAATAGTAATGCACAG
AACAGTGGAATCAACTCTGAGCTGTTTGAAAAAATGCAACGTAACGGAAGTCATAAGAAG
AGAAAGCTTGTCAGAGGCGATAAATTGGCTGGTGATGAGAGTTATGTAGATAGCTCAAAT
TCTCAGTTAGCTAAAGCAAAGCAACAGCTACTGAAAAGTCGTAATAAAAAGTTCAATCTT
TCTGATGATGAAGAGGAAGTAGGCAATGATAATGGAGACTTTTCAGACGCTAGTGGAGAT
GAATACGAGGAGCATACTCCTGTGGCTAATATTGATGATCAGATCTTGGAATTTTTGAAT
AACTCTGAAATAGATGATATTATGGATTTGGGTGAAGTTTCTTATGAAAGAGCTAAAATA
ATTCAATCCGCAAGACCCTTCACTTCCTTGTATTCGTTTTCTCAACAAGAATTTTTGACA
GAAGAAGAGAAGGAAAAGCAGGCTAAACTTCTAGCTCAGCCTAAGAAGAGGGGTAGAAGA
GGAGCTGCCCAAAGAAAGGAAGGTGAGAAAGTTCTAGATAGAGTTTCACAAGCTATGAGA
GGTTATAATGCCATTGAATCTGTCATCAAGACATGTTCACAATATGGTAAACTAGTCTCC
TCTCAAATAAAAAAATGGGGTGTCAAAGTAGACGCGGAAAATGGTGATGATGGAGAGCTT
GATTTTATTAATGTTGGTTCTGAAACTGAAGGTGACGCCATTGTTGAGGATTTTAGCTCG
TCCGCTCAAACTCCAGCACCCGAAATGTCAGATAGCGAAGGCTCAGAAGAAATTAAGCTT
GAAGATGTCAATAAGAAACTCGAGAGCACTACTGATAATAATAACAAGGACGATGGAGCC
ACACTTGAAACTGCTTCAGAACCTAAGCTTGACACTATGGAAGAGAAGGATAAAACCACT
GACACTTCTGTTGAAAAAACTGAAGAAAATACGGCTAATGTCAAGGTAGAAGACACAGAC
AAAGCAAATGACGATCCAGATAGCGATTTTCAGGCTGATGAAGAAATGGAATTTATTGAT
GAGGATGAGGAAGATGAGGAATACGATGAAGAACTTCCGGTAACAAGAAGGACTAGGAAT
ACTGGCGGGAATTTCAGAAAGGAAGTTGTGAAAAAGAATAGTGTTGTTAAATTTTTCAGA
GGGAGACCAAAACTTCTAAACCCTGATGTTTCATTGAAAGACTATCAACAGACTGGTATC
AACTGGCTAAACTTATTATATCACAATCAAATTTCTTGTATATTGGCTGATGATATGGGT
CTGGGAAAGACATGTCAAGTCATATCTTTCCTAGCTTATTTGAAACAAATCGGTCAGCCA
AGTCCGCATTTGATTGTGGTTCCTTCTTCGACGTTAGAAAACTGGTTGAGAGAATTCCAA
AAATTTTGTCCTTCATTAAAAATTGAGCCATATTACGGTACTCAACAAGAAAGAGCTGAT
CTTCGAGAGATATTAGAGCGTAATGATGGGAAATATGATGTTATTGTCACAACATATAAC
CTTGCTGCAGGTAACAAGTACGATGTATCATTTTTGAAGACCAGAAATTTCAATGTAGTT
GTATATGATGAAGGCCATATGTTAAAGAACTCAATGTCTGAGAGATTTAATAAGCTGATG
AGAATTCATGCTAATTTCAGGCTGCTTCTAACGGGTACACCATTGCAAAATAACTTAAAG
GAGCTAATGTCTTTGTTAGAGTTTATTATGCCGAATCTTTTTGTATCCAAAAAGGAGTCC
TTAGCTGCAGTTTTCAAACAAAGGGCGAAGACATCAGATGATAACAAGGGACACAACCCT
CTGTTGGCTCAACAAGCAATCACCAGAGCGAAGACCATGATGAAACCATTTATTTTACGT
AGAAGGAAGGATCAAGTGTTGAAGCACTTGCCTGCAAAGCATGTCCGTACCAGTTACTGT
GCAATGAATGACACTCAGAGGGAGATATACAACCGTGAAGTGAAGCTTGTAATGGAGCAC
AAGCAAATGATCAGGGATGGCACATTACCTGAAGACAAGAAAGAGCGTAGTAAGATTGAG
AACAACAGCTCAAAGAATTTGATCATGTCTCTAAGAAAAGCATCTATTCATCCATTATTG
TTCCGTCACATCTATGATGACGCGAAGATAGACAAGATGTGTGATGCGATTTTGGATGAG
CCAGCTTACGCGGAGAATGGTAATAAGGAGTATATCAGGGAGGATATGAGTTTTATGACT
GATTTTGAGTTGCACAGGCTATGTTGCAACTTCCCTAACACACTAGGTGATTATCAGCTG
AAAAACGATGAATGGATGAATAGTGGTAAAGTGGATGCTCTAAAGAAGTTGCTAGATGAT
ATCATCAACAAGAAACGTGAGAAAGTGTTGATCTTCACACTGTTCACGCAGGTGTTAGAT
ATCCTGGAGAAAGTATTGAGCACCTTGAACTATAAATTTCTGAGACTCGATGGGTCCACC
CAGGTGAACGACAGACAAACGATGATCGACAAGTTCTACGACGATAACACCATCCCCATC
TTCATGCTATCTACCAGAGCAGGTGGGTTTGGTATCAATCTGGTGTGTGCGAACCATGTG
ATCATCTTCGACCAAAGTTTCAACCCACACGACGATAGACAGGCGGCCGACAGAGCGCAC
CGTGTGGGCCAGACCAAGGAAGTCACAGTGACCACGCTGATTACCAAGGACAGCATAGAG
GAGAAGATCTTCCAGCTGGCGAAGACCAAGCTGGCGCTGGACAGTCAAGTCAGCAGCAGC
GAGGACCAAAGCGATCTGATAGACAACAAGGTCAGCGATCTCCTGGAAGACATCATATAC
ACAGAAGCTCAAAAGAAGTAA

Coding sequence    

>CAGL0H06193g.cds
ATGTCCGGAAACGAAGGCACTGAGGGCCAGCCGGGCAACAATCTATCTCCTGTTCAGTAC
AAAACTGTATCTTCTTCTCCATTGAAACCCGAAGCGAGTACTCAGGATGCACAGAAATTA
CGCGAACAGTTTGTTTTCAAGCCCAATAATAATCTCTCTAGCGCCATCACGAGTACAGGT
CAAACAGCGTCATCTGCCACAAGTGGTACCGCAGCTCCTGTAAATCCAAATGCATTTGAA
TCGTTGGTGGCTGAATTTCCAGACTTTTCGCAGACGCTAGTGCAAGCAGTCTACAAATCT
AACTCTTTTGATTTGTCTCTTGCTAGAGAAAGGCTTACAAGAATTCGGAACCAGAGAAAG
AACTGGGTCAGCTCAAAGAATTTAAACTCAAATGTTGCTCGTGGAAGCAGGGCTGCTAGT
ACTGGTACGGGGAGATTAAGTGGCATTAATAATACATCTGACTCTTCAAAAATTGTGTTG
GAAAAACAGAAGAAATCTATCTTCGATAGATATTCGAGCGCTGTAAATAGTAATGCACAG
AACAGTGGAATCAACTCTGAGCTGTTTGAAAAAATGCAACGTAACGGAAGTCATAAGAAG
AGAAAGCTTGTCAGAGGCGATAAATTGGCTGGTGATGAGAGTTATGTAGATAGCTCAAAT
TCTCAGTTAGCTAAAGCAAAGCAACAGCTACTGAAAAGTCGTAATAAAAAGTTCAATCTT
TCTGATGATGAAGAGGAAGTAGGCAATGATAATGGAGACTTTTCAGACGCTAGTGGAGAT
GAATACGAGGAGCATACTCCTGTGGCTAATATTGATGATCAGATCTTGGAATTTTTGAAT
AACTCTGAAATAGATGATATTATGGATTTGGGTGAAGTTTCTTATGAAAGAGCTAAAATA
ATTCAATCCGCAAGACCCTTCACTTCCTTGTATTCGTTTTCTCAACAAGAATTTTTGACA
GAAGAAGAGAAGGAAAAGCAGGCTAAACTTCTAGCTCAGCCTAAGAAGAGGGGTAGAAGA
GGAGCTGCCCAAAGAAAGGAAGGTGAGAAAGTTCTAGATAGAGTTTCACAAGCTATGAGA
GGTTATAATGCCATTGAATCTGTCATCAAGACATGTTCACAATATGGTAAACTAGTCTCC
TCTCAAATAAAAAAATGGGGTGTCAAAGTAGACGCGGAAAATGGTGATGATGGAGAGCTT
GATTTTATTAATGTTGGTTCTGAAACTGAAGGTGACGCCATTGTTGAGGATTTTAGCTCG
TCCGCTCAAACTCCAGCACCCGAAATGTCAGATAGCGAAGGCTCAGAAGAAATTAAGCTT
GAAGATGTCAATAAGAAACTCGAGAGCACTACTGATAATAATAACAAGGACGATGGAGCC
ACACTTGAAACTGCTTCAGAACCTAAGCTTGACACTATGGAAGAGAAGGATAAAACCACT
GACACTTCTGTTGAAAAAACTGAAGAAAATACGGCTAATGTCAAGGTAGAAGACACAGAC
AAAGCAAATGACGATCCAGATAGCGATTTTCAGGCTGATGAAGAAATGGAATTTATTGAT
GAGGATGAGGAAGATGAGGAATACGATGAAGAACTTCCGGTAACAAGAAGGACTAGGAAT
ACTGGCGGGAATTTCAGAAAGGAAGTTGTGAAAAAGAATAGTGTTGTTAAATTTTTCAGA
GGGAGACCAAAACTTCTAAACCCTGATGTTTCATTGAAAGACTATCAACAGACTGGTATC
AACTGGCTAAACTTATTATATCACAATCAAATTTCTTGTATATTGGCTGATGATATGGGT
CTGGGAAAGACATGTCAAGTCATATCTTTCCTAGCTTATTTGAAACAAATCGGTCAGCCA
AGTCCGCATTTGATTGTGGTTCCTTCTTCGACGTTAGAAAACTGGTTGAGAGAATTCCAA
AAATTTTGTCCTTCATTAAAAATTGAGCCATATTACGGTACTCAACAAGAAAGAGCTGAT
CTTCGAGAGATATTAGAGCGTAATGATGGGAAATATGATGTTATTGTCACAACATATAAC
CTTGCTGCAGGTAACAAGTACGATGTATCATTTTTGAAGACCAGAAATTTCAATGTAGTT
GTATATGATGAAGGCCATATGTTAAAGAACTCAATGTCTGAGAGATTTAATAAGCTGATG
AGAATTCATGCTAATTTCAGGCTGCTTCTAACGGGTACACCATTGCAAAATAACTTAAAG
GAGCTAATGTCTTTGTTAGAGTTTATTATGCCGAATCTTTTTGTATCCAAAAAGGAGTCC
TTAGCTGCAGTTTTCAAACAAAGGGCGAAGACATCAGATGATAACAAGGGACACAACCCT
CTGTTGGCTCAACAAGCAATCACCAGAGCGAAGACCATGATGAAACCATTTATTTTACGT
AGAAGGAAGGATCAAGTGTTGAAGCACTTGCCTGCAAAGCATGTCCGTACCAGTTACTGT
GCAATGAATGACACTCAGAGGGAGATATACAACCGTGAAGTGAAGCTTGTAATGGAGCAC
AAGCAAATGATCAGGGATGGCACATTACCTGAAGACAAGAAAGAGCGTAGTAAGATTGAG
AACAACAGCTCAAAGAATTTGATCATGTCTCTAAGAAAAGCATCTATTCATCCATTATTG
TTCCGTCACATCTATGATGACGCGAAGATAGACAAGATGTGTGATGCGATTTTGGATGAG
CCAGCTTACGCGGAGAATGGTAATAAGGAGTATATCAGGGAGGATATGAGTTTTATGACT
GATTTTGAGTTGCACAGGCTATGTTGCAACTTCCCTAACACACTAGGTGATTATCAGCTG
AAAAACGATGAATGGATGAATAGTGGTAAAGTGGATGCTCTAAAGAAGTTGCTAGATGAT
ATCATCAACAAGAAACGTGAGAAAGTGTTGATCTTCACACTGTTCACGCAGGTGTTAGAT
ATCCTGGAGAAAGTATTGAGCACCTTGAACTATAAATTTCTGAGACTCGATGGGTCCACC
CAGGTGAACGACAGACAAACGATGATCGACAAGTTCTACGACGATAACACCATCCCCATC
TTCATGCTATCTACCAGAGCAGGTGGGTTTGGTATCAATCTGGTGTGTGCGAACCATGTG
ATCATCTTCGACCAAAGTTTCAACCCACACGACGATAGACAGGCGGCCGACAGAGCGCAC
CGTGTGGGCCAGACCAAGGAAGTCACAGTGACCACGCTGATTACCAAGGACAGCATAGAG
GAGAAGATCTTCCAGCTGGCGAAGACCAAGCTGGCGCTGGACAGTCAAGTCAGCAGCAGC
GAGGACCAAAGCGATCTGATAGACAACAAGGTCAGCGATCTCCTGGAAGACATCATATAC
ACAGAAGCTCAAAAGAAGTAA

Predicted translation product    

>CAGL0H06193g.aa
MSGNEGTEGQPGNNLSPVQYKTVSSSPLKPEASTQDAQKLREQFVFKPNNNLSSAITSTG
QTASSATSGTAAPVNPNAFESLVAEFPDFSQTLVQAVYKSNSFDLSLARERLTRIRNQRK
NWVSSKNLNSNVARGSRAASTGTGRLSGINNTSDSSKIVLEKQKKSIFDRYSSAVNSNAQ
NSGINSELFEKMQRNGSHKKRKLVRGDKLAGDESYVDSSNSQLAKAKQQLLKSRNKKFNL
SDDEEEVGNDNGDFSDASGDEYEEHTPVANIDDQILEFLNNSEIDDIMDLGEVSYERAKI
IQSARPFTSLYSFSQQEFLTEEEKEKQAKLLAQPKKRGRRGAAQRKEGEKVLDRVSQAMR
GYNAIESVIKTCSQYGKLVSSQIKKWGVKVDAENGDDGELDFINVGSETEGDAIVEDFSS
SAQTPAPEMSDSEGSEEIKLEDVNKKLESTTDNNNKDDGATLETASEPKLDTMEEKDKTT
DTSVEKTEENTANVKVEDTDKANDDPDSDFQADEEMEFIDEDEEDEEYDEELPVTRRTRN
TGGNFRKEVVKKNSVVKFFRGRPKLLNPDVSLKDYQQTGINWLNLLYHNQISCILADDMG
LGKTCQVISFLAYLKQIGQPSPHLIVVPSSTLENWLREFQKFCPSLKIEPYYGTQQERAD
LREILERNDGKYDVIVTTYNLAAGNKYDVSFLKTRNFNVVVYDEGHMLKNSMSERFNKLM
RIHANFRLLLTGTPLQNNLKELMSLLEFIMPNLFVSKKESLAAVFKQRAKTSDDNKGHNP
LLAQQAITRAKTMMKPFILRRRKDQVLKHLPAKHVRTSYCAMNDTQREIYNREVKLVMEH
KQMIRDGTLPEDKKERSKIENNSSKNLIMSLRKASIHPLLFRHIYDDAKIDKMCDAILDE
PAYAENGNKEYIREDMSFMTDFELHRLCCNFPNTLGDYQLKNDEWMNSGKVDALKKLLDD
IINKKREKVLIFTLFTQVLDILEKVLSTLNYKFLRLDGSTQVNDRQTMIDKFYDDNTIPI
FMLSTRAGGFGINLVCANHVIIFDQSFNPHDDRQAADRAHRVGQTKEVTVTTLITKDSIE
EKIFQLAKTKLALDSQVSSSEDQSDLIDNKVSDLLEDIIYTEAQKK*
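
The relation between the coding sequence and the predicted translation product can be checked with a minimal sketch such as the one below. It assumes the standard nuclear genetic code (NCBI translation table 1) and an uninterrupted coding sequence; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (assumed here), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop FASTA line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last codons of the record above.
print(translate("ATGTCCGGAAACGAAGGC"))      # MSGNEG
print(translate("ACAGAAGCTCAAAAGAAGTAA"))   # TEAQKK*
```

The trailing `*` corresponds to the stop codon, which is why the translation shown above ends with `*`.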




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
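
As a worked check of this rule, the sketch below recomputes the three lengths reported on this page from the element coordinates 604422..607802. It assumes inclusive, 1-based coordinates and an element without introns; the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (nucleotides, codons, amino acids) for an intron-less CDS.

    Coordinates are assumed to be 1-based and inclusive.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nucleotides // 3      # includes the stop codon
    amino_acids = codons - 1       # the stop codon encodes no residue
    return nucleotides, codons, amino_acids


print(cds_lengths(604422, 607802))  # (3381, 1127, 1126)
```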

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the open reading frame protein-coding potential appears on frame #3.
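
For illustration, the analysis window described here can be computed from the element coordinates. This is only a sketch: it assumes an element on the direct strand, 1-based inclusive coordinates, and a window that is not clipped at the chromosome ends. The function name `genemark_window` and its parameters are hypothetical, not taken from Genemark or the database.

```python
def genemark_window(start: int, end: int,
                    upstream: int = 1000, downstream: int = 300) -> tuple[int, int]:
    """Return the (start, end) of the analysed region around a direct-strand ORF."""
    window_start = max(1, start - upstream)  # never go below position 1
    window_end = end + downstream            # chromosome length is not checked here
    return window_start, window_end


w_start, w_end = genemark_window(604422, 607802)
print(w_start, w_end)        # 603422 608102
print(w_end - w_start + 1)   # 4681 = 1000 + 3381 + 300
```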

Sequences
Color  Nucleotide sequence and Coding sequence               Predicted translation product
RED    start and stop codons                                 Initial methionine and sequence end
BLUE   coding sequence                                       protein sequence
grey   non-coding sequence (upstream, downstream or intron)
grey   donor and acceptor splicing sites