CAGL0M14113g
similar to uniprot|Q05998 Saccharomyces cerevisiae YLR237w THI7 or uniprot|Q08485 Saccharomyces cerevisiae YOR071c or uniprot|Q08579 Saccharomyces cerevisiae YOR192c
Element type: CDS
Element length: 1746 nucleotides,
on anti-sense strand of
Cagl0M: complement(1397407..1399152).
Other names:
CAGL-CDS1607.1
CAGL-IPF6031
Coding sequence: 582 codons.
Element length: 1746 nucleotides,
on anti-sense strand of
Cagl0M: complement(1397407..1399152).
Other names:
CAGL-CDS1607.1
CAGL-IPF6031
Coding sequence: 582 codons.
Database cross references:
EMBL: CR380959
GeneID: 2891620
GenomeReviews: CR380959_GR
HOGENOM: HBG626339
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380959
GeneID: 2891620
GenomeReviews: CR380959_GR
HOGENOM: HBG626339
Homologs and Orthologs
Homologs in protein family: GL3C0733Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0M14113p 
similar to uniprot|Q05998 Saccharomyces cerevisiae YLR237w THI7 or uniprot|Q08485 Saccharomyces cerevisiae YOR071c or uniprot|Q08579 Saccharomyces cerevisiae YOR192c; SubName: Full=Strain CBS138 chromosome M complete sequence;
Protein domain map
Database cross references:
InterPro: IPR001248
InterPro: IPR012681
KEGG: cgr:CAGL0M14113g
Pfam: PF02133
RefSeq: XP_449958.1
TIGRFAMs: TIGR00800
UniProtKB/TrEMBL: Q6FII6
UniProtKB: Q6FII6_CANGA
Phylogeny
PhylomeDB:CAGL0M14113g
InterPro: IPR001248
InterPro: IPR012681
KEGG: cgr:CAGL0M14113g
Pfam: PF02133
RefSeq: XP_449958.1
TIGRFAMs: TIGR00800
UniProtKB/TrEMBL: Q6FII6
UniProtKB: Q6FII6_CANGA
Phylogeny 
PhylomeDB:CAGL0M14113gSequence data 
>CAGL0M14113g.nt ATGAGAGGATTTAAGTTTCTCAAGTATTTGGAAGTCCCTGTGGAAGAAAGACAAACTTTA AGTTTCTTAAAAAATCCAGACTTGGTACCAATCCCTAAAAGCCACCAAACATGGGGGTTC TGGTCTAACTTTGCCTACTGGGGTACAATTGCCTTTACAGTAGGTACATGGATGGGCGCT ACAGCAGCATTGACAGTTGGGCTAAGTTATCCAGAGACTATCGCAACATTTATTCTGGGG AATGCTCTAACAATTGTTTACTCACTGGCCAACTGTTATCCAGGTTGGGACTGGAAAGTT GGCTATACGCTTTCTCAAAGATTTACCTTCGGTATCTATGGATCTGCTTTTGGTGTTATT ATTAGAGTTTTATTAAGTATTGTGAACTATGGCTCAAACGCGTGGTTAGGAGGGTTATGT ATTAACCTGATATTAAATTCATGGTCACACCACTATCTAGAATTAAAAAATACTTTATCT CCGCATGTTGCTATGACCACAAAAGAATTGATCGGTTTCGTTATTTTCCATATTGTCTGT GCTTTATGCTATCTAATGAAACCATATCAAATTAATAGGATATTAATAATTGCTTGTGCC GGGACATGTTTCTCAATGCTGGGGATAATCATATATCTGTGTCATGCAAATGGGGGTGCT GGTTCATTATTTCACACCCAAAAAACAACGGTTAGTGGTTCTGATAGGGCTTGGGCATGG GTTTACATGATATCATATTGGTTTGGCGCTGTTTCACCTGGTGCTGTAAACCAAAGTGAC TATTCCAGATTTGGTTCATCTTTAAAAGGAATTTATCTTGGAACTATTGTTGGTTTAATG ATTCCCCCAAACCTAGTTCCAGCATATGGTGTTATTGGTGCTTCTACTACACAAGAATTG TATGGTGAGTCCATGTGGATGCCAACAGAAATCTGTAATTATTGGTTAAATCACGGCTAC CATCCTGCAGCTAGAGCAGCCTCGTTTTTTTGCGGTGTTTTCTTCGCATTATCTCAAATT GCTTATACCATCTCTAATGCTGGATTCGCAAGCGGTATGGATTTGGCTGGTTTATTACCC AAATACGTCAACATTAAGAGAGGTGCTTTTTTTACTGCTATTGTATCGGTTGCAGTTCAA CCGTGGAACTTTTATAATTCCTCCTCAACCTTTTTAACAGTAATGAGTTCTTTCGGTGTT GTCATGGTGCCTATACTAGCTGTTATGATTTGTGATAACTTCATTATCAGGAAACGAAAC TATTCAGCATCACAGGCTTTTATTTTAAAAGGTGAATATTATTTTACAAAGGGTTTCAAC TGGAGAGCGTTTATTGCGATGATTGTGCCTATGGCACCAGGTCTACCGGGAATCGCATGG CAAGTCAATCATAATGCTTTTAACAATAGAGGTATAGTCAATTTTTATTATGGTGACTCC TTTTTCGCGTTTGTTATGTCATTTTGCTTGTACTGGATTTTGTGTATTATATTCCCCTTC AAAATTAACGTTCTACAGGATGACAAAGACTATTATGGTGCCTTCGATGAAAAAACCGCG ATTAAAAAGGGTATGGTTCCTTATAGTGGATTGACAGAGGCTGAAAGACAAGAGTACTTC ATTCCAACCGCTAATGAGGTCATGAGAGAAGAGCAACCAACTGAGTCGGATAGTGAGATA TCACGTGCATATGTTGAGGAAGGAACTGAAAAGGGTAGCACAAAGGACGGAGTTGCAGAA ATATGA
>CAGL0M14113g.cds ATGAGAGGATTTAAGTTTCTCAAGTATTTGGAAGTCCCTGTGGAAGAAAGACAAACTTTA AGTTTCTTAAAAAATCCAGACTTGGTACCAATCCCTAAAAGCCACCAAACATGGGGGTTC TGGTCTAACTTTGCCTACTGGGGTACAATTGCCTTTACAGTAGGTACATGGATGGGCGCT ACAGCAGCATTGACAGTTGGGCTAAGTTATCCAGAGACTATCGCAACATTTATTCTGGGG AATGCTCTAACAATTGTTTACTCACTGGCCAACTGTTATCCAGGTTGGGACTGGAAAGTT GGCTATACGCTTTCTCAAAGATTTACCTTCGGTATCTATGGATCTGCTTTTGGTGTTATT ATTAGAGTTTTATTAAGTATTGTGAACTATGGCTCAAACGCGTGGTTAGGAGGGTTATGT ATTAACCTGATATTAAATTCATGGTCACACCACTATCTAGAATTAAAAAATACTTTATCT CCGCATGTTGCTATGACCACAAAAGAATTGATCGGTTTCGTTATTTTCCATATTGTCTGT GCTTTATGCTATCTAATGAAACCATATCAAATTAATAGGATATTAATAATTGCTTGTGCC GGGACATGTTTCTCAATGCTGGGGATAATCATATATCTGTGTCATGCAAATGGGGGTGCT GGTTCATTATTTCACACCCAAAAAACAACGGTTAGTGGTTCTGATAGGGCTTGGGCATGG GTTTACATGATATCATATTGGTTTGGCGCTGTTTCACCTGGTGCTGTAAACCAAAGTGAC TATTCCAGATTTGGTTCATCTTTAAAAGGAATTTATCTTGGAACTATTGTTGGTTTAATG ATTCCCCCAAACCTAGTTCCAGCATATGGTGTTATTGGTGCTTCTACTACACAAGAATTG TATGGTGAGTCCATGTGGATGCCAACAGAAATCTGTAATTATTGGTTAAATCACGGCTAC CATCCTGCAGCTAGAGCAGCCTCGTTTTTTTGCGGTGTTTTCTTCGCATTATCTCAAATT GCTTATACCATCTCTAATGCTGGATTCGCAAGCGGTATGGATTTGGCTGGTTTATTACCC AAATACGTCAACATTAAGAGAGGTGCTTTTTTTACTGCTATTGTATCGGTTGCAGTTCAA CCGTGGAACTTTTATAATTCCTCCTCAACCTTTTTAACAGTAATGAGTTCTTTCGGTGTT GTCATGGTGCCTATACTAGCTGTTATGATTTGTGATAACTTCATTATCAGGAAACGAAAC TATTCAGCATCACAGGCTTTTATTTTAAAAGGTGAATATTATTTTACAAAGGGTTTCAAC TGGAGAGCGTTTATTGCGATGATTGTGCCTATGGCACCAGGTCTACCGGGAATCGCATGG CAAGTCAATCATAATGCTTTTAACAATAGAGGTATAGTCAATTTTTATTATGGTGACTCC TTTTTCGCGTTTGTTATGTCATTTTGCTTGTACTGGATTTTGTGTATTATATTCCCCTTC AAAATTAACGTTCTACAGGATGACAAAGACTATTATGGTGCCTTCGATGAAAAAACCGCG ATTAAAAAGGGTATGGTTCCTTATAGTGGATTGACAGAGGCTGAAAGACAAGAGTACTTC ATTCCAACCGCTAATGAGGTCATGAGAGAAGAGCAACCAACTGAGTCGGATAGTGAGATA TCACGTGCATATGTTGAGGAAGGAACTGAAAAGGGTAGCACAAAGGACGGAGTTGCAGAA ATATGA
>CAGL0M14113g.aa MRGFKFLKYLEVPVEERQTLSFLKNPDLVPIPKSHQTWGFWSNFAYWGTIAFTVGTWMGA TAALTVGLSYPETIATFILGNALTIVYSLANCYPGWDWKVGYTLSQRFTFGIYGSAFGVI IRVLLSIVNYGSNAWLGGLCINLILNSWSHHYLELKNTLSPHVAMTTKELIGFVIFHIVC ALCYLMKPYQINRILIIACAGTCFSMLGIIIYLCHANGGAGSLFHTQKTTVSGSDRAWAW VYMISYWFGAVSPGAVNQSDYSRFGSSLKGIYLGTIVGLMIPPNLVPAYGVIGASTTQEL YGESMWMPTEICNYWLNHGYHPAARAASFFCGVFFALSQIAYTISNAGFASGMDLAGLLP KYVNIKRGAFFTAIVSVAVQPWNFYNSSSTFLTVMSSFGVVMVPILAVMICDNFIIRKRN YSASQAFILKGEYYFTKGFNWRAFIAMIVPMAPGLPGIAWQVNHNAFNNRGIVNFYYGDS FFAFVMSFCLYWILCIIFPFKINVLQDDKDYYGAFDEKTAIKKGMVPYSGLTEAERQEYF IPTANEVMREEQPTESDSEISRAYVEEGTEKGSTKDGVAEI*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0M14113p