CAGL0K12342g
highly similar to uniprot|P16661 Saccharomyces cerevisiae YBR110w ALG1 Mannosyltransferase, involved in asparagine-linked glycosylation in the endoplasmic reticulum (ER)
Element type: CDS
Element length: 1353 nucleotides,
on sense strand of
Cagl0K: 1204665..1206017.
Other names:
CAGL-CDS2400.1
CAGL-IPF4645
Coding sequence: 451 codons.
Element length: 1353 nucleotides,
on sense strand of
Cagl0K: 1204665..1206017.
Other names:
CAGL-CDS2400.1
CAGL-IPF4645
Coding sequence: 451 codons.
Database cross references:
EMBL: CR380957
GeneID: 2889981
HOGENOM: Q6FLZ2
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380957
GeneID: 2889981
HOGENOM: Q6FLZ2
Homologs and Orthologs
Homologs in protein families: GL3R2097 GL3R2097.N1Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0K12342p 
highly similar to uniprot|P16661 Saccharomyces cerevisiae YBR110w ALG1 beta-mannosyltransferase
Protein domain map
Database cross references:
InterPro: IPR001296
KEGG: cgr:CAGL0K12342g
Pfam: PF00534
RefSeq: XP_448752.1
UniProtKB/Swiss-Prot: Q6FLZ2
UniprotKB: ALG1_CANGA
InterPro: IPR001296
KEGG: cgr:CAGL0K12342g
Pfam: PF00534
RefSeq: XP_448752.1
UniProtKB/Swiss-Prot: Q6FLZ2
UniprotKB: ALG1_CANGA
Sequence data 
Nucleotide sequence
>CAGL0K12342g.nt AACCACATAATAAATCTCTAGAAGAGAGTCATACTCCTTCAAATAAGTTATCTGAGAAAC CGAAACCTCCGAAGAAACCAGAACAACTGAAGGATTTGAATTTATCTACCCAACAAGCTG ATAAGAACATGAAGAATAAGGATCAGTTGTTTGATAACAAAAATGAGTTATTGAGCACCA TAAAAAATAAAAAAAGGCCGGCGCCTATACCCAAACCCAAACCTAAAAGTTTAACTTCAG AAGGTAACCATATGAATTTGAACACTGAAAAGGGAAAAGAAACTACAATAGAAAAACCTG ACGAGTCAAAATTTTTACCAATTTCGAGTTTTCCACCTCCACCAAAGCCATTTAAACGTG AAGAGCTCTCAAAGGAGGTAGTAGATAGTGTAGGAGAAACTGCGGATTTTACAAAGCCAA AAAGAGCTGGACAGTTAGAATCAACTCAGACCGAGAAATCAAATTCAAAAGGAAAAGCAC CACCACCTGTTCCAAAGAAGCGTAATGCACAGTCAAAATCCAGCCCTAGCCTAGAAGGGA GTGAGGATAACCCATTTTCAAAGTATTTAAAGGACGCAGTTCCTAACGAGCCTGATCGCT TACATAAATAATAATGCCCCAATTAAAAATACAATCAATATGAGCAACGAATATCTTACG TATCTTCGAAAGATCTAACTCATTTTATATAATAAAAACACTAATGAAAACTGGATAAGA ATTACTACTATATCTATTATTTAAGTGGTTTATATCATCTCGTCTCTTTTGGCACCTTTT TTTTTCTTGGTGAACATCAAATTCTATGCGAGGTGCTATCGAGAACAATCGATTTGTTCA GTTACCAATAGATTAGCATAAGATAATACACTGATCAACTATAAATCTATCATAATATCT AAGTGTGTTTAGCTGATTCCCTTCTCTTATATTTATCATGGTTATCTAAATATAAAATAT TATCCAACCTGCAAGAGGTTAGCTTAGCTATAAACGAGTTATGTCTTGGATACAGATACC ATGGAGTTGGGTTGTGACCCTAATAGTGACATATTTGTCTCTGCCATTAATTATTTACTA TTTGGTGCCATATATCTTTTATGGGAATAAGTCTTCAAAAAAGAGGATCATCATCTATGT GCTTGGAGATATAGGCCATTCACCAAGAATGTGCTACCATGCCAGAAGTTTTAGTGAGAA AGGCTGGCAAGTTGAGTTATGTGGATATGTGGAAGAGCAAGTTCCTGGTTTTATTGCAGA AGATCCCAATATTATTGTCCACGCTCTTCCAACGCTGACGCTACAAGGAAACAAGAGATC AATAATATTTTTGGTAAAGAAAGTATTGTTCCAAGTTTCTGCTATCATTGCTCAATTATG GGAGCTTAGGGGCAGTAATTATATGTTAATTCAGAACCCACCATCGATTCCGATTTTACC AATTGCTGTTTTCTATCGTTTGAGTGGTTGTAAACTTATCATTGATTGGCACAACTTGGC ATATTCCATTATGCAATTGAAATTTAATGGAAACTTTTATCACCCCGTTGTACTTGCCTC ATATGTCATAGAGTATATTTTTGGCAAGTTTGCTACTTATAATTTAACGGTCACTGAGGC CATGAAAGAATATTTAGTCAACAGTTTTGGCTTGAATCCAAAAAGGTGTGTGGTGCTATA TGATAGACCTGCGACACAATTTAAACCACTCACTGAAAGTGAATCCCGCACAAAGTTACT AGACTCTGAGTTCATCAGAGATATGATCCCTGAGGGATTCAATGTAGAGAAGGGAGACAA GATAATAGTCACTTCTACCTCGTTCACTCCAGATGAAGATATTAGTATACTGATTGGTGC GTTGAAAATCTACGACAACTCTTATGAGAATCTTGATAAAAGCCTTCCAAAGATTCTCTG TTTTGTTACAGGAAAGGGCCCCATGAAGGAAAGATATGTTAAAGATGTAGAAGAGCATGA TTGGCAGCATGTGTATGTTAAGTTTGTCTGGCTGAAATCGGAGGATTACCCTCGCTTGCT CCAGCTGTGTGATTATGGTGTCTCTTTACATAAGTCCAGTTCTGGCCTCGACTTACCTAT GAAAATACTTGATATGTATGGCTCGGGTATACCTGTAATAGCATATAACTACCCTGTGCT TGGAGAATTGGTTAAATACAACGAGAATGGTTTGAAATTCCTGGACAGAAGAGAACTACA CGAGTCATTGATCTTTGCCATGAAGGACCCAGAACTTTACAAGAAATTGAAGCAGGGGGC TTTAAAAGAGTCCCAAATTAGATGGAACTCGAGCTGGCAGTCTGCCATGCAAGAATTAAA GCTCGTCGCTTAAAAGAATGACTATCCCTCATAAACATTTTAGCATCACAAGGTACCAAA CATATATTTCCTAACAATTGTATATACTCAGCCACGTAAATAGCACCTTATAATCATATG ACAGTAACAATACCCCGCAACGAACTCAAAGCTAAGCTCTCACCACTTTACCGACTAGAT ACAGTTACTGAAATCAGCAATTAGTAAACACTGGGGAAAGTCATTGGGAACGGATCTCTG ATACAGAAACTGCCTGAGAGAGAATTGACTGTTTCCTAATACAAGGAATTTACCCAGCTT TTCTTTGGCTGCC
Coding sequence
>CAGL0K12342g.cds ATGTCTTGGATACAGATACCATGGAGTTGGGTTGTGACCCTAATAGTGACATATTTGTCT CTGCCATTAATTATTTACTATTTGGTGCCATATATCTTTTATGGGAATAAGTCTTCAAAA AAGAGGATCATCATCTATGTGCTTGGAGATATAGGCCATTCACCAAGAATGTGCTACCAT GCCAGAAGTTTTAGTGAGAAAGGCTGGCAAGTTGAGTTATGTGGATATGTGGAAGAGCAA GTTCCTGGTTTTATTGCAGAAGATCCCAATATTATTGTCCACGCTCTTCCAACGCTGACG CTACAAGGAAACAAGAGATCAATAATATTTTTGGTAAAGAAAGTATTGTTCCAAGTTTCT GCTATCATTGCTCAATTATGGGAGCTTAGGGGCAGTAATTATATGTTAATTCAGAACCCA CCATCGATTCCGATTTTACCAATTGCTGTTTTCTATCGTTTGAGTGGTTGTAAACTTATC ATTGATTGGCACAACTTGGCATATTCCATTATGCAATTGAAATTTAATGGAAACTTTTAT CACCCCGTTGTACTTGCCTCATATGTCATAGAGTATATTTTTGGCAAGTTTGCTACTTAT AATTTAACGGTCACTGAGGCCATGAAAGAATATTTAGTCAACAGTTTTGGCTTGAATCCA AAAAGGTGTGTGGTGCTATATGATAGACCTGCGACACAATTTAAACCACTCACTGAAAGT GAATCCCGCACAAAGTTACTAGACTCTGAGTTCATCAGAGATATGATCCCTGAGGGATTC AATGTAGAGAAGGGAGACAAGATAATAGTCACTTCTACCTCGTTCACTCCAGATGAAGAT ATTAGTATACTGATTGGTGCGTTGAAAATCTACGACAACTCTTATGAGAATCTTGATAAA AGCCTTCCAAAGATTCTCTGTTTTGTTACAGGAAAGGGCCCCATGAAGGAAAGATATGTT AAAGATGTAGAAGAGCATGATTGGCAGCATGTGTATGTTAAGTTTGTCTGGCTGAAATCG GAGGATTACCCTCGCTTGCTCCAGCTGTGTGATTATGGTGTCTCTTTACATAAGTCCAGT TCTGGCCTCGACTTACCTATGAAAATACTTGATATGTATGGCTCGGGTATACCTGTAATA GCATATAACTACCCTGTGCTTGGAGAATTGGTTAAATACAACGAGAATGGTTTGAAATTC CTGGACAGAAGAGAACTACACGAGTCATTGATCTTTGCCATGAAGGACCCAGAACTTTAC AAGAAATTGAAGCAGGGGGCTTTAAAAGAGTCCCAAATTAGATGGAACTCGAGCTGGCAG TCTGCCATGCAAGAATTAAAGCTCGTCGCTTAA
Predicted translation product
>CAGL0K12342g.aa MSWIQIPWSWVVTLIVTYLSLPLIIYYLVPYIFYGNKSSKKRIIIYVLGDIGHSPRMCYH ARSFSEKGWQVELCGYVEEQVPGFIAEDPNIIVHALPTLTLQGNKRSIIFLVKKVLFQVS AIIAQLWELRGSNYMLIQNPPSIPILPIAVFYRLSGCKLIIDWHNLAYSIMQLKFNGNFY HPVVLASYVIEYIFGKFATYNLTVTEAMKEYLVNSFGLNPKRCVVLYDRPATQFKPLTES ESRTKLLDSEFIRDMIPEGFNVEKGDKIIVTSTSFTPDEDISILIGALKIYDNSYENLDK SLPKILCFVTGKGPMKERYVKDVEEHDWQHVYVKFVWLKSEDYPRLLQLCDYGVSLHKSS SGLDLPMKILDMYGSGIPVIAYNYPVLGELVKYNENGLKFLDRRELHESLIFAMKDPELY KKLKQGALKESQIRWNSSWQSAMQELKLVA*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0K12342p