CAGL0K12342g


highly similar to uniprot|P16661 Saccharomyces cerevisiae YBR110w ALG1 Mannosyltransferase, involved in asparagine-linked glycosylation in the endoplasmic reticulum (ER)

Genomic environment map

Element type: CDS
Element length: 1353 nucleotides,
on sense strand of
Cagl0K: 1204665..1206017.
Other names:
CAGL-CDS2400.1
CAGL-IPF4645
Coding sequence: 451 codons.
Database cross references:
EMBL: CR380957
GeneID: 2889981
HOGENOM: Q6FLZ2

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R2097 GL3R2097.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0K12342p  


highly similar to uniprot|P16661 Saccharomyces cerevisiae YBR110w ALG1 beta-mannosyltransferase

Protein domain map

Protein length: 450 amino acids
Protein family: GL3R2097
Database cross references:
InterPro: IPR001296
KEGG: cgr:CAGL0K12342g
Pfam: PF00534
RefSeq: XP_448752.1
UniProtKB/Swiss-Prot: Q6FLZ2
UniprotKB: ALG1_CANGA

Computed results for CAGL0K12342p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005789 endoplasmic reticulum membrane

Sequence data  

Nucleotide sequence

>CAGL0K12342g.nt
AACCACATAATAAATCTCTAGAAGAGAGTCATACTCCTTCAAATAAGTTATCTGAGAAAC
CGAAACCTCCGAAGAAACCAGAACAACTGAAGGATTTGAATTTATCTACCCAACAAGCTG
ATAAGAACATGAAGAATAAGGATCAGTTGTTTGATAACAAAAATGAGTTATTGAGCACCA
TAAAAAATAAAAAAAGGCCGGCGCCTATACCCAAACCCAAACCTAAAAGTTTAACTTCAG
AAGGTAACCATATGAATTTGAACACTGAAAAGGGAAAAGAAACTACAATAGAAAAACCTG
ACGAGTCAAAATTTTTACCAATTTCGAGTTTTCCACCTCCACCAAAGCCATTTAAACGTG
AAGAGCTCTCAAAGGAGGTAGTAGATAGTGTAGGAGAAACTGCGGATTTTACAAAGCCAA
AAAGAGCTGGACAGTTAGAATCAACTCAGACCGAGAAATCAAATTCAAAAGGAAAAGCAC
CACCACCTGTTCCAAAGAAGCGTAATGCACAGTCAAAATCCAGCCCTAGCCTAGAAGGGA
GTGAGGATAACCCATTTTCAAAGTATTTAAAGGACGCAGTTCCTAACGAGCCTGATCGCT
TACATAAATAATAATGCCCCAATTAAAAATACAATCAATATGAGCAACGAATATCTTACG
TATCTTCGAAAGATCTAACTCATTTTATATAATAAAAACACTAATGAAAACTGGATAAGA
ATTACTACTATATCTATTATTTAAGTGGTTTATATCATCTCGTCTCTTTTGGCACCTTTT
TTTTTCTTGGTGAACATCAAATTCTATGCGAGGTGCTATCGAGAACAATCGATTTGTTCA
GTTACCAATAGATTAGCATAAGATAATACACTGATCAACTATAAATCTATCATAATATCT
AAGTGTGTTTAGCTGATTCCCTTCTCTTATATTTATCATGGTTATCTAAATATAAAATAT
TATCCAACCTGCAAGAGGTTAGCTTAGCTATAAACGAGTTATGTCTTGGATACAGATACC
ATGGAGTTGGGTTGTGACCCTAATAGTGACATATTTGTCTCTGCCATTAATTATTTACTA
TTTGGTGCCATATATCTTTTATGGGAATAAGTCTTCAAAAAAGAGGATCATCATCTATGT
GCTTGGAGATATAGGCCATTCACCAAGAATGTGCTACCATGCCAGAAGTTTTAGTGAGAA
AGGCTGGCAAGTTGAGTTATGTGGATATGTGGAAGAGCAAGTTCCTGGTTTTATTGCAGA
AGATCCCAATATTATTGTCCACGCTCTTCCAACGCTGACGCTACAAGGAAACAAGAGATC
AATAATATTTTTGGTAAAGAAAGTATTGTTCCAAGTTTCTGCTATCATTGCTCAATTATG
GGAGCTTAGGGGCAGTAATTATATGTTAATTCAGAACCCACCATCGATTCCGATTTTACC
AATTGCTGTTTTCTATCGTTTGAGTGGTTGTAAACTTATCATTGATTGGCACAACTTGGC
ATATTCCATTATGCAATTGAAATTTAATGGAAACTTTTATCACCCCGTTGTACTTGCCTC
ATATGTCATAGAGTATATTTTTGGCAAGTTTGCTACTTATAATTTAACGGTCACTGAGGC
CATGAAAGAATATTTAGTCAACAGTTTTGGCTTGAATCCAAAAAGGTGTGTGGTGCTATA
TGATAGACCTGCGACACAATTTAAACCACTCACTGAAAGTGAATCCCGCACAAAGTTACT
AGACTCTGAGTTCATCAGAGATATGATCCCTGAGGGATTCAATGTAGAGAAGGGAGACAA
GATAATAGTCACTTCTACCTCGTTCACTCCAGATGAAGATATTAGTATACTGATTGGTGC
GTTGAAAATCTACGACAACTCTTATGAGAATCTTGATAAAAGCCTTCCAAAGATTCTCTG
TTTTGTTACAGGAAAGGGCCCCATGAAGGAAAGATATGTTAAAGATGTAGAAGAGCATGA
TTGGCAGCATGTGTATGTTAAGTTTGTCTGGCTGAAATCGGAGGATTACCCTCGCTTGCT
CCAGCTGTGTGATTATGGTGTCTCTTTACATAAGTCCAGTTCTGGCCTCGACTTACCTAT
GAAAATACTTGATATGTATGGCTCGGGTATACCTGTAATAGCATATAACTACCCTGTGCT
TGGAGAATTGGTTAAATACAACGAGAATGGTTTGAAATTCCTGGACAGAAGAGAACTACA
CGAGTCATTGATCTTTGCCATGAAGGACCCAGAACTTTACAAGAAATTGAAGCAGGGGGC
TTTAAAAGAGTCCCAAATTAGATGGAACTCGAGCTGGCAGTCTGCCATGCAAGAATTAAA
GCTCGTCGCTTAAAAGAATGACTATCCCTCATAAACATTTTAGCATCACAAGGTACCAAA
CATATATTTCCTAACAATTGTATATACTCAGCCACGTAAATAGCACCTTATAATCATATG
ACAGTAACAATACCCCGCAACGAACTCAAAGCTAAGCTCTCACCACTTTACCGACTAGAT
ACAGTTACTGAAATCAGCAATTAGTAAACACTGGGGAAAGTCATTGGGAACGGATCTCTG
ATACAGAAACTGCCTGAGAGAGAATTGACTGTTTCCTAATACAAGGAATTTACCCAGCTT
TTCTTTGGCTGCC

Coding sequence

>CAGL0K12342g.cds
ATGTCTTGGATACAGATACCATGGAGTTGGGTTGTGACCCTAATAGTGACATATTTGTCT
CTGCCATTAATTATTTACTATTTGGTGCCATATATCTTTTATGGGAATAAGTCTTCAAAA
AAGAGGATCATCATCTATGTGCTTGGAGATATAGGCCATTCACCAAGAATGTGCTACCAT
GCCAGAAGTTTTAGTGAGAAAGGCTGGCAAGTTGAGTTATGTGGATATGTGGAAGAGCAA
GTTCCTGGTTTTATTGCAGAAGATCCCAATATTATTGTCCACGCTCTTCCAACGCTGACG
CTACAAGGAAACAAGAGATCAATAATATTTTTGGTAAAGAAAGTATTGTTCCAAGTTTCT
GCTATCATTGCTCAATTATGGGAGCTTAGGGGCAGTAATTATATGTTAATTCAGAACCCA
CCATCGATTCCGATTTTACCAATTGCTGTTTTCTATCGTTTGAGTGGTTGTAAACTTATC
ATTGATTGGCACAACTTGGCATATTCCATTATGCAATTGAAATTTAATGGAAACTTTTAT
CACCCCGTTGTACTTGCCTCATATGTCATAGAGTATATTTTTGGCAAGTTTGCTACTTAT
AATTTAACGGTCACTGAGGCCATGAAAGAATATTTAGTCAACAGTTTTGGCTTGAATCCA
AAAAGGTGTGTGGTGCTATATGATAGACCTGCGACACAATTTAAACCACTCACTGAAAGT
GAATCCCGCACAAAGTTACTAGACTCTGAGTTCATCAGAGATATGATCCCTGAGGGATTC
AATGTAGAGAAGGGAGACAAGATAATAGTCACTTCTACCTCGTTCACTCCAGATGAAGAT
ATTAGTATACTGATTGGTGCGTTGAAAATCTACGACAACTCTTATGAGAATCTTGATAAA
AGCCTTCCAAAGATTCTCTGTTTTGTTACAGGAAAGGGCCCCATGAAGGAAAGATATGTT
AAAGATGTAGAAGAGCATGATTGGCAGCATGTGTATGTTAAGTTTGTCTGGCTGAAATCG
GAGGATTACCCTCGCTTGCTCCAGCTGTGTGATTATGGTGTCTCTTTACATAAGTCCAGT
TCTGGCCTCGACTTACCTATGAAAATACTTGATATGTATGGCTCGGGTATACCTGTAATA
GCATATAACTACCCTGTGCTTGGAGAATTGGTTAAATACAACGAGAATGGTTTGAAATTC
CTGGACAGAAGAGAACTACACGAGTCATTGATCTTTGCCATGAAGGACCCAGAACTTTAC
AAGAAATTGAAGCAGGGGGCTTTAAAAGAGTCCCAAATTAGATGGAACTCGAGCTGGCAG
TCTGCCATGCAAGAATTAAAGCTCGTCGCTTAA

Predicted translation product

>CAGL0K12342g.aa
MSWIQIPWSWVVTLIVTYLSLPLIIYYLVPYIFYGNKSSKKRIIIYVLGDIGHSPRMCYH
ARSFSEKGWQVELCGYVEEQVPGFIAEDPNIIVHALPTLTLQGNKRSIIFLVKKVLFQVS
AIIAQLWELRGSNYMLIQNPPSIPILPIAVFYRLSGCKLIIDWHNLAYSIMQLKFNGNFY
HPVVLASYVIEYIFGKFATYNLTVTEAMKEYLVNSFGLNPKRCVVLYDRPATQFKPLTES
ESRTKLLDSEFIRDMIPEGFNVEKGDKIIVTSTSFTPDEDISILIGALKIYDNSYENLDK
SLPKILCFVTGKGPMKERYVKDVEEHDWQHVYVKFVWLKSEDYPRLLQLCDYGVSLHKSS
SGLDLPMKILDMYGSGIPVIAYNYPVLGELVKYNENGLKFLDRRELHESLIFAMKDPELY
KKLKQGALKESQIRWNSSWQSAMQELKLVA*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites