CAGL0M03377g


highly similar to uniprot|P32775 Saccharomyces cerevisiae YEL011w GLC3 Glycogen branching enzyme, involved in glycogen accumulation

Genomic environment map

Element type: CDS
Element length: 2121 nucleotides, on anti-sense strand of Cagl0M: complement(385581..387701).
Other names:
CAGL-CDS1071.1
CAGL-IPF6818
Coding sequence: 707 codons.
Database cross references:
EMBL: CR380959
GeneID: 2891502
HOGENOM: Q6FJV0

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R2291 GL3R2291.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0M03377p  


highly similar to uniprot|P32775 Saccharomyces cerevisiae YEL011w GLC3

Protein domain map

Protein length: 706 amino acids
Protein family: GL3R2291
Database cross references:
Gene3D: G3DSA:2.60.40.10
Gene3D: G3DSA:2.60.40.1180
Gene3D: G3DSA:3.20.20.80
InterPro: IPR004193
InterPro: IPR006047
InterPro: IPR006048
InterPro: IPR013780
InterPro: IPR013781
InterPro: IPR013783
KEGG: cgr:CAGL0M03377g
Pfam: PF00128
Pfam: PF02806
Pfam: PF02922
RefSeq: XP_449494.1
UniProtKB/Swiss-Prot: Q6FJV0
UniProtKB: GLGB_CANGA

Computed results for CAGL0M03377p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>CAGL0M03377g.nt
GTGTACATAATGGAAACCTCGAAAGGAGGTATATGATCGTATCCCAGGTTACTATTACAT
TTTGAAAGCTAAGGTCTTTTCAACCGTAGTAAATATCTTTTTACTAAACGGTACCATACG
ATACGATACCATTACCCATACCCTTTTCCTACAAGCTCCCTTCGTTGCTTTAATTGTGTA
GGCAAGATATAAGCAACCAGTCAAGTACTGTCAGAACTACAATATTGCTTTTCTTTTATT
CCTTCTTCATCCTCTGAGCACTCAAGTGAGAGAGCTCCACCCCACGTAGCAATCCTAACC
CCCCTTCTCGCCCCCCCTCCATTCCATAAAATAATCAACAGATTGTAGGCAGTAAGTTAA
TCATCCTTGTTTCTCCCCCCCCCACCCTAACCCTCGACAGGATCCACTACCAGCCTTTTA
GCCTACGATTGTTTTCCTTTTTTGACAAGCTTGTTAATTTCCTAATCAGAAGAGACAAGA
CAAACAACCAAAATCAGAGCCCAAAACTAATCCCAGTACAAAAATTGGGACTTCGTTTGT
ACTAAGTTGCAGGAAGCTACAGACACCACCCCTTTCACACGTACTAGCCCCTCTCCTTTG
GACTAATGATTCAATTTCTTACCCTTTTTGATGCCCCATTCATCTTTTACCTGTGAAGTC
TAGTCAGTTTTGGCATCAGTCTTGTTTGTTTTTAGCAATTGACTAGTTGAACCGTCCCTC
TGCTGCGATACCCCTAGCCCCTTGCTAGTATTCAACACTTGTATGTATCTGTTCTTTTCG
ACTATTTAAACAGACCACATAGCTATCCTTGTAACAATTTGACTTTTGTTCTGTGCTGTT
CACTTTGACAAAACACTTGTAAGCACTACAGTTGGTAAACTTTCTATTCGAATATAACTA
GTTGTATAATACCATCATCAAAAACAATCACAATCGAATAATTTCGATATTACAATTACG
AACTAGATAATTCTTCCCACCCAGAAACTAGAATAATACAATGAGTTTGACTAAGATTCC
TGAAAATGTCCAGGGAGCAGTTTCCATTGACCCATGGTTAGAACCATTTGCTGATGTTCT
TTCTGAGCGAAGGTACCTTGCTGATAAATGGCTATATGACATCAAGCACGCCACACCTGA
TGGCTCTGAGCAAAGTTTGGTTGATTTCGCAAGAAACGCATATAAGACCTATGGTCTGCA
CGCCAACCAGCAAACTAAAGAAATTGTTTACAGAGAATGGGCTCCAAATGCTCAACGTGC
TTTCCTAGTCGGTGAGTTCAACAACTGGAACGAAGAGTCTCATGAAATGAAGCATAAGGA
TGAATTTGGTGTATTTAGCATCACTCTAGCACCTTTGGAGAACGGCGATTTTGCCATCCC
TCATGATTCCAAGATCAAAGTTATGTTTGTCTTGCCAGATGGCTCAAAAGTTTACAGAAT
TCCAGCATGGATTACAAGAGCCACTCAACCAAGCAAAGAAACCGCTCAGAAATACGGTCC
AACATATGAGGGTAGATTCTGGAACCCACCTAACTCTTACCAGTTTAAACACCAAAGACC
TAAGTTCAATTTGGCTAACGACTCAATTAAAATTTATGAAGCTCACATTGGTATCTCTTC
TCCAGAGCCAAAAGTTGCCTCCTACAAAGAATTCACTCAAAACGTTCTACCTAGAATTAA
GCATCTAGGTTACGATGCTATCCAATTGATGGCTATTATGGAGCACGCTTACTATGCTTC
GTTTGGTTACCAAGTTACCAACTTCTTTGCTATCAGTTCTCGTTATGGTACCCCAGAAGA
TTTGAAAGAGTTAATCGATACTGCCCATAGCATGGGTATCTTGGTTCTACTAGATGTCAT
TCACAGTCACGCATCTAAGAATAGTGAAGATGGTTTGAACATGTTCGATGGTTCTGACCA
CCAATACTTCCATTCCTTGACTTCTGGTAGAGGTGAGCATCCATTATGGGATTCCCGTTT
ATTCAATTATGGTAGCTTTGAAGTGCAAAGATTCTTGCTTGCCAACTTGGCTTACTACAT
TGACGTTTACCAATTCGATGGTTTCAGATTTGATGGTGTCACCTCTATGCTTTACTTACA
CCATGGTGTCGGTGCTGGTGGTGCTTTCAGCGGTGACTACAACGAGTATTTGTCCCGTGA
CAGATCTGGTGTTGACCATGAAGCCCTGGCTTACTTGATGTTGGCAAATGACCTAGTCCA
TGACTTGCTACCGGAATCCGCTGTTACTATTGCTGAAGATGTTTCTGGTTATCCAACTCT
GTGTCTACCAAGAACTGCAGGCGGTGGTGGTTTTGACTACAGATTGGCTATGGCTCTCCC
AGATATGTGGATCAAGCTTCTAAAGACCAAACAAGATGATGACTGGGATATGGGTCACAT
TGTTCACACATTGACTAACAGACGTCATGGTGAAAAGGTTGTTGCCTACTGTGAATCCCA
CGATCAAGCTCTAGTCGGTGACAAGACTCTTGCCTTCTGGCTAATGGATGCTGCCATGTA
CACCGACATGACCGTTTTGAAGGAGCCAACATTGGTTATTGACCGTGGTATTGCTCTACA
CAAAATGATTAGATTAATCACCCACTCTCTAGGTGGTGAGGCTTATCTAAACTTCGAGGG
TAACGAGTTCGGCCATCCTGAATGGTTAGACTTCCCAAGAGTTGGTAACAATGACAGTTA
CCATTATGCAAGAAGACAATTCAACTTGGTTGATGATGATTTATTGCGTTACAGGCATTT
GAACGAGTTTGATGCTGCAATGCAAAACTGTGAATCCAAACACCAATGGTTGAATACTCC
TCAAGCATATGTCTCTTTGAAGCACGAGGTTGACAAAGTTATCGCCTTCGAAAGAAATGG
CCATTTGTTTGTTTTCAACTTCCATCCAACTCAAAGTTTCACTGACTACAGAATTGGTGT
CGATGTTGCTGGTACTTACAAAATTGTACTAAATACTGATAGAGCAGAATTTGGTGGCCA
CAATAGAATTGATGAGGCTCAAGAATTTTTCACCACTGACTTGGAATGGAACAACAGAAG
GAACTTCATCCAAGTATACATTCCAAGCAGGACTGCTATCGTTTTGACCCGTCAAATGTA
GAGTGTTTACTCACTAACAGTACATGCTCTTTTTACATCTTTAATTATGCCCATCATTTA
TGTGTCATGATAGAAACGAACAAAGCTAGACTTTTCATGTTTGTGTCACAATACAGAAGA
TAGAAATTATATAATTTGATAAAACTATAAATTATCTAAAAAATAAACTAAAATAATCAT
GTTTTTTAAGAATAATAATCTCAGTACTGAGCTTTGAATATATTAATACTTTGTTCTATA
AATTATCTTCCGATGAACTAGTGCCTGCACTAAGTACCTTGTTTGTATTATATTATATTA
T

Coding sequence    

>CAGL0M03377g.cds
ATGAGTTTGACTAAGATTCCTGAAAATGTCCAGGGAGCAGTTTCCATTGACCCATGGTTA
GAACCATTTGCTGATGTTCTTTCTGAGCGAAGGTACCTTGCTGATAAATGGCTATATGAC
ATCAAGCACGCCACACCTGATGGCTCTGAGCAAAGTTTGGTTGATTTCGCAAGAAACGCA
TATAAGACCTATGGTCTGCACGCCAACCAGCAAACTAAAGAAATTGTTTACAGAGAATGG
GCTCCAAATGCTCAACGTGCTTTCCTAGTCGGTGAGTTCAACAACTGGAACGAAGAGTCT
CATGAAATGAAGCATAAGGATGAATTTGGTGTATTTAGCATCACTCTAGCACCTTTGGAG
AACGGCGATTTTGCCATCCCTCATGATTCCAAGATCAAAGTTATGTTTGTCTTGCCAGAT
GGCTCAAAAGTTTACAGAATTCCAGCATGGATTACAAGAGCCACTCAACCAAGCAAAGAA
ACCGCTCAGAAATACGGTCCAACATATGAGGGTAGATTCTGGAACCCACCTAACTCTTAC
CAGTTTAAACACCAAAGACCTAAGTTCAATTTGGCTAACGACTCAATTAAAATTTATGAA
GCTCACATTGGTATCTCTTCTCCAGAGCCAAAAGTTGCCTCCTACAAAGAATTCACTCAA
AACGTTCTACCTAGAATTAAGCATCTAGGTTACGATGCTATCCAATTGATGGCTATTATG
GAGCACGCTTACTATGCTTCGTTTGGTTACCAAGTTACCAACTTCTTTGCTATCAGTTCT
CGTTATGGTACCCCAGAAGATTTGAAAGAGTTAATCGATACTGCCCATAGCATGGGTATC
TTGGTTCTACTAGATGTCATTCACAGTCACGCATCTAAGAATAGTGAAGATGGTTTGAAC
ATGTTCGATGGTTCTGACCACCAATACTTCCATTCCTTGACTTCTGGTAGAGGTGAGCAT
CCATTATGGGATTCCCGTTTATTCAATTATGGTAGCTTTGAAGTGCAAAGATTCTTGCTT
GCCAACTTGGCTTACTACATTGACGTTTACCAATTCGATGGTTTCAGATTTGATGGTGTC
ACCTCTATGCTTTACTTACACCATGGTGTCGGTGCTGGTGGTGCTTTCAGCGGTGACTAC
AACGAGTATTTGTCCCGTGACAGATCTGGTGTTGACCATGAAGCCCTGGCTTACTTGATG
TTGGCAAATGACCTAGTCCATGACTTGCTACCGGAATCCGCTGTTACTATTGCTGAAGAT
GTTTCTGGTTATCCAACTCTGTGTCTACCAAGAACTGCAGGCGGTGGTGGTTTTGACTAC
AGATTGGCTATGGCTCTCCCAGATATGTGGATCAAGCTTCTAAAGACCAAACAAGATGAT
GACTGGGATATGGGTCACATTGTTCACACATTGACTAACAGACGTCATGGTGAAAAGGTT
GTTGCCTACTGTGAATCCCACGATCAAGCTCTAGTCGGTGACAAGACTCTTGCCTTCTGG
CTAATGGATGCTGCCATGTACACCGACATGACCGTTTTGAAGGAGCCAACATTGGTTATT
GACCGTGGTATTGCTCTACACAAAATGATTAGATTAATCACCCACTCTCTAGGTGGTGAG
GCTTATCTAAACTTCGAGGGTAACGAGTTCGGCCATCCTGAATGGTTAGACTTCCCAAGA
GTTGGTAACAATGACAGTTACCATTATGCAAGAAGACAATTCAACTTGGTTGATGATGAT
TTATTGCGTTACAGGCATTTGAACGAGTTTGATGCTGCAATGCAAAACTGTGAATCCAAA
CACCAATGGTTGAATACTCCTCAAGCATATGTCTCTTTGAAGCACGAGGTTGACAAAGTT
ATCGCCTTCGAAAGAAATGGCCATTTGTTTGTTTTCAACTTCCATCCAACTCAAAGTTTC
ACTGACTACAGAATTGGTGTCGATGTTGCTGGTACTTACAAAATTGTACTAAATACTGAT
AGAGCAGAATTTGGTGGCCACAATAGAATTGATGAGGCTCAAGAATTTTTCACCACTGAC
TTGGAATGGAACAACAGAAGGAACTTCATCCAAGTATACATTCCAAGCAGGACTGCTATC
GTTTTGACCCGTCAAATGTAG

Predicted translation product    

>CAGL0M03377g.aa
MSLTKIPENVQGAVSIDPWLEPFADVLSERRYLADKWLYDIKHATPDGSEQSLVDFARNA
YKTYGLHANQQTKEIVYREWAPNAQRAFLVGEFNNWNEESHEMKHKDEFGVFSITLAPLE
NGDFAIPHDSKIKVMFVLPDGSKVYRIPAWITRATQPSKETAQKYGPTYEGRFWNPPNSY
QFKHQRPKFNLANDSIKIYEAHIGISSPEPKVASYKEFTQNVLPRIKHLGYDAIQLMAIM
EHAYYASFGYQVTNFFAISSRYGTPEDLKELIDTAHSMGILVLLDVIHSHASKNSEDGLN
MFDGSDHQYFHSLTSGRGEHPLWDSRLFNYGSFEVQRFLLANLAYYIDVYQFDGFRFDGV
TSMLYLHHGVGAGGAFSGDYNEYLSRDRSGVDHEALAYLMLANDLVHDLLPESAVTIAED
VSGYPTLCLPRTAGGGGFDYRLAMALPDMWIKLLKTKQDDDWDMGHIVHTLTNRRHGEKV
VAYCESHDQALVGDKTLAFWLMDAAMYTDMTVLKEPTLVIDRGIALHKMIRLITHSLGGE
AYLNFEGNEFGHPEWLDFPRVGNNDSYHYARRQFNLVDDDLLRYRHLNEFDAAMQNCESK
HQWLNTPQAYVSLKHEVDKVIAFERNGHLFVFNFHPTQSFTDYRIGVDVAGTYKIVLNTD
RAEFGGHNRIDEAQEFFTTDLEWNNRRNFIQVYIPSRTAIVLTRQM*
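
As a quick consistency check, the coding sequence above can be translated codon by codon and compared with the predicted product. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; the name `translate` is illustrative and not part of any Genolevures tool.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# listed in TCAG order with the first codon base varying slowest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of CAGL0M03377g.cds
print(translate("ATGAGTTTGACTAAGATTCCT"))  # MSLTKIP
print(translate("CGTCAAATGTAG"))           # RQM*
```

Both fragments agree with the start (MSLTKIP) and end (RQM*) of the predicted translation product shown above.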




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
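
The arithmetic behind this note can be checked against the coordinates of the element. The following is a minimal sketch; the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates as in the `complement(385581..387701)` location given for this element.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (nucleotides, codons, amino acids) for an intron-less CDS.

    Coordinates are assumed to be 1-based and inclusive. The codon count
    includes the stop codon, so the protein is one residue shorter.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nucleotides // 3
    return nucleotides, codons, codons - 1


print(cds_lengths(385581, 387701))  # (2121, 707, 706)
```

The result matches the values reported on this page: 2121 nucleotides, 707 codons, and a protein of 706 amino acids.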

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

GeneMark image and list
The GeneMark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
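
The window described here can be taken apart with simple slicing. The sketch below is illustrative only: `split_flanked` is a hypothetical helper, the flank sizes are the 1000 and 300 nucleotides quoted above, and it does not attempt to reproduce GeneMark's own frame numbering.

```python
def split_flanked(seq: str, upstream: int = 1000, downstream: int = 300):
    """Split a flanked sequence into (upstream, ORF, downstream) parts.

    Assumes the sequence was extracted with exactly `upstream` nucleotides
    before the start codon and `downstream` nucleotides after the stop codon.
    """
    if len(seq) < upstream + downstream + 3:
        raise ValueError("sequence is too short for the requested flanks")
    orf_end = len(seq) - downstream
    return seq[:upstream], seq[upstream:orf_end], seq[orf_end:]


# Synthetic stand-in with the same layout as this record:
# 1000 nt upstream + 2121 nt ORF + 300 nt downstream = 3421 nt.
demo = "a" * 1000 + "ATG" + "C" * 2115 + "TAG" + "t" * 300
up, orf, down = split_flanked(demo)
print(len(up), len(orf), len(down))  # 1000 2121 300
print(orf[:3], orf[-3:])             # ATG TAG
```

Applied to the nucleotide sequence of this entry, the middle slice should be identical to the coding sequence, provided the flanks have the stated sizes.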

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites