CAGL0M13849g


highly similar to uniprot|P22146 Saccharomyces cerevisiae YMR307w GAS1 Beta-1,3-glucanosyltransferase, required for cell wall assembly

Genomic environment map

Element type: CDS
Element length: 1698 nucleotides,
on sense strand of
Cagl0M: 1368269..1369966.
Other names:
CAGL-CDS1693.1
CAGL-IPF6049
Coding sequence: 566 codons.
Database cross references:
EMBL: AJ302062
EMBL: CR380959
GeneID: 2891237
HOGENOM: Q8X0Z6

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R0042 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0M13849p  


highly similar to uniprot|P22146 Saccharomyces cerevisiae YMR307w GAS1 glycophospholipid-anchored surface glycoprotein

Protein domain map

Protein length: 565 amino acids
Protein family: GL3R0042
Database cross references:
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0M13849g
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_449946.1
SMART: SM00768
UniProtKB/TrEMBL: Q8X0Z6
UniprotKB: Q8X0Z6_CANGA

Computed results for CAGL0M13849p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>CAGL0M13849g.nt
GCTATTTTGCCGGTGACTTTGTATTAGGACATTTTATACTGTTCTTACAAGCTCCGATAA
TATTAATTCCCTTCATTGACTATTGGCATTCAATGACATTGTTCTGGTTACGACCTACAA
ACTTAATTAGAGGGAGGAGGATATTTGGAAGAAAGCAATCTCGACGTAGAAGGAATACTG
TTATAAGATATTTTTTCTTTTACTTTCTCACATTAGCGATTTTTATTGGTATCTTACTAA
CTCCATATTTTGCTCATAAATTTGATATCCAAGCAGATGATTTGTTGATAGGAACTATTC
TTGACGGTATAATTCAGCCAAATAATCAAAACAATAATGACACTGGAGCAAGAGCACCAC
CAAACATTATGAGGAGTACGCCTAAAGCGAAACCTGTTAAAACTGTTTCCTAATAATGAG
TGATGGATCCGTTTAAAGTTGAAGAATTGAAATCACCGAAACTCAATAAAAATTTTGTTT
CCGGACATCCAATAATGATAACTTCGAATAATAACTTTGAATGTTATAGAGTGGCCGCTA
TTATGTGAAATGCATACGTGCGATAACAGATAAATTTATTGATTGTCTCTAGTCACCCGT
GCACAATTGAGCTTTGTAATAGGCTGTGAAAATTAGAGTAGATAAGCAAGTCTGCTTCAT
GGCAATCCGTTTCTACGTGGATCAACTATTTCAGAATCCTTTATTTCCCGCAACAATTAC
TCGAAAATCACCTACGGTTAAATAACGGTAACTTATTTGAAACCCTGACCTTGTTTATTT
TTCGTGGGACAAATCCCCTAGAGGGAAAAAATTTTCACTTTTCGCTCGCACTGTATTTGT
TTACATATCCAATTTTTATTCTCCTTTTAATCTTTACCATTAGTATTTAAGTAATCAGCA
AAATTGATATTTACTTGATAGAGATCTTTTGCTTTCCTGGAGGCGGCGTAAATCACCAGA
ATTGAAGTGTAATTAACTAATAAGACTGCTAATTTGCAATATGTTTTTCAAAAACACTTT
AGCAGCATTAACTGCTGCCAGTGCTTTATTTAGTACTGTAAAGGCTGATGATCTACCACC
TATTGAAATTGTTGGTAACAAGTTTTTTTTCTCTAACAATGGTTCTCAGTTTTACATGAG
AGGTATTGCCTACCAAGCTGATACTGCAAATGCTACTTCAGGTGCCACTATTAATGATCC
TTTAGCAGACTTTTCATCATGTTCAAGAGATATTCCATATTTGCAACAGCTTGCCACAAA
TGTTATTCGTGTCTACGCTGTCAATACATCTTTAGACCATGATGAGTGTATGAAGGCCTT
AAATGATGCTGGTATCTATGTCATTGCTGACCTTTCTGCACCAAAGACATCAGTTAACAG
AGACAGTCCATCATGGGATCTTGAACTATATGAACGTTACACTTCTGTTGTCGATATGTT
TGCCAACTACTCAAATGTTTTGGGTTTCTTTGCAGGTAACGAGGTTACTAATAACTCTAC
CAACACAGATGCTTCTGCATTTGTTAAGGCAGCTGTTAGAGACACCAAGCAATATATTAA
ATCGAAAGGTTACAGAAAGATCCCTGTTGGTTATTCTTCTAACGATGATGCTGATACTAG
AGTTTCAATTGCTGACTACTTTGCATGTGGTGACGAAGACCAAAGGGCTGATTTTTACGG
TATTAATATGTATGAATGGTGTGGTAACTCCAACTTACAAAAATCTGGTTATGCTGACAG
AACTAAGGAATTTTCAAATCTGTCAATTCCACTTTTCTTCTCTGAATATGGTTGTAATGA
AGTCACTCCAAGACTTTTCACTGAAGTTCAAGCTCTATTTGGAGATCAAATGACTGATGT
GTGGTCCGGTGGTATTGTATACCTTTACTTTGAAGAAGAGAACCATTACGGTTTGGTCAG
TATTGATGGCAATGATGTGAAGACATTGGATGACTTCAACAACTACTCTAAACAAATTCA
TAGCATAAGTCCATCTTCCGCTAACACTGCATCTTACTCAGCCTCTTCTACTTCATTGTC
CTGCCCAACTTCTAACAGCTACTGGAAGGCTTCTAGTAATCTTCCTCCAACACCAGATAA
GGATTTGTGTCTATGTATGGATGATGCTAACAGCTGCATTGTTGCTGACAAGGTTGATGA
AGATGACTATAAGGACCTATACGGTTATGTTTGTGGTGAAATTGACTGTAGTGGTATTAC
CGGTAATGGTACGACTGGTAAGTATGGTTCTTACTCCTTCTGTTCTCCTAAGGAAAAGCT
GAACTTCGTCTTGAATTTGTACTATCAATCTAAGGGTGGTTCAAAGTCTGACTGTGACTT
CAGCGGTTCTGCTTCTGTCAGATCCGCCACTACCCATGCTGGTTGTGCTTCCGCCTTGAA
GGAAATTGGTAGTGTCGGTACGAACTCTGCTACTGACTCTGCCACATACTCAGGATCTGG
TACTGGATCCATGTCTACTTCAAAGGCTTCTGCCAGTGGCTCATCCAAGGGCTCCTCCTC
TGCCAAGTCTGGTTCTGCAAGTGGTTCTTCGTCTTCCTCTTCTCGTTCTGCCACTTCCTC
TTCCAAAAGCAACAAGAAGAATGCCGGTGTCAACTTGAAGACTGACTTATTCCAAGTTAT
TGCTACATCTGTCATTTCAATTTCCATGCTTGCTGGTCTTGGATTTGTTCTTGCTTAATT
TCGGTTATAGAATGAACAGTTATTTTTATGTATGATTAAATTTTTTTCTATAATGGGATC
TTGATTATCCTTAATTTTTTATATGTTCTTATGTATATGCCTTTGTTTATTTTTTAGATT
ATTACATTTTTATCGAGTTTTTGAACTACATATTTAGAAACACCACGTTACCATATTCCG
CCAACCAAACAGAAATGTGACCAGTTAAAAATTATATGCCAATTCAGTATACATTTATAA
ATTACTGAAGAAAATTTTATGGACTATGTACACATTATGATCTTATATATCATATTCT

Coding sequence    

>CAGL0M13849g.cds
ATGTTTTTCAAAAACACTTTAGCAGCATTAACTGCTGCCAGTGCTTTATTTAGTACTGTA
AAGGCTGATGATCTACCACCTATTGAAATTGTTGGTAACAAGTTTTTTTTCTCTAACAAT
GGTTCTCAGTTTTACATGAGAGGTATTGCCTACCAAGCTGATACTGCAAATGCTACTTCA
GGTGCCACTATTAATGATCCTTTAGCAGACTTTTCATCATGTTCAAGAGATATTCCATAT
TTGCAACAGCTTGCCACAAATGTTATTCGTGTCTACGCTGTCAATACATCTTTAGACCAT
GATGAGTGTATGAAGGCCTTAAATGATGCTGGTATCTATGTCATTGCTGACCTTTCTGCA
CCAAAGACATCAGTTAACAGAGACAGTCCATCATGGGATCTTGAACTATATGAACGTTAC
ACTTCTGTTGTCGATATGTTTGCCAACTACTCAAATGTTTTGGGTTTCTTTGCAGGTAAC
GAGGTTACTAATAACTCTACCAACACAGATGCTTCTGCATTTGTTAAGGCAGCTGTTAGA
GACACCAAGCAATATATTAAATCGAAAGGTTACAGAAAGATCCCTGTTGGTTATTCTTCT
AACGATGATGCTGATACTAGAGTTTCAATTGCTGACTACTTTGCATGTGGTGACGAAGAC
CAAAGGGCTGATTTTTACGGTATTAATATGTATGAATGGTGTGGTAACTCCAACTTACAA
AAATCTGGTTATGCTGACAGAACTAAGGAATTTTCAAATCTGTCAATTCCACTTTTCTTC
TCTGAATATGGTTGTAATGAAGTCACTCCAAGACTTTTCACTGAAGTTCAAGCTCTATTT
GGAGATCAAATGACTGATGTGTGGTCCGGTGGTATTGTATACCTTTACTTTGAAGAAGAG
AACCATTACGGTTTGGTCAGTATTGATGGCAATGATGTGAAGACATTGGATGACTTCAAC
AACTACTCTAAACAAATTCATAGCATAAGTCCATCTTCCGCTAACACTGCATCTTACTCA
GCCTCTTCTACTTCATTGTCCTGCCCAACTTCTAACAGCTACTGGAAGGCTTCTAGTAAT
CTTCCTCCAACACCAGATAAGGATTTGTGTCTATGTATGGATGATGCTAACAGCTGCATT
GTTGCTGACAAGGTTGATGAAGATGACTATAAGGACCTATACGGTTATGTTTGTGGTGAA
ATTGACTGTAGTGGTATTACCGGTAATGGTACGACTGGTAAGTATGGTTCTTACTCCTTC
TGTTCTCCTAAGGAAAAGCTGAACTTCGTCTTGAATTTGTACTATCAATCTAAGGGTGGT
TCAAAGTCTGACTGTGACTTCAGCGGTTCTGCTTCTGTCAGATCCGCCACTACCCATGCT
GGTTGTGCTTCCGCCTTGAAGGAAATTGGTAGTGTCGGTACGAACTCTGCTACTGACTCT
GCCACATACTCAGGATCTGGTACTGGATCCATGTCTACTTCAAAGGCTTCTGCCAGTGGC
TCATCCAAGGGCTCCTCCTCTGCCAAGTCTGGTTCTGCAAGTGGTTCTTCGTCTTCCTCT
TCTCGTTCTGCCACTTCCTCTTCCAAAAGCAACAAGAAGAATGCCGGTGTCAACTTGAAG
ACTGACTTATTCCAAGTTATTGCTACATCTGTCATTTCAATTTCCATGCTTGCTGGTCTT
GGATTTGTTCTTGCTTAA

Predicted translation product    

>CAGL0M13849g.aa
MFFKNTLAALTAASALFSTVKADDLPPIEIVGNKFFFSNNGSQFYMRGIAYQADTANATS
GATINDPLADFSSCSRDIPYLQQLATNVIRVYAVNTSLDHDECMKALNDAGIYVIADLSA
PKTSVNRDSPSWDLELYERYTSVVDMFANYSNVLGFFAGNEVTNNSTNTDASAFVKAAVR
DTKQYIKSKGYRKIPVGYSSNDDADTRVSIADYFACGDEDQRADFYGINMYEWCGNSNLQ
KSGYADRTKEFSNLSIPLFFSEYGCNEVTPRLFTEVQALFGDQMTDVWSGGIVYLYFEEE
NHYGLVSIDGNDVKTLDDFNNYSKQIHSISPSSANTASYSASSTSLSCPTSNSYWKASSN
LPPTPDKDLCLCMDDANSCIVADKVDEDDYKDLYGYVCGEIDCSGITGNGTTGKYGSYSF
CSPKEKLNFVLNLYYQSKGGSKSDCDFSGSASVRSATTHAGCASALKEIGSVGTNSATDS
ATYSGSGTGSMSTSKASASGSSKGSSSAKSGSASGSSSSSSRSATSSSKSNKKNAGVNLK
TDLFQVIATSVISISMLAGLGFVLA*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites