CAGL0M13849g
highly similar to uniprot|P22146 Saccharomyces cerevisiae YMR307w GAS1 Beta-1,3-glucanosyltransferase, required for cell wall assembly
Element type: CDS
Element length: 1698 nucleotides, on sense strand of Cagl0M: 1368269..1369966.
Other names:
CAGL-CDS1693.1
CAGL-IPF6049
Coding sequence: 566 codons.
Database cross references:
EMBL: AJ302062
EMBL: CR380959
GeneID: 2891237
HOGENOM: Q8X0Z6
Homologs and Orthologs
Homologs in protein families: GL3R0042 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0M13849p 
highly similar to uniprot|P22146 Saccharomyces cerevisiae YMR307w GAS1 glycophospholipid-anchored surface glycoprotein
Protein domain map
Database cross references:
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0M13849g
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_449946.1
SMART: SM00768
UniProtKB/TrEMBL: Q8X0Z6
UniprotKB: Q8X0Z6_CANGA
Sequence data 
>CAGL0M13849g.nt GCTATTTTGCCGGTGACTTTGTATTAGGACATTTTATACTGTTCTTACAAGCTCCGATAA TATTAATTCCCTTCATTGACTATTGGCATTCAATGACATTGTTCTGGTTACGACCTACAA ACTTAATTAGAGGGAGGAGGATATTTGGAAGAAAGCAATCTCGACGTAGAAGGAATACTG TTATAAGATATTTTTTCTTTTACTTTCTCACATTAGCGATTTTTATTGGTATCTTACTAA CTCCATATTTTGCTCATAAATTTGATATCCAAGCAGATGATTTGTTGATAGGAACTATTC TTGACGGTATAATTCAGCCAAATAATCAAAACAATAATGACACTGGAGCAAGAGCACCAC CAAACATTATGAGGAGTACGCCTAAAGCGAAACCTGTTAAAACTGTTTCCTAATAATGAG TGATGGATCCGTTTAAAGTTGAAGAATTGAAATCACCGAAACTCAATAAAAATTTTGTTT CCGGACATCCAATAATGATAACTTCGAATAATAACTTTGAATGTTATAGAGTGGCCGCTA TTATGTGAAATGCATACGTGCGATAACAGATAAATTTATTGATTGTCTCTAGTCACCCGT GCACAATTGAGCTTTGTAATAGGCTGTGAAAATTAGAGTAGATAAGCAAGTCTGCTTCAT GGCAATCCGTTTCTACGTGGATCAACTATTTCAGAATCCTTTATTTCCCGCAACAATTAC TCGAAAATCACCTACGGTTAAATAACGGTAACTTATTTGAAACCCTGACCTTGTTTATTT TTCGTGGGACAAATCCCCTAGAGGGAAAAAATTTTCACTTTTCGCTCGCACTGTATTTGT TTACATATCCAATTTTTATTCTCCTTTTAATCTTTACCATTAGTATTTAAGTAATCAGCA AAATTGATATTTACTTGATAGAGATCTTTTGCTTTCCTGGAGGCGGCGTAAATCACCAGA ATTGAAGTGTAATTAACTAATAAGACTGCTAATTTGCAATATGTTTTTCAAAAACACTTT AGCAGCATTAACTGCTGCCAGTGCTTTATTTAGTACTGTAAAGGCTGATGATCTACCACC TATTGAAATTGTTGGTAACAAGTTTTTTTTCTCTAACAATGGTTCTCAGTTTTACATGAG AGGTATTGCCTACCAAGCTGATACTGCAAATGCTACTTCAGGTGCCACTATTAATGATCC TTTAGCAGACTTTTCATCATGTTCAAGAGATATTCCATATTTGCAACAGCTTGCCACAAA TGTTATTCGTGTCTACGCTGTCAATACATCTTTAGACCATGATGAGTGTATGAAGGCCTT AAATGATGCTGGTATCTATGTCATTGCTGACCTTTCTGCACCAAAGACATCAGTTAACAG AGACAGTCCATCATGGGATCTTGAACTATATGAACGTTACACTTCTGTTGTCGATATGTT TGCCAACTACTCAAATGTTTTGGGTTTCTTTGCAGGTAACGAGGTTACTAATAACTCTAC CAACACAGATGCTTCTGCATTTGTTAAGGCAGCTGTTAGAGACACCAAGCAATATATTAA ATCGAAAGGTTACAGAAAGATCCCTGTTGGTTATTCTTCTAACGATGATGCTGATACTAG AGTTTCAATTGCTGACTACTTTGCATGTGGTGACGAAGACCAAAGGGCTGATTTTTACGG TATTAATATGTATGAATGGTGTGGTAACTCCAACTTACAAAAATCTGGTTATGCTGACAG AACTAAGGAATTTTCAAATCTGTCAATTCCACTTTTCTTCTCTGAATATGGTTGTAATGA AGTCACTCCAAGACTTTTCACTGAAGTTCAAGCTCTATTTGGAGATCAAATGACTGATGT GTGGTCCGGTGGTATTGTATACCTTTACTTTGAAGAAGAGAACCATTACGGTTTGGTCAG 
TATTGATGGCAATGATGTGAAGACATTGGATGACTTCAACAACTACTCTAAACAAATTCA TAGCATAAGTCCATCTTCCGCTAACACTGCATCTTACTCAGCCTCTTCTACTTCATTGTC CTGCCCAACTTCTAACAGCTACTGGAAGGCTTCTAGTAATCTTCCTCCAACACCAGATAA GGATTTGTGTCTATGTATGGATGATGCTAACAGCTGCATTGTTGCTGACAAGGTTGATGA AGATGACTATAAGGACCTATACGGTTATGTTTGTGGTGAAATTGACTGTAGTGGTATTAC CGGTAATGGTACGACTGGTAAGTATGGTTCTTACTCCTTCTGTTCTCCTAAGGAAAAGCT GAACTTCGTCTTGAATTTGTACTATCAATCTAAGGGTGGTTCAAAGTCTGACTGTGACTT CAGCGGTTCTGCTTCTGTCAGATCCGCCACTACCCATGCTGGTTGTGCTTCCGCCTTGAA GGAAATTGGTAGTGTCGGTACGAACTCTGCTACTGACTCTGCCACATACTCAGGATCTGG TACTGGATCCATGTCTACTTCAAAGGCTTCTGCCAGTGGCTCATCCAAGGGCTCCTCCTC TGCCAAGTCTGGTTCTGCAAGTGGTTCTTCGTCTTCCTCTTCTCGTTCTGCCACTTCCTC TTCCAAAAGCAACAAGAAGAATGCCGGTGTCAACTTGAAGACTGACTTATTCCAAGTTAT TGCTACATCTGTCATTTCAATTTCCATGCTTGCTGGTCTTGGATTTGTTCTTGCTTAATT TCGGTTATAGAATGAACAGTTATTTTTATGTATGATTAAATTTTTTTCTATAATGGGATC TTGATTATCCTTAATTTTTTATATGTTCTTATGTATATGCCTTTGTTTATTTTTTAGATT ATTACATTTTTATCGAGTTTTTGAACTACATATTTAGAAACACCACGTTACCATATTCCG CCAACCAAACAGAAATGTGACCAGTTAAAAATTATATGCCAATTCAGTATACATTTATAA ATTACTGAAGAAAATTTTATGGACTATGTACACATTATGATCTTATATATCATATTCT
>CAGL0M13849g.cds ATGTTTTTCAAAAACACTTTAGCAGCATTAACTGCTGCCAGTGCTTTATTTAGTACTGTA AAGGCTGATGATCTACCACCTATTGAAATTGTTGGTAACAAGTTTTTTTTCTCTAACAAT GGTTCTCAGTTTTACATGAGAGGTATTGCCTACCAAGCTGATACTGCAAATGCTACTTCA GGTGCCACTATTAATGATCCTTTAGCAGACTTTTCATCATGTTCAAGAGATATTCCATAT TTGCAACAGCTTGCCACAAATGTTATTCGTGTCTACGCTGTCAATACATCTTTAGACCAT GATGAGTGTATGAAGGCCTTAAATGATGCTGGTATCTATGTCATTGCTGACCTTTCTGCA CCAAAGACATCAGTTAACAGAGACAGTCCATCATGGGATCTTGAACTATATGAACGTTAC ACTTCTGTTGTCGATATGTTTGCCAACTACTCAAATGTTTTGGGTTTCTTTGCAGGTAAC GAGGTTACTAATAACTCTACCAACACAGATGCTTCTGCATTTGTTAAGGCAGCTGTTAGA GACACCAAGCAATATATTAAATCGAAAGGTTACAGAAAGATCCCTGTTGGTTATTCTTCT AACGATGATGCTGATACTAGAGTTTCAATTGCTGACTACTTTGCATGTGGTGACGAAGAC CAAAGGGCTGATTTTTACGGTATTAATATGTATGAATGGTGTGGTAACTCCAACTTACAA AAATCTGGTTATGCTGACAGAACTAAGGAATTTTCAAATCTGTCAATTCCACTTTTCTTC TCTGAATATGGTTGTAATGAAGTCACTCCAAGACTTTTCACTGAAGTTCAAGCTCTATTT GGAGATCAAATGACTGATGTGTGGTCCGGTGGTATTGTATACCTTTACTTTGAAGAAGAG AACCATTACGGTTTGGTCAGTATTGATGGCAATGATGTGAAGACATTGGATGACTTCAAC AACTACTCTAAACAAATTCATAGCATAAGTCCATCTTCCGCTAACACTGCATCTTACTCA GCCTCTTCTACTTCATTGTCCTGCCCAACTTCTAACAGCTACTGGAAGGCTTCTAGTAAT CTTCCTCCAACACCAGATAAGGATTTGTGTCTATGTATGGATGATGCTAACAGCTGCATT GTTGCTGACAAGGTTGATGAAGATGACTATAAGGACCTATACGGTTATGTTTGTGGTGAA ATTGACTGTAGTGGTATTACCGGTAATGGTACGACTGGTAAGTATGGTTCTTACTCCTTC TGTTCTCCTAAGGAAAAGCTGAACTTCGTCTTGAATTTGTACTATCAATCTAAGGGTGGT TCAAAGTCTGACTGTGACTTCAGCGGTTCTGCTTCTGTCAGATCCGCCACTACCCATGCT GGTTGTGCTTCCGCCTTGAAGGAAATTGGTAGTGTCGGTACGAACTCTGCTACTGACTCT GCCACATACTCAGGATCTGGTACTGGATCCATGTCTACTTCAAAGGCTTCTGCCAGTGGC TCATCCAAGGGCTCCTCCTCTGCCAAGTCTGGTTCTGCAAGTGGTTCTTCGTCTTCCTCT TCTCGTTCTGCCACTTCCTCTTCCAAAAGCAACAAGAAGAATGCCGGTGTCAACTTGAAG ACTGACTTATTCCAAGTTATTGCTACATCTGTCATTTCAATTTCCATGCTTGCTGGTCTT GGATTTGTTCTTGCTTAA
>CAGL0M13849g.aa MFFKNTLAALTAASALFSTVKADDLPPIEIVGNKFFFSNNGSQFYMRGIAYQADTANATS GATINDPLADFSSCSRDIPYLQQLATNVIRVYAVNTSLDHDECMKALNDAGIYVIADLSA PKTSVNRDSPSWDLELYERYTSVVDMFANYSNVLGFFAGNEVTNNSTNTDASAFVKAAVR DTKQYIKSKGYRKIPVGYSSNDDADTRVSIADYFACGDEDQRADFYGINMYEWCGNSNLQ KSGYADRTKEFSNLSIPLFFSEYGCNEVTPRLFTEVQALFGDQMTDVWSGGIVYLYFEEE NHYGLVSIDGNDVKTLDDFNNYSKQIHSISPSSANTASYSASSTSLSCPTSNSYWKASSN LPPTPDKDLCLCMDDANSCIVADKVDEDDYKDLYGYVCGEIDCSGITGNGTTGKYGSYSF CSPKEKLNFVLNLYYQSKGGSKSDCDFSGSASVRSATTHAGCASALKEIGSVGTNSATDS ATYSGSGTGSMSTSKASASGSSKGSSSAKSGSASGSSSSSSRSATSSSKSNKKNAGVNLK TDLFQVIATSVISISMLAGLGFVLA*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
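The relationship between the lengths reported in this record can be checked with a few lines of arithmetic. This is a minimal sketch: the variable names are illustrative, and it assumes the chromosome coordinates are 1-based and inclusive, which is consistent with the stated element length.

```python
# Values copied from this record; names are illustrative only.
start, end = 1368269, 1369966        # position on Cagl0M (assumed 1-based, inclusive)

element_length = end - start + 1     # nucleotides spanned by the CDS
codons = element_length // 3         # codon count, stop codon included
protein_length = codons - 1          # residues in the translation product

print(element_length, codons, protein_length)
```

Under that assumption the values reproduce the 1698 nucleotides and 566 codons listed above, with a protein one unit shorter than the codon count.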
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark protein-coding potential of the DNA was computed from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
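The correspondence between the coding sequence and the predicted translation product can be illustrated with a short translation routine. This is a sketch, not the method used to build this page: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. The two test fragments are the first 18 and last 12 nucleotides of the `.cds` sequence in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
# Assumption: the standard nuclear code applies to this sequence.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence; whitespace is ignored, '*' marks a stop."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3   # drop any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGTTTTTCAAAAACACT"   # first 18 nt of CAGL0M13849g.cds
cds_end = "GTTCTTGCTTAA"           # last 12 nt of CAGL0M13849g.cds

print(translate(cds_start))  # begins with the initial methionine
print(translate(cds_end))    # ends with the stop codon, shown as '*'
```

The start fragment yields the opening residues of the `.aa` entry and the end fragment closes with `*`, matching the start and stop codons highlighted in the table.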
URL: http://www.genolevures.org/elt/CAGL/CAGL0M13849g