Element type: CDS
Element length: 1629 nucleotides,
on sense strand of
Cagl0G: 99661..101289.
Other names:
CAGL-CDS1823.1
CAGL-IPF3881
Coding sequence: 543 codons.
Database cross references:
EMBL: CR380953
GeneID: 2888270
HOGENOM: Q6FTN7
Homologs and Orthologs
Homologs in protein families: GL3R0042 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0G01056g
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_446407.1
SMART: SM00768
UniProtKB/TrEMBL: Q6FTN7
UniProtKB: Q6FTN7_CANGA
Sequence data 
>CAGL0G01056g.nt GTTCTAGGTCATGTTATTCTATTTGCCCAATTCCCAGTTCTATGTATTCCAGCCATTGAC AAGTTCCACTCCATCATGTTATTCTGGTTAAAGCCATCGCGTCAAATCCGTCCACCAATT TATTCTTTGAAACAATCTCGTCTAAGAAAGAGAATGGTCAAGAGATACCTAACCCTTTAC ATTATCGTTTTCCTAGTATTTGCTGGTTGCATCGTAGGCCCAGCAGTTGCTTCTTCCCAT GTTGCTAAGGACTTGGGTCACCAATTAACTGGTACTTTCCACAATTTGGTGCAACCAAGA AATGTAAGCAACAACGATACTGGTTTTGGTATTTCCACATACAGTAACCATTACTACACG CATACTCCATCTCTAAAGACCTGGTCTACAATCAAATAATCCATTTCAATAATTTGCGTG ATTTTATTGTTTATTCTTATTCCACATTTCATCATTTGATATTAAACGAGTTCAAGATAA TAGATTTATTTTTTTCATCATTTACCTCCATACTACATCAAGAATTTTTTTATTTGATTG ATGGGTTTATTTGTTATGGTTCGAAGACGGTTTTAAAATATTTATAGAGATACTGGTTAT ATCCCGCTGTTTTTACGACTGATCGTCCTTCAACTAGCAAATCCGTTTTGGTATTGTATT TTAGTTATTTTATATTTTCAGGAGTTTCTTTACCTTATATATATTAAGAGGAACCATATT ATTATTTTTCATGTACTTTTTTTTCATTTTGTCTCTGATCTTTGTTACACTTCATTAGTT TTTGTGAGCGCTAACTTAATAAAAAGCTCCCAATTAAAGTGTCGGGTGCTTTTGGAACCG CAAAAACAATTGATAGAATAGAAAATAAACTAAGATGTAGTACATAATTCAGAGATTATT CATTTATTCATGCGGGTGAATATATAATGAAGAAATTATCCACAATAGGTTTCAATATTT TATAAGTGTCTAGACAACACAAGGGAAATTGGTATACCTCATGAAGAAATTATTATATTT TGCTTTGTCTGTGGTAGCAACATCAGCTCAGCTAGCTGATGAGGTCTTATTACGCAAATG GTCTTTGCAGTTACCTACTATCGAAATCGAAGGCAATAAATTCTTCAATAGTGAAACTGG TGAGCAATTTTTTATGAAAGGCATTGCATACCAACAGCAAGTTGATCAGGATAGTGAATT ATATGATGGTACACCATATGTAGATCCTCTTGCTGACCCACATATATGTCTCCGTGATCT ACCATATCTGGTGGAGCTTGGAATCAATACAATAAGAGTGTATCATATTGACCCCAGCTC TTCACATGATACATGCATGAAGGCATTTTCTGATGCAGGTATATATGTGCTCATCGATCT CGCAGAACCAGAAATATCCATAGTCCGTAATAACCCTAGTTGGGATGTTAAAGTATGGTC TAGGTATAGAGATGTAGTTGATGCCATGCATTTTTATAACAATGTTTTAGGCTTTTTTGC TGGTAATGAAGTGACCAATGACAAATATAATACTGATGCTTCACCATTTGTGAAGGCGGC AATTAGAGATGTAAAGACTTATATGCAACAAAAGGGATATAGGAATATCCCTGTAGGTTA TTCAACGAATGACGATGCTGAAACTAGAATAAATCTTTCCAAATATTTTGTATGTGGAGA AAACTCAGCAGATTTCTATGGCATAAATATGTATGAGTGGTGTGGCTATTCTACATACGG CACCAGCGGCTACAAAGAAAGAACTGAAGAATTTACTGACTTCCCTGTTCCAGTGTTTTT CTCTGAATTTGGGTGTAACTTGGTTAGACCCAGGCCATTTACAGAAGTAGCCGCCCTCTT CAGTAAGAAAATGTCTTCTGTATGGTCTGGCGGTTTAGTTTATATGTATTTTGAGGAGGA 
AAATCAATATGGTGTTGTTAAAATTAACAAAAATAATGAAGTAGAGAAGCTACCTGATTT TGATAATTTGAAAAAAGCATATAGGAAAGCAACCCCTAAGGGTGTCAATCTTTCAGATCA AGCTGTCTCAAGGAAATCCATTAATGTTCGCAAACTTGATTGCCCGGAGAAAAGCCATAA TAATAATTGGTTAGCGTCTGATATTTTACCACCGACACCTAATGACGAGAAATGTTCTTG CTTGGATGAGATTTTACCTTGCCTGGCATTACCTAGTAATGACGATCAAGATCATTACAA GACTTTATTTAACTACGTTTGTGGGGAAGTCGACTGTACAGATATCAAAACAGATGGTAC ATTAGGGAAGTATGGAAAATTCTCAGACTGTTCAGTAAATCAAAAGCTGTCCTTGCAATT GAGTAAGCTTTACTACAAATTAAAATTAGAGGATCACATATGCCCAACTAACCCGAAATA CGTTCGGTTTAACAGCGCGACCATAACTCGGAAGGATACTTGTGAATCTATCCTCAAAGA AATTAGCCAAGGGACTGCAAAAACTAAAAAGGAACCGATAAATGAAATAGTCACATCGGT ACCAGCGGAAGACGGAATGATTTCCTCCACCGCTAACACTTTGAGTGGCACAATAATACT AGTTATTATTGTTCTAAATACTTTAGTAGTCCTGCTTATAGCATATTAATAAGAATGATC TAAAAATCTATATCATAGTAAAATTGACTCAAGCTCTAGCAGGAATCTCAATAAAGTCAT AATGTTTCATACCTCATAGAGATCCTTCCAGTATACTACCTTAACGACAAATATATAGAT AGTACCGACAGGCATTGTAATACCGACTTGAGTACATTTGTGACTGTAGCTAATGGAACA CCCATACACAACACCCAAACGTTTCAAATAAGATGACTGGTTTTCAAACTTGTACTTCAT GTTTCCGTATACGGAAAAAAAACCTTCGAATCTAGAAATAATTTGTCAC
>CAGL0G01056g.cds ATGAAGAAATTATTATATTTTGCTTTGTCTGTGGTAGCAACATCAGCTCAGCTAGCTGAT GAGGTCTTATTACGCAAATGGTCTTTGCAGTTACCTACTATCGAAATCGAAGGCAATAAA TTCTTCAATAGTGAAACTGGTGAGCAATTTTTTATGAAAGGCATTGCATACCAACAGCAA GTTGATCAGGATAGTGAATTATATGATGGTACACCATATGTAGATCCTCTTGCTGACCCA CATATATGTCTCCGTGATCTACCATATCTGGTGGAGCTTGGAATCAATACAATAAGAGTG TATCATATTGACCCCAGCTCTTCACATGATACATGCATGAAGGCATTTTCTGATGCAGGT ATATATGTGCTCATCGATCTCGCAGAACCAGAAATATCCATAGTCCGTAATAACCCTAGT TGGGATGTTAAAGTATGGTCTAGGTATAGAGATGTAGTTGATGCCATGCATTTTTATAAC AATGTTTTAGGCTTTTTTGCTGGTAATGAAGTGACCAATGACAAATATAATACTGATGCT TCACCATTTGTGAAGGCGGCAATTAGAGATGTAAAGACTTATATGCAACAAAAGGGATAT AGGAATATCCCTGTAGGTTATTCAACGAATGACGATGCTGAAACTAGAATAAATCTTTCC AAATATTTTGTATGTGGAGAAAACTCAGCAGATTTCTATGGCATAAATATGTATGAGTGG TGTGGCTATTCTACATACGGCACCAGCGGCTACAAAGAAAGAACTGAAGAATTTACTGAC TTCCCTGTTCCAGTGTTTTTCTCTGAATTTGGGTGTAACTTGGTTAGACCCAGGCCATTT ACAGAAGTAGCCGCCCTCTTCAGTAAGAAAATGTCTTCTGTATGGTCTGGCGGTTTAGTT TATATGTATTTTGAGGAGGAAAATCAATATGGTGTTGTTAAAATTAACAAAAATAATGAA GTAGAGAAGCTACCTGATTTTGATAATTTGAAAAAAGCATATAGGAAAGCAACCCCTAAG GGTGTCAATCTTTCAGATCAAGCTGTCTCAAGGAAATCCATTAATGTTCGCAAACTTGAT TGCCCGGAGAAAAGCCATAATAATAATTGGTTAGCGTCTGATATTTTACCACCGACACCT AATGACGAGAAATGTTCTTGCTTGGATGAGATTTTACCTTGCCTGGCATTACCTAGTAAT GACGATCAAGATCATTACAAGACTTTATTTAACTACGTTTGTGGGGAAGTCGACTGTACA GATATCAAAACAGATGGTACATTAGGGAAGTATGGAAAATTCTCAGACTGTTCAGTAAAT CAAAAGCTGTCCTTGCAATTGAGTAAGCTTTACTACAAATTAAAATTAGAGGATCACATA TGCCCAACTAACCCGAAATACGTTCGGTTTAACAGCGCGACCATAACTCGGAAGGATACT TGTGAATCTATCCTCAAAGAAATTAGCCAAGGGACTGCAAAAACTAAAAAGGAACCGATA AATGAAATAGTCACATCGGTACCAGCGGAAGACGGAATGATTTCCTCCACCGCTAACACT TTGAGTGGCACAATAATACTAGTTATTATTGTTCTAAATACTTTAGTAGTCCTGCTTATA GCATATTAA
>CAGL0G01056g.aa MKKLLYFALSVVATSAQLADEVLLRKWSLQLPTIEIEGNKFFNSETGEQFFMKGIAYQQQ VDQDSELYDGTPYVDPLADPHICLRDLPYLVELGINTIRVYHIDPSSSHDTCMKAFSDAG IYVLIDLAEPEISIVRNNPSWDVKVWSRYRDVVDAMHFYNNVLGFFAGNEVTNDKYNTDA SPFVKAAIRDVKTYMQQKGYRNIPVGYSTNDDAETRINLSKYFVCGENSADFYGINMYEW CGYSTYGTSGYKERTEEFTDFPVPVFFSEFGCNLVRPRPFTEVAALFSKKMSSVWSGGLV YMYFEEENQYGVVKINKNNEVEKLPDFDNLKKAYRKATPKGVNLSDQAVSRKSINVRKLD CPEKSHNNNWLASDILPPTPNDEKCSCLDEILPCLALPSNDDQDHYKTLFNYVCGEVDCT DIKTDGTLGKYGKFSDCSVNQKLSLQLSKLYYKLKLEDHICPTNPKYVRFNSATITRKDT CESILKEISQGTAKTKKEPINEIVTSVPAEDGMISSTANTLSGTIILVIIVLNTLVVLLI AY*
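As a rough consistency check, the coding sequence record can be translated and compared with the protein record. The sketch below is only an illustration, not the procedure used to build this page. It assumes the standard nuclear genetic code applies to this gene, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; whitespace is ignored, stop codons become '*'."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First nine codons and last three codons of the .cds record above.
print(translate("ATGAAGAAATTATTATATTTTGCTTTG"))
print(translate("GCATATTAA"))
```

Applied to the full `.cds` record, the result can be compared character by character with the `.aa` record, spaces removed.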
Legend and notes 
Lengths
The length of a coding sequence, in codons, includes the stop codon, so it is one unit longer than the protein length.
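The arithmetic behind these lengths can be checked against the values given for this element (coordinates 99661..101289, 1629 nucleotides, 543 codons). The sketch below assumes the coordinates are 1-based and inclusive; the function names are hypothetical.

```python
def element_length(start: int, end: int) -> int:
    """Length in nucleotides of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def codon_count(nucleotides: int) -> int:
    """Number of codons in a coding sequence, stop codon included."""
    if nucleotides % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nucleotides // 3


def protein_length(codons: int) -> int:
    """Protein length in residues: the stop codon is not translated."""
    return codons - 1


nt = element_length(99661, 101289)
codons = codon_count(nt)
print(nt, codons, protein_length(codons))  # 1629 543 542
```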
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark computation of the protein-coding potential of the DNA was run from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
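For illustration, the extent of that analysis window can be derived from the element coordinates. This is a sketch under stated assumptions: coordinates are 1-based and inclusive, the element lies on the sense strand, and the window is not clipped at a chromosome end. The function name `analysis_window` and its parameters are hypothetical.

```python
def analysis_window(start: int, end: int, upstream: int = 1000, downstream: int = 300):
    """Return (window_start, window_end, window_length) for a sense-strand ORF.

    Assumes 1-based, inclusive coordinates and no clipping at chromosome ends.
    """
    window_start = start - upstream
    window_end = end + downstream
    return window_start, window_end, window_end - window_start + 1


# Element coordinates from this record: Cagl0G 99661..101289.
print(analysis_window(99661, 101289))  # (98661, 101589, 2929)
```

The resulting length, 2929 nucleotides, is the sum of the 1000 upstream nucleotides, the 1629-nucleotide element, and the 300 downstream nucleotides.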
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/CAGL/CAGL0G01056p