Element type: CDS
Element length: 1680 nucleotides,
on anti-sense strand of
Cagl0G: complement(27239..28918).
Other names:
CAGL-CDS1723.1
CAGL-IPF8924
Coding sequence: 560 codons.
Element length: 1680 nucleotides,
on anti-sense strand of
Cagl0G: complement(27239..28918).
Other names:
CAGL-CDS1723.1
CAGL-IPF8924
Coding sequence: 560 codons.
Database cross references:
EMBL: CR380953
GeneID: 2888417
HOGENOM: Q6FTR7
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380953
GeneID: 2888417
HOGENOM: Q6FTR7
Homologs and Orthologs
Homologs in protein families: GL3R0042 GL3R0042.N2Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR000873
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0G00286g
PROSITE: PS00455
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_446377.1
SMART: SM00768
UniProtKB/TrEMBL: Q6FTR7
UniprotKB: Q6FTR7_CANGA
InterPro: IPR000873
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0G00286g
PROSITE: PS00455
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_446377.1
SMART: SM00768
UniProtKB/TrEMBL: Q6FTR7
UniprotKB: Q6FTR7_CANGA
Sequence data 
>CAGL0G00286g.nt ACTCATACTCTGAACAAAGATGATGACCGGAGGGTAGTTGGAGCACTAAGTGCAGAGAAT GTAGCGTACACCCTCTGCCATTAATCCTTCCCCCGCTTTTTCATAGCCAGCACATTATAC CTCTTCGAGCGATAAAACAACAGAAACTGCAAAAATGTTTCCCCATCCGGCACAAATTAT CCTACTGAGCGGTCCCGCGACCAGTGGTAGCTCAGAAATGGCCGCAATAACCACTAAAAA ACCTGGCATGCTGATCTGAAATACACAGACAAACACAGACACACTGCCATACCTGTGCAA ACCCAGAGAAGGCAGTACCAGAGAGCACAATCCGCCCCCAAAAAGAAACAATCTTCAGAG AAAAGCTTCTGTATCCTCCCCCCACTGCAAGCAACGCACAACTGCGCAATAGAAAGTATA AAACCTTCGAAAAATAGAAGAAAATCAGAGAGTGCGGGTCCTGCAGAAGTGTCTGGGCAG CATATCTAGAAAGGGCTTGAAAGGGCTTGAAAGGGCTTCAAAAGTAGCTTCGAAAGAAAA CCGCTTTGTTCCTCTCCCCCCCATCTCAAGCCTTTGTGTTACGTGCACTACCTTCGTTCT CCCCCCACCGATAATCCAGCAAATCCACCAAAAACTCAGCTCACATATTTTGGTGTTTTC GTGTTGGTTAGCATTGCCCTTGCAGCGAAATATTCTATCCAGTGAAAAAAATGAATTAAA CAATCAAACAATAACACAGCCTACTTTAAAATTCACAGGACGCAATGCGTAACCTTGCTT TTTGAACAGAACTAATTAAGGCAGGAAAAGGAAAAGATTGAATTTATCTTGACAACAACA GGATGAGGATATATAAGGACCACATTTGATCCTTTCTTGAAACGTAATCTTAAACTCTAT GTTTACTTCTAGTGCTGTGTTCTAATAGTTTTTTTTCTTCACTTAGCTGTATCAAACAAC TCACTGTATCAATCACTATTTTACTATAACTAGATCAATAATGCAATACAGTCTGGTTTC TTTCATAATAGCTGCTACATTGCTGTTGTCGTCAGTTATGGCTGATGACCTGCCGGCTAT TGAGATCAAGGGTAACAAGTTCTTTTTCTCCAATAACGGTTCCCAATTCTACATGAAGGG TATCGCTTACCAAGCTGATACCGCTAACGTCACAGGCGGTGCCACCATCAACGACCCATT GGCCGACTGGGACACCTGTTCCAGAGATATTCCATACTTGCAACAACTAGCTACTAACGT CATCCGTGTCTACGCCGTCAACACTTCCTTGGACCACTCCAGATGTATGAACGCTCTGAA CGATGCCGGTATCTACGTCATTGCCGATTTGTCCTCTCCAAAGGTTTCCGTCAACAGAAA GTCTCCTTCCTGGGACTTGGAAATCTTCGACCGTTACAAGTCCGTTGTTGACATGTTCGC TAACTACTCCAACGTTCTAGGTTTCTTCGCAGGTAACGAGGTTACCAACGATGCCACCAA CACTGACGCTTCCGCTTTCGTTAAGGCCGCCATTAGAGACACCAAGTCCTACATCAAGGA AAAGGGTTACAGAGGTATCCCAGTTGGTTACTCTTCTAACGATGACGCCGACACCAGAGT TGACATCGCTGATTACTTCGCTTGTGGTGACGACGCTGAAAGAGCTGACTTCTACGGTAT TAACATGTACGAATGGTGTGGTAACTCTACTTTCCAAAAGTCTGGTTACGCTGACAGAAC CAAGGAATTCGCCAACTTGTCCATCCCATTGTTCTTCTCCGAATACGGTTGTAACGAAGT TCAACCAAGAGAGTTCACTGAAGTTCAAGCACTATACGGCCCTGATATGACTGATGTCTG GTCCGGTGGTATTGTCTACATGTATTTCCAAGAAGCAAACAACTACGGTTTGGTTAGCAT CGATGGCTCTAGTGTTAAGACTTTGGAAGATTTCAACTACTACTCTAAAGAAATCCACTC CATCTCCCCATCCTCAGTAAACTCCAAGACTTACACTCCAACCGCAACCTCTTTGGCTTG CCCATCTACTAACCAATACTGGAAGGCTGCCACTAACTTGCCACCAACCCCACAACTAGA TCTATGTGAATGTATGGATGCTGCTAACTCTTGTATTGTTCAAGATGATGTCGATGAAGA CGACTACCAAGATTTGTTCTCCTACTTGTGTGGTAAGATTGACTGTGGTGGTATTACTGG TAACGGTACCACCGGTAAGTACGGTTCTTACTCTTTCTGCTCTCCAAAGGAAAAGCTAAA CTTCGTTCTAAACCTATACTACAACGCCCAAGGTGGTTCCAAGTCTAACTGTGACTTCAG TGGTTCTGCTACATTGAGAAGTGGAACTACCCAAGCTGGTTGTGCCTCTGCCTTGAAGGA AATTGGTAGCGTCGGTACTAACTCTGCTACCGATTCTGTTACTTTCTCTGGTGGCTCTAC TGGTACTTCCAAGGCATCTGCTACCGGCTCTAACTCTTCCAAGTCCGGCTCAAGCAAATC CGGCTCTTCTACAAGTTCTTCTGCTAAGAGCTCTTCCTCTGGTAAGAGTAACAAGAAGTC TAACAGCTCTAGCTCCGTCCAAGTTGGTCTATACCAACTTCTTTTCTCAGCTTTCATCAC ATTAGGTGCAGTCGCCGGTGCTGGTTTCGCTCTTATTTAATTTCAGAATGATTTAAATAT TTGATGAATCCTAATGTTCATATGATCAACGGGAAGAATATAGAAAAATATTCATTTTAA TGCATGTATACTAAAGACATTTGTTTCATTTTTTCTCAACTAATTTCATTCATATCCAAT TAACCAAAATATTTATTTGTAATTTTAGCTATGATCTTTAATGGAATTGTGTTTTTTTTT GGACGCGATTTATTGTCGCCTAAAGTATTCTAATTTGCTGACATATATTTTCTATAAAAG CGATGCAGGGGTATACAGTTCTGTGTTGAGGTAATATTAG
>CAGL0G00286g.cds ATGCAATACAGTCTGGTTTCTTTCATAATAGCTGCTACATTGCTGTTGTCGTCAGTTATG GCTGATGACCTGCCGGCTATTGAGATCAAGGGTAACAAGTTCTTTTTCTCCAATAACGGT TCCCAATTCTACATGAAGGGTATCGCTTACCAAGCTGATACCGCTAACGTCACAGGCGGT GCCACCATCAACGACCCATTGGCCGACTGGGACACCTGTTCCAGAGATATTCCATACTTG CAACAACTAGCTACTAACGTCATCCGTGTCTACGCCGTCAACACTTCCTTGGACCACTCC AGATGTATGAACGCTCTGAACGATGCCGGTATCTACGTCATTGCCGATTTGTCCTCTCCA AAGGTTTCCGTCAACAGAAAGTCTCCTTCCTGGGACTTGGAAATCTTCGACCGTTACAAG TCCGTTGTTGACATGTTCGCTAACTACTCCAACGTTCTAGGTTTCTTCGCAGGTAACGAG GTTACCAACGATGCCACCAACACTGACGCTTCCGCTTTCGTTAAGGCCGCCATTAGAGAC ACCAAGTCCTACATCAAGGAAAAGGGTTACAGAGGTATCCCAGTTGGTTACTCTTCTAAC GATGACGCCGACACCAGAGTTGACATCGCTGATTACTTCGCTTGTGGTGACGACGCTGAA AGAGCTGACTTCTACGGTATTAACATGTACGAATGGTGTGGTAACTCTACTTTCCAAAAG TCTGGTTACGCTGACAGAACCAAGGAATTCGCCAACTTGTCCATCCCATTGTTCTTCTCC GAATACGGTTGTAACGAAGTTCAACCAAGAGAGTTCACTGAAGTTCAAGCACTATACGGC CCTGATATGACTGATGTCTGGTCCGGTGGTATTGTCTACATGTATTTCCAAGAAGCAAAC AACTACGGTTTGGTTAGCATCGATGGCTCTAGTGTTAAGACTTTGGAAGATTTCAACTAC TACTCTAAAGAAATCCACTCCATCTCCCCATCCTCAGTAAACTCCAAGACTTACACTCCA ACCGCAACCTCTTTGGCTTGCCCATCTACTAACCAATACTGGAAGGCTGCCACTAACTTG CCACCAACCCCACAACTAGATCTATGTGAATGTATGGATGCTGCTAACTCTTGTATTGTT CAAGATGATGTCGATGAAGACGACTACCAAGATTTGTTCTCCTACTTGTGTGGTAAGATT GACTGTGGTGGTATTACTGGTAACGGTACCACCGGTAAGTACGGTTCTTACTCTTTCTGC TCTCCAAAGGAAAAGCTAAACTTCGTTCTAAACCTATACTACAACGCCCAAGGTGGTTCC AAGTCTAACTGTGACTTCAGTGGTTCTGCTACATTGAGAAGTGGAACTACCCAAGCTGGT TGTGCCTCTGCCTTGAAGGAAATTGGTAGCGTCGGTACTAACTCTGCTACCGATTCTGTT ACTTTCTCTGGTGGCTCTACTGGTACTTCCAAGGCATCTGCTACCGGCTCTAACTCTTCC AAGTCCGGCTCAAGCAAATCCGGCTCTTCTACAAGTTCTTCTGCTAAGAGCTCTTCCTCT GGTAAGAGTAACAAGAAGTCTAACAGCTCTAGCTCCGTCCAAGTTGGTCTATACCAACTT CTTTTCTCAGCTTTCATCACATTAGGTGCAGTCGCCGGTGCTGGTTTCGCTCTTATTTAA
>CAGL0G00286g.aa MQYSLVSFIIAATLLLSSVMADDLPAIEIKGNKFFFSNNGSQFYMKGIAYQADTANVTGG ATINDPLADWDTCSRDIPYLQQLATNVIRVYAVNTSLDHSRCMNALNDAGIYVIADLSSP KVSVNRKSPSWDLEIFDRYKSVVDMFANYSNVLGFFAGNEVTNDATNTDASAFVKAAIRD TKSYIKEKGYRGIPVGYSSNDDADTRVDIADYFACGDDAERADFYGINMYEWCGNSTFQK SGYADRTKEFANLSIPLFFSEYGCNEVQPREFTEVQALYGPDMTDVWSGGIVYMYFQEAN NYGLVSIDGSSVKTLEDFNYYSKEIHSISPSSVNSKTYTPTATSLACPSTNQYWKAATNL PPTPQLDLCECMDAANSCIVQDDVDEDDYQDLFSYLCGKIDCGGITGNGTTGKYGSYSFC SPKEKLNFVLNLYYNAQGGSKSNCDFSGSATLRSGTTQAGCASALKEIGSVGTNSATDSV TFSGGSTGTSKASATGSNSSKSGSSKSGSSTSSSAKSSSSGKSNKKSNSSSSVQVGLYQL LFSAFITLGAVAGAGFALI*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0G00286p