CAGL0H03289g
similar to uniprot|P53155 Saccharomyces cerevisiae YGL082w (ohnolog of YPL191C) Putative protein of unknown function
Element type: CDS
Element length: 1158 nucleotides, on sense strand of Cagl0H: 309091..310248.
Other names:
CAGL-CDS2861.1
CAGL-IPF1938
Coding sequence: 386 codons.
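The element length, coordinates, and codon count given above can be cross-checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, although it agrees with the stated length); `element_length` is a hypothetical helper name, not part of any database API.

```python
def element_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end coordinate must not be smaller than start")
    return end - start + 1


# Coordinates of CAGL0H03289g on Cagl0H, as listed in this record.
start, end = 309091, 310248

length = element_length(start, end)
# A coding sequence should divide evenly into codons of three nucleotides.
codons, remainder = divmod(length, 3)

print(length, codons, remainder)  # 1158 386 0
```

A non-zero remainder would point to a coordinate error or a frameshifted annotation.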
Database cross references:
EMBL: CR380954
GeneID: 2888535
HOGENOM: Q6FS58
Homologs and Orthologs
Homologs in protein families: GL3C0218 GL3C0218.N2
Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR007518
KEGG: cgr:CAGL0H03289g
PANTHER: PTHR18063
Pfam: PF04424
RefSeq: XP_446936.1
UniProtKB/TrEMBL: Q6FS58
UniProtKB: Q6FS58_CANGA
Sequence data 
>CAGL0H03289g.nt TATGATAATTCGAGCATGCAACAACCCATGTCTAATAATCAACCCATGAATCAGCAACAA TTGTTAGCTGCAATTCAGAATCTGCCACCTAATGTTGTATCAAACCTACTTTCTATGGCA CAACAACAGCAACAACAGCCTGATTCCCAACAACAAATTTTAAACATGATTCAGTCAATG CAAACACCCCAGCAGAACCAACAGCAGCAGCAACAACAACAACAACCAGTAAATATGCCA CCCCAAAATAGCTTTGGTGGTAATGTTCAACCTTCGAACCAACCCACAAATCCCGTATCG CCATCTGCTGCTGCCAACCAGCAGGACCAACAACCTGCTAACAATGTTCAAAGTCTTTTG GATAGTTTAACTCAGCTCCAAAAGTAAAGCAGGATAACTGAAGAATAAAAAGACATCTTA TAAAATATAAATTTCTTTGTGCTATTTTTTCTACCTTTCTTTCAATGATTCTCACACATC AATTGCTAATATGAATGTCTTAATTACTCTTATCTCTTAATGATTTTTCTACTTATCAAA TGAGTTTCCTTCTACCTTCTATTCCTCTGAGAATATACTCTTACACATTGCCATTTCTTA TATCTGATATCTTATCTCTTTAATCTCATCATAAATATAGTACAGTCAATGTTGTATGTG CACCGATATCTAACTTGTGCCTATCATACATCACAAATACTCTTGTATAATCAGTCTACT ATTCAACCTCCATTCGAATCTCTGCTGGAGCTCAATTTGCACTTTCAGTGATTCGCGAAA TCCAGTTTGATTAAGGGGTGTGTAGGCTCATAACGAGCATGATAGAATAAAAGTCTATCA CACAGATTAGGCATATATAGCCTGACATAAGCAGACAAAATATCATTGTAGTAATAATTC ATTATCTAGAGGTCTTAGAAATTTTGGATTACCGCAGAGATTGCAACTAAGAGGAAGCTG ACGAGTAACAACCTGCTTTGTAAAGAATATCAGAAACAGAATGGATGACTATTACAGTAT CAAAAGCATTGAATTCAAAGGGTATCATTGTCGGATACTACTAGACCAAGATGAAGACTA CTCTGCATTGGTAGCGTTGACTAATGCCCTAGTGTTGTCGCAGGGACATAATAGGGTTAC GAGCCAATTGAAGAGTATATTTGACAACTGCAATGAGATAGCTGTAGAAGATCTTCTCGA TGAACTTGCCAACATTGGACTGCAATTAGGTGTGATGAGTAACTATGGGCAAGACAAGGA ACAGTTAATAGCTACATTGAAGGAGTTTAGAAAAGGTTTGCATATTAATCCTAAATTTAA CGGATCATTTACTGACAGTTTGGAAACTTCTGTATTTAGTGGATTCAATGTTGCTTTGGT TCATGGATGGGTGGTTGATGGTGACAGAGATCCAACGAGTTATTACCATCTATCAAAATA TTCATACGAGGAAGCACAAAGGGTTTTAGTCCAAGCATATGAAATTAGGAAGGACCAGAA CGGGGTTGCGCTTAACACTAATGCACAACAAGTATTAGACGACTCTGCATATATTAAATC ATTTCTAGCACGGTCTGCTACTCAATTGACTGAGTATGGTTTACAACATTTAAAGGAAAT ACTTGTTGAGAAATCATTTGCTGTGCTATTTAGAAATGATAGATATTTCACTCTCTACAA AAATGCTGGTGAATTATTTATTCTCGTTACTAATCCATCGCAATCACGTAATAACAACAT TGTTTGGCAGTCATTACATTCGGTAAATGGTGCTAGGGATTTGTACTATAATGGAGTTTT TGTTGAGATAAACCCAGATAATGACCAAAATACATTTGATGATGTTGTTGTCCCACAAAG TAATCCATTCAGTGACCCACAAACAAACCAAGAATTCCAAAACATTGACAGAAATGATAC 
CTTCGATGCTCAGCAGGTGGAAGATGATGAATTGCTAGCTAGGCAATTGCAGGAAGAAGA GGATAGACAAGCCGCTGGTCTAATGCAGAATGCTTACAGAAGAAATGGACCTCGTAATAA GTACCAAATAGATGACGAGTCAAAGAAGAAGAAGAAAAGGAACAGTATAATTCCAAAAAT GCCGTCTCTAGGAAAAAAGAAGAAAGACGGTAAGGATAAAAATTGTATTATAATGTAACA TGCCACCTAGACATTTGAAATAACTTATGTATTATTTGAAAATTGGCCGTGGGAATGTCC ATAACACCTCAACTTTTCATTTATGCGGCGTAAAATTTTAGTGACTCATGGTCCGTTCAC CTCATTTGCTTATCAATTTTTCATTGTTCTTGGTATATTATATGAAACATGTACTATTAT AGATGTATTATACAACTATGATGTAAGTAAACAAAATGGTCCCACTTTTTGTAATTGGTT TTAATTATTTAATCATGCTATAACTTGGCTATAATCTCGGACTTTTCTACGTTATGTG
>CAGL0H03289g.cds ATGGATGACTATTACAGTATCAAAAGCATTGAATTCAAAGGGTATCATTGTCGGATACTA CTAGACCAAGATGAAGACTACTCTGCATTGGTAGCGTTGACTAATGCCCTAGTGTTGTCG CAGGGACATAATAGGGTTACGAGCCAATTGAAGAGTATATTTGACAACTGCAATGAGATA GCTGTAGAAGATCTTCTCGATGAACTTGCCAACATTGGACTGCAATTAGGTGTGATGAGT AACTATGGGCAAGACAAGGAACAGTTAATAGCTACATTGAAGGAGTTTAGAAAAGGTTTG CATATTAATCCTAAATTTAACGGATCATTTACTGACAGTTTGGAAACTTCTGTATTTAGT GGATTCAATGTTGCTTTGGTTCATGGATGGGTGGTTGATGGTGACAGAGATCCAACGAGT TATTACCATCTATCAAAATATTCATACGAGGAAGCACAAAGGGTTTTAGTCCAAGCATAT GAAATTAGGAAGGACCAGAACGGGGTTGCGCTTAACACTAATGCACAACAAGTATTAGAC GACTCTGCATATATTAAATCATTTCTAGCACGGTCTGCTACTCAATTGACTGAGTATGGT TTACAACATTTAAAGGAAATACTTGTTGAGAAATCATTTGCTGTGCTATTTAGAAATGAT AGATATTTCACTCTCTACAAAAATGCTGGTGAATTATTTATTCTCGTTACTAATCCATCG CAATCACGTAATAACAACATTGTTTGGCAGTCATTACATTCGGTAAATGGTGCTAGGGAT TTGTACTATAATGGAGTTTTTGTTGAGATAAACCCAGATAATGACCAAAATACATTTGAT GATGTTGTTGTCCCACAAAGTAATCCATTCAGTGACCCACAAACAAACCAAGAATTCCAA AACATTGACAGAAATGATACCTTCGATGCTCAGCAGGTGGAAGATGATGAATTGCTAGCT AGGCAATTGCAGGAAGAAGAGGATAGACAAGCCGCTGGTCTAATGCAGAATGCTTACAGA AGAAATGGACCTCGTAATAAGTACCAAATAGATGACGAGTCAAAGAAGAAGAAGAAAAGG AACAGTATAATTCCAAAAATGCCGTCTCTAGGAAAAAAGAAGAAAGACGGTAAGGATAAA AATTGTATTATAATGTAA
>CAGL0H03289g.aa MDDYYSIKSIEFKGYHCRILLDQDEDYSALVALTNALVLSQGHNRVTSQLKSIFDNCNEI AVEDLLDELANIGLQLGVMSNYGQDKEQLIATLKEFRKGLHINPKFNGSFTDSLETSVFS GFNVALVHGWVVDGDRDPTSYYHLSKYSYEEAQRVLVQAYEIRKDQNGVALNTNAQQVLD DSAYIKSFLARSATQLTEYGLQHLKEILVEKSFAVLFRNDRYFTLYKNAGELFILVTNPS QSRNNNIVWQSLHSVNGARDLYYNGVFVEINPDNDQNTFDDVVVPQSNPFSDPQTNQEFQ NIDRNDTFDAQQVEDDELLARQLQEEEDRQAAGLMQNAYRRNGPRNKYQIDDESKKKKKR NSIIPKMPSLGKKKKDGKDKNCIIM*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
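As an illustration of this convention, the sketch below splits a coding sequence into codons and derives the protein length by dropping the terminal stop codon. It assumes the standard genetic code stop codons (TAA, TAG, TGA); the function names and the short `toy_cds` string are hypothetical and only serve the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def split_codons(cds: str) -> list[str]:
    """Split a coding sequence into consecutive three-nucleotide codons."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def protein_length(cds: str) -> int:
    """Number of residues encoded, i.e. codon count minus the stop codon."""
    codons = split_codons(cds.upper())
    if not codons or codons[-1] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    return len(codons) - 1


toy_cds = "ATGGATGACTATTAA"  # toy example: 4 sense codons followed by TAA
print(len(split_codons(toy_cds)), protein_length(toy_cds))  # 5 4

# Applied to this record: 386 codons correspond to a 385-residue protein.
print(386 - 1)  # 385
```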
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, such that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The GeneMark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
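The window described above can be derived from the coordinates of the open reading frame. The following is a minimal sketch, assuming 1-based inclusive coordinates and a window that does not run past the chromosome start; `analysis_window` is a hypothetical helper, not part of GeneMark, and the sketch makes no attempt to reproduce the frame numbering.

```python
UPSTREAM = 1000    # nucleotides included before the ORF, per the note above
DOWNSTREAM = 300   # nucleotides included after the ORF, per the note above


def analysis_window(orf_start: int, orf_end: int,
                    upstream: int = UPSTREAM,
                    downstream: int = DOWNSTREAM) -> tuple[int, int]:
    """Return (start, end) of the analysed region, 1-based inclusive (assumed)."""
    if orf_end < orf_start:
        raise ValueError("ORF end must not be smaller than ORF start")
    win_start = orf_start - upstream
    win_end = orf_end + downstream
    if win_start < 1:
        raise ValueError("window extends past the start of the chromosome")
    return win_start, win_end


# ORF coordinates of CAGL0H03289g on Cagl0H, as listed in this record.
win_start, win_end = analysis_window(309091, 310248)
win_length = win_end - win_start + 1
orf_offset = 309091 - win_start  # 0-based offset of the ORF start in the window

print(win_start, win_end, win_length, orf_offset)  # 308091 310548 2458 1000
```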
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/CAGL/CAGL0H03289p