CAGL0A04235g
similar to uniprot|P34218 Saccharomyces cerevisiae YBL052c SAS3 Histone acetyltransferase catalytic subunit of NuA3 complex that acetylates histone H3, involved in transcriptional silencing
Element type: CDS
Element length: 2160 nucleotides,
on sense strand of
Cagl0A: 419951..422110.
Other names:
CAGL-CDS1027.1
CAGL-IPF1155
Coding sequence: 720 codons.
Element length: 2160 nucleotides,
on sense strand of
Cagl0A: 419951..422110.
Other names:
CAGL-CDS1027.1
CAGL-IPF1155
Coding sequence: 720 codons.
Database cross references:
EMBL: CR380947
GeneID: 2886335
HOGENOM: Q6FY84
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380947
GeneID: 2886335
HOGENOM: Q6FY84
Homologs and Orthologs
Homologs in protein families: GL3C0136 GL3C0136.N4Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0A04235p 
similar to uniprot|P34218 Saccharomyces cerevisiae YBL052c SAS3 silencing protein
Protein domain map
Database cross references:
Gene3D: G3DSA:3.40.630.30
InterPro: IPR002717
InterPro: IPR016181
KEGG: cgr:CAGL0A04235g
Pfam: PF01853
RefSeq: XP_444951.1
UniProtKB/TrEMBL: Q6FY84
UniprotKB: Q6FY84_CANGA
Gene3D: G3DSA:3.40.630.30
InterPro: IPR002717
InterPro: IPR016181
KEGG: cgr:CAGL0A04235g
Pfam: PF01853
RefSeq: XP_444951.1
UniProtKB/TrEMBL: Q6FY84
UniprotKB: Q6FY84_CANGA
Sequence data 
Nucleotide sequence
>CAGL0A04235g.nt ATTCTCTTTTTTCTAAAATTAATGAACAGGCTTTTAACAACCCACTACTTTATAGATAAT ATTTGGTAAATTCACGTCGTGGTTGTCCCCATACAAGAAAATTGTCACCTTCTATCTTTT CTTCATCTCAATTGCATGCTTAAACACGCGTTCAGAATATTGTCCAAACTCAAATCTCAA TGTGATATATGCCTCACTTGCAAAATTTATTCCTTACCATTTAAACCAATCAATTTTTGC AACACCTTGACTTTTTTACGTCACGCACAGGGGGGACAATCTCAACAATCTCATAACATA GCTATGAAAAATGAGTTAGCTACCTACTCCCTATTGCAACAACTATTTTGCCTACCAACT CGACGTTAATGATATTATTCTGCGGAATCATAAGCATTCTATACATGGTTTCAACTTTTT CAGCGAAACATTGGCTTCTTGATTCTCGCAATACCCTCCATTTCAAAATGTATTTTAAAA AAAACAACTAATATGTACTTTCTTATGTATTATTCTAACTAAATTCATATCACACTACTC ATAGTTGGTACAGCATGCAAGAGGCATGATACGATTAAGTCAGAGCTTATTGTTGCAGTT CAGTCACATCACTGAATACGCTGCATTTCTTCATTGAGCCTGAATTGATCACAAGGTAAA CAAGTGGATTACACCTTAACTATGATGTATATTGGGTAGAGGACATAATAGTTAGCGTAT GCATACACCGTGCCACCGAATCTGCACTACAATCATAAGCTATATCAAATTATAGCGCTA CAGTTCCATAGGAAGGAATTGCAAAAGAGAACCAACAGATAATTAAACTTAAAGTTTAAT AAAAGATATTAACACCACAAAGCTAGACAGTAAAGATTTTAATGACCTATCAAAACGATA ATCGGAGTCCACTATAGGAAGCAATTACGATACTTTACCAGCAAAACAACAGGAAATCTA ACAGTTATAAGCGATACGAAGTTGCCTATAACTTCTATAAATGAATCGAGCCACACGAAG TCGAAAACCAGATCAACTCCCGTCACAACCGCAACGATATGATTACAATACCATTAATGA TCTGGGTGCAACCCATATGACAAATGGTGGAACCAATGTTGCATCCATAACAACCAATCT ATCGACAAATCCGAATGTTGAAATTCATGAAGAAGTTTTCGAGAACAATGCGAGAAAACT AGGTCCCTTAAAAATTAGATATGATTCGAAAAAACTTTTGAATTTTAAGAGATTATTGGA GGTTAGATCAGAAAATGCTATCAAAGACTCTGAGGCATCTATCCGAGAGTTTGATCCGAA CGGTGATAGTGATGCACTCCAAATACCGGAGGTAGACGACGATATCCCATACAGAGGAGT TGTTGTAGGCAAAAAGAATTACAGCACTCATCGAACAATCCCTTCATCAACTGATCGTGA GTTCTTCAGGAGATTATTCCTAGAATCATCTACAGCTGCATTTTACAATGGTAATGTTCT TTTAGGGAATCATGATATAAATGATAACGATACCAGTCAACCACCGAATAAGAAATTAAA GAACGTGAAAGCTGTAAAAGAAAATCCAAAGACAATAGAATATGTATACATAAGAGATTC AGAAGTCAAGACATGGTACACAGCCCCATATCCCGAGGAATTCAATAAAAATAAAATACT TTACGTCTGTGAATATTGCCTGAAGTACATGAATTCACGTTTTGTATATTATAGACACAC ACTGAAATGCAAGGACCATAGACCCCCAGGTAATGAAATTTACAGAGATGAGAATGTCTC CGTCTGGGAAATAGATGGCAGAGAAAACGTTGTCTACTGCCAAAATTTATGTCTACTAGC TAAATTGTTCCTTAATTCCAAGACTCTTTACTATGATGTAGAGCCATTTGTGTTTTATGT GTTGACTGAACGAGAAGTATCTGAGGATGGAAGAACAGTGAAAAATCACTTCGTTGGCTA TTTTAGCAAAGAAAAACTAAACTCATCTGGGTACAATTTAAGTTGTATTATTACTTTACC TCTCTATCAGAGACGTGGATATGGCCATTTCTTGATGGATTTTTCATATTTACTCTCTAA AAGAGAATTTTCACAGGGCACTCCTGAGAAACCGTTATCTGATCTAGGCCTTATTACTTA TAGAAATTTCTGGAAGTTAAAATGTGCTGAAACACTTCTTTATTTGAAGAATGAACTTAA TTTGGAAGATAGTGAGAGTGATGATAAATTTCCTCTGGTCTCTATAGAGGATCTAGCAAA TCTAACCGGAATGCTTCCAACTGATGTTATACTTGGCCTTGAAGAGTTAGGCGTATTTTA CAGGTGTCCGGATCCTAATCAAAATACCACATCGTACTGTATTAAAATCGATTCTTGGAA TAGAATCAAGGCCATTCGTGAAAACTGGCTAAGAAAGGGCTATCAGTCATTAAAACCAGA AAACTTAATTTGGAAGCCACTAATATATGGCCCATCAGGTGGTGTAAATGCGTTAGGAAT GGTAGAACCCCCCAGTTTGCCAGAAGATCGTAAAACAAGTATATCCAGTGAGCCGAATTT CCAGAACAATCCAGTTGACTTCTTTGGTAGCCATATAACCATGGTGAAGAAGTTTATGAC TGATGATATCGAAGATCCAAGGGACTTAGAAATTCTGACAATAGACAATATTAAGAAGAG AAAACTATCTGTTGGCAAAAACAACATGTTGCAACAGAGTTGGGAGATTGCATATCAAGA TCCAAGGCCTGTTGATAAAAAGGATACTACTGCACGTAAAGCTCCCTCTCTTCTTTCTTC AAAAACTACTCGAAAGGAATCCTTGATGTCTGCTGAGACCGAAGATGTACTCCCTTATGA AAACCAAGAAATGGATGATACTTCAGTAGTATTAGAAACTGAAGAAAGCGACCCTGACGA CAATGATTACGATGAGGAGGATATTAACGAGAAAGTCATATCTTCTGATTCATCATCTTT AGTGGTGTCTTCTGAGGAGGAAAATACAGATGAGACAATACCAGTCAGAAGATTTCCGCG AAGGCATGCTTCTGGGTTAGATGATGCATTGGATGACGATAGAGACGAATTGATAGATCT TACTACCTCAAGACAGAAAAGACAACTGAGGAGGATGTGATATGAACATCAAAACTGTTA ATAGTATATAAAAGTTTTTATTCCATCTAATTATTCATAAAAAGTACATTTGTGCATCAT GCAATTAACTTTTGCACAGAAACTATAAAAAACTATAAACTCATGACATGAAACATTTAA TGATTGATTAATAATAACCATGATATACGATATATGCAATAATGTAAGCTATCTAGCCAG CCAGACTTTGTTATTTTCTCTTGTTAAATCCTCAAACCGTGTTGTCACAGGTAATTACGC GAGAATGTATCTAAACGCCATTTGATTTCATTTTCTGATC
Coding sequence
>CAGL0A04235g.cds ATGAATCGAGCCACACGAAGTCGAAAACCAGATCAACTCCCGTCACAACCGCAACGATAT GATTACAATACCATTAATGATCTGGGTGCAACCCATATGACAAATGGTGGAACCAATGTT GCATCCATAACAACCAATCTATCGACAAATCCGAATGTTGAAATTCATGAAGAAGTTTTC GAGAACAATGCGAGAAAACTAGGTCCCTTAAAAATTAGATATGATTCGAAAAAACTTTTG AATTTTAAGAGATTATTGGAGGTTAGATCAGAAAATGCTATCAAAGACTCTGAGGCATCT ATCCGAGAGTTTGATCCGAACGGTGATAGTGATGCACTCCAAATACCGGAGGTAGACGAC GATATCCCATACAGAGGAGTTGTTGTAGGCAAAAAGAATTACAGCACTCATCGAACAATC CCTTCATCAACTGATCGTGAGTTCTTCAGGAGATTATTCCTAGAATCATCTACAGCTGCA TTTTACAATGGTAATGTTCTTTTAGGGAATCATGATATAAATGATAACGATACCAGTCAA CCACCGAATAAGAAATTAAAGAACGTGAAAGCTGTAAAAGAAAATCCAAAGACAATAGAA TATGTATACATAAGAGATTCAGAAGTCAAGACATGGTACACAGCCCCATATCCCGAGGAA TTCAATAAAAATAAAATACTTTACGTCTGTGAATATTGCCTGAAGTACATGAATTCACGT TTTGTATATTATAGACACACACTGAAATGCAAGGACCATAGACCCCCAGGTAATGAAATT TACAGAGATGAGAATGTCTCCGTCTGGGAAATAGATGGCAGAGAAAACGTTGTCTACTGC CAAAATTTATGTCTACTAGCTAAATTGTTCCTTAATTCCAAGACTCTTTACTATGATGTA GAGCCATTTGTGTTTTATGTGTTGACTGAACGAGAAGTATCTGAGGATGGAAGAACAGTG AAAAATCACTTCGTTGGCTATTTTAGCAAAGAAAAACTAAACTCATCTGGGTACAATTTA AGTTGTATTATTACTTTACCTCTCTATCAGAGACGTGGATATGGCCATTTCTTGATGGAT TTTTCATATTTACTCTCTAAAAGAGAATTTTCACAGGGCACTCCTGAGAAACCGTTATCT GATCTAGGCCTTATTACTTATAGAAATTTCTGGAAGTTAAAATGTGCTGAAACACTTCTT TATTTGAAGAATGAACTTAATTTGGAAGATAGTGAGAGTGATGATAAATTTCCTCTGGTC TCTATAGAGGATCTAGCAAATCTAACCGGAATGCTTCCAACTGATGTTATACTTGGCCTT GAAGAGTTAGGCGTATTTTACAGGTGTCCGGATCCTAATCAAAATACCACATCGTACTGT ATTAAAATCGATTCTTGGAATAGAATCAAGGCCATTCGTGAAAACTGGCTAAGAAAGGGC TATCAGTCATTAAAACCAGAAAACTTAATTTGGAAGCCACTAATATATGGCCCATCAGGT GGTGTAAATGCGTTAGGAATGGTAGAACCCCCCAGTTTGCCAGAAGATCGTAAAACAAGT ATATCCAGTGAGCCGAATTTCCAGAACAATCCAGTTGACTTCTTTGGTAGCCATATAACC ATGGTGAAGAAGTTTATGACTGATGATATCGAAGATCCAAGGGACTTAGAAATTCTGACA ATAGACAATATTAAGAAGAGAAAACTATCTGTTGGCAAAAACAACATGTTGCAACAGAGT TGGGAGATTGCATATCAAGATCCAAGGCCTGTTGATAAAAAGGATACTACTGCACGTAAA GCTCCCTCTCTTCTTTCTTCAAAAACTACTCGAAAGGAATCCTTGATGTCTGCTGAGACC GAAGATGTACTCCCTTATGAAAACCAAGAAATGGATGATACTTCAGTAGTATTAGAAACT GAAGAAAGCGACCCTGACGACAATGATTACGATGAGGAGGATATTAACGAGAAAGTCATA TCTTCTGATTCATCATCTTTAGTGGTGTCTTCTGAGGAGGAAAATACAGATGAGACAATA CCAGTCAGAAGATTTCCGCGAAGGCATGCTTCTGGGTTAGATGATGCATTGGATGACGAT AGAGACGAATTGATAGATCTTACTACCTCAAGACAGAAAAGACAACTGAGGAGGATGTGA
Predicted translation product
>CAGL0A04235g.aa MNRATRSRKPDQLPSQPQRYDYNTINDLGATHMTNGGTNVASITTNLSTNPNVEIHEEVF ENNARKLGPLKIRYDSKKLLNFKRLLEVRSENAIKDSEASIREFDPNGDSDALQIPEVDD DIPYRGVVVGKKNYSTHRTIPSSTDREFFRRLFLESSTAAFYNGNVLLGNHDINDNDTSQ PPNKKLKNVKAVKENPKTIEYVYIRDSEVKTWYTAPYPEEFNKNKILYVCEYCLKYMNSR FVYYRHTLKCKDHRPPGNEIYRDENVSVWEIDGRENVVYCQNLCLLAKLFLNSKTLYYDV EPFVFYVLTEREVSEDGRTVKNHFVGYFSKEKLNSSGYNLSCIITLPLYQRRGYGHFLMD FSYLLSKREFSQGTPEKPLSDLGLITYRNFWKLKCAETLLYLKNELNLEDSESDDKFPLV SIEDLANLTGMLPTDVILGLEELGVFYRCPDPNQNTTSYCIKIDSWNRIKAIRENWLRKG YQSLKPENLIWKPLIYGPSGGVNALGMVEPPSLPEDRKTSISSEPNFQNNPVDFFGSHIT MVKKFMTDDIEDPRDLEILTIDNIKKRKLSVGKNNMLQQSWEIAYQDPRPVDKKDTTARK APSLLSSKTTRKESLMSAETEDVLPYENQEMDDTSVVLETEESDPDDNDYDEEDINEKVI SSDSSSLVVSSEEENTDETIPVRRFPRRHASGLDDALDDDRDELIDLTTSRQKRQLRRM*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0A04235p