CAGL0A04235g


similar to uniprot|P34218 Saccharomyces cerevisiae YBL052c SAS3 Histone acetyltransferase catalytic subunit of NuA3 complex that acetylates histone H3, involved in transcriptional silencing

Genomic environment map

Element type: CDS
Element length: 2160 nucleotides,
on sense strand of
Cagl0A: 419951..422110.
Other names:
CAGL-CDS1027.1
CAGL-IPF1155
Coding sequence: 720 codons.
Database cross references:
EMBL: CR380947
GeneID: 2886335
HOGENOM: Q6FY84

Homologs and Orthologs

Homologs in protein families: GL3C0136 GL3C0136.N4
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0A04235p  


similar to uniprot|P34218 Saccharomyces cerevisiae YBL052c SAS3 silencing protein

Protein domain map

Protein length: 719 amino acids
Protein family: GL3C0136
Database cross references:
Gene3D: G3DSA:3.40.630.30
InterPro: IPR002717
InterPro: IPR016181
KEGG: cgr:CAGL0A04235g
Pfam: PF01853
RefSeq: XP_444951.1
UniProtKB/TrEMBL: Q6FY84
UniprotKB: Q6FY84_CANGA

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>CAGL0A04235g.nt
ATTCTCTTTTTTCTAAAATTAATGAACAGGCTTTTAACAACCCACTACTTTATAGATAAT
ATTTGGTAAATTCACGTCGTGGTTGTCCCCATACAAGAAAATTGTCACCTTCTATCTTTT
CTTCATCTCAATTGCATGCTTAAACACGCGTTCAGAATATTGTCCAAACTCAAATCTCAA
TGTGATATATGCCTCACTTGCAAAATTTATTCCTTACCATTTAAACCAATCAATTTTTGC
AACACCTTGACTTTTTTACGTCACGCACAGGGGGGACAATCTCAACAATCTCATAACATA
GCTATGAAAAATGAGTTAGCTACCTACTCCCTATTGCAACAACTATTTTGCCTACCAACT
CGACGTTAATGATATTATTCTGCGGAATCATAAGCATTCTATACATGGTTTCAACTTTTT
CAGCGAAACATTGGCTTCTTGATTCTCGCAATACCCTCCATTTCAAAATGTATTTTAAAA
AAAACAACTAATATGTACTTTCTTATGTATTATTCTAACTAAATTCATATCACACTACTC
ATAGTTGGTACAGCATGCAAGAGGCATGATACGATTAAGTCAGAGCTTATTGTTGCAGTT
CAGTCACATCACTGAATACGCTGCATTTCTTCATTGAGCCTGAATTGATCACAAGGTAAA
CAAGTGGATTACACCTTAACTATGATGTATATTGGGTAGAGGACATAATAGTTAGCGTAT
GCATACACCGTGCCACCGAATCTGCACTACAATCATAAGCTATATCAAATTATAGCGCTA
CAGTTCCATAGGAAGGAATTGCAAAAGAGAACCAACAGATAATTAAACTTAAAGTTTAAT
AAAAGATATTAACACCACAAAGCTAGACAGTAAAGATTTTAATGACCTATCAAAACGATA
ATCGGAGTCCACTATAGGAAGCAATTACGATACTTTACCAGCAAAACAACAGGAAATCTA
ACAGTTATAAGCGATACGAAGTTGCCTATAACTTCTATAAATGAATCGAGCCACACGAAG
TCGAAAACCAGATCAACTCCCGTCACAACCGCAACGATATGATTACAATACCATTAATGA
TCTGGGTGCAACCCATATGACAAATGGTGGAACCAATGTTGCATCCATAACAACCAATCT
ATCGACAAATCCGAATGTTGAAATTCATGAAGAAGTTTTCGAGAACAATGCGAGAAAACT
AGGTCCCTTAAAAATTAGATATGATTCGAAAAAACTTTTGAATTTTAAGAGATTATTGGA
GGTTAGATCAGAAAATGCTATCAAAGACTCTGAGGCATCTATCCGAGAGTTTGATCCGAA
CGGTGATAGTGATGCACTCCAAATACCGGAGGTAGACGACGATATCCCATACAGAGGAGT
TGTTGTAGGCAAAAAGAATTACAGCACTCATCGAACAATCCCTTCATCAACTGATCGTGA
GTTCTTCAGGAGATTATTCCTAGAATCATCTACAGCTGCATTTTACAATGGTAATGTTCT
TTTAGGGAATCATGATATAAATGATAACGATACCAGTCAACCACCGAATAAGAAATTAAA
GAACGTGAAAGCTGTAAAAGAAAATCCAAAGACAATAGAATATGTATACATAAGAGATTC
AGAAGTCAAGACATGGTACACAGCCCCATATCCCGAGGAATTCAATAAAAATAAAATACT
TTACGTCTGTGAATATTGCCTGAAGTACATGAATTCACGTTTTGTATATTATAGACACAC
ACTGAAATGCAAGGACCATAGACCCCCAGGTAATGAAATTTACAGAGATGAGAATGTCTC
CGTCTGGGAAATAGATGGCAGAGAAAACGTTGTCTACTGCCAAAATTTATGTCTACTAGC
TAAATTGTTCCTTAATTCCAAGACTCTTTACTATGATGTAGAGCCATTTGTGTTTTATGT
GTTGACTGAACGAGAAGTATCTGAGGATGGAAGAACAGTGAAAAATCACTTCGTTGGCTA
TTTTAGCAAAGAAAAACTAAACTCATCTGGGTACAATTTAAGTTGTATTATTACTTTACC
TCTCTATCAGAGACGTGGATATGGCCATTTCTTGATGGATTTTTCATATTTACTCTCTAA
AAGAGAATTTTCACAGGGCACTCCTGAGAAACCGTTATCTGATCTAGGCCTTATTACTTA
TAGAAATTTCTGGAAGTTAAAATGTGCTGAAACACTTCTTTATTTGAAGAATGAACTTAA
TTTGGAAGATAGTGAGAGTGATGATAAATTTCCTCTGGTCTCTATAGAGGATCTAGCAAA
TCTAACCGGAATGCTTCCAACTGATGTTATACTTGGCCTTGAAGAGTTAGGCGTATTTTA
CAGGTGTCCGGATCCTAATCAAAATACCACATCGTACTGTATTAAAATCGATTCTTGGAA
TAGAATCAAGGCCATTCGTGAAAACTGGCTAAGAAAGGGCTATCAGTCATTAAAACCAGA
AAACTTAATTTGGAAGCCACTAATATATGGCCCATCAGGTGGTGTAAATGCGTTAGGAAT
GGTAGAACCCCCCAGTTTGCCAGAAGATCGTAAAACAAGTATATCCAGTGAGCCGAATTT
CCAGAACAATCCAGTTGACTTCTTTGGTAGCCATATAACCATGGTGAAGAAGTTTATGAC
TGATGATATCGAAGATCCAAGGGACTTAGAAATTCTGACAATAGACAATATTAAGAAGAG
AAAACTATCTGTTGGCAAAAACAACATGTTGCAACAGAGTTGGGAGATTGCATATCAAGA
TCCAAGGCCTGTTGATAAAAAGGATACTACTGCACGTAAAGCTCCCTCTCTTCTTTCTTC
AAAAACTACTCGAAAGGAATCCTTGATGTCTGCTGAGACCGAAGATGTACTCCCTTATGA
AAACCAAGAAATGGATGATACTTCAGTAGTATTAGAAACTGAAGAAAGCGACCCTGACGA
CAATGATTACGATGAGGAGGATATTAACGAGAAAGTCATATCTTCTGATTCATCATCTTT
AGTGGTGTCTTCTGAGGAGGAAAATACAGATGAGACAATACCAGTCAGAAGATTTCCGCG
AAGGCATGCTTCTGGGTTAGATGATGCATTGGATGACGATAGAGACGAATTGATAGATCT
TACTACCTCAAGACAGAAAAGACAACTGAGGAGGATGTGATATGAACATCAAAACTGTTA
ATAGTATATAAAAGTTTTTATTCCATCTAATTATTCATAAAAAGTACATTTGTGCATCAT
GCAATTAACTTTTGCACAGAAACTATAAAAAACTATAAACTCATGACATGAAACATTTAA
TGATTGATTAATAATAACCATGATATACGATATATGCAATAATGTAAGCTATCTAGCCAG
CCAGACTTTGTTATTTTCTCTTGTTAAATCCTCAAACCGTGTTGTCACAGGTAATTACGC
GAGAATGTATCTAAACGCCATTTGATTTCATTTTCTGATC

Coding sequence

>CAGL0A04235g.cds
ATGAATCGAGCCACACGAAGTCGAAAACCAGATCAACTCCCGTCACAACCGCAACGATAT
GATTACAATACCATTAATGATCTGGGTGCAACCCATATGACAAATGGTGGAACCAATGTT
GCATCCATAACAACCAATCTATCGACAAATCCGAATGTTGAAATTCATGAAGAAGTTTTC
GAGAACAATGCGAGAAAACTAGGTCCCTTAAAAATTAGATATGATTCGAAAAAACTTTTG
AATTTTAAGAGATTATTGGAGGTTAGATCAGAAAATGCTATCAAAGACTCTGAGGCATCT
ATCCGAGAGTTTGATCCGAACGGTGATAGTGATGCACTCCAAATACCGGAGGTAGACGAC
GATATCCCATACAGAGGAGTTGTTGTAGGCAAAAAGAATTACAGCACTCATCGAACAATC
CCTTCATCAACTGATCGTGAGTTCTTCAGGAGATTATTCCTAGAATCATCTACAGCTGCA
TTTTACAATGGTAATGTTCTTTTAGGGAATCATGATATAAATGATAACGATACCAGTCAA
CCACCGAATAAGAAATTAAAGAACGTGAAAGCTGTAAAAGAAAATCCAAAGACAATAGAA
TATGTATACATAAGAGATTCAGAAGTCAAGACATGGTACACAGCCCCATATCCCGAGGAA
TTCAATAAAAATAAAATACTTTACGTCTGTGAATATTGCCTGAAGTACATGAATTCACGT
TTTGTATATTATAGACACACACTGAAATGCAAGGACCATAGACCCCCAGGTAATGAAATT
TACAGAGATGAGAATGTCTCCGTCTGGGAAATAGATGGCAGAGAAAACGTTGTCTACTGC
CAAAATTTATGTCTACTAGCTAAATTGTTCCTTAATTCCAAGACTCTTTACTATGATGTA
GAGCCATTTGTGTTTTATGTGTTGACTGAACGAGAAGTATCTGAGGATGGAAGAACAGTG
AAAAATCACTTCGTTGGCTATTTTAGCAAAGAAAAACTAAACTCATCTGGGTACAATTTA
AGTTGTATTATTACTTTACCTCTCTATCAGAGACGTGGATATGGCCATTTCTTGATGGAT
TTTTCATATTTACTCTCTAAAAGAGAATTTTCACAGGGCACTCCTGAGAAACCGTTATCT
GATCTAGGCCTTATTACTTATAGAAATTTCTGGAAGTTAAAATGTGCTGAAACACTTCTT
TATTTGAAGAATGAACTTAATTTGGAAGATAGTGAGAGTGATGATAAATTTCCTCTGGTC
TCTATAGAGGATCTAGCAAATCTAACCGGAATGCTTCCAACTGATGTTATACTTGGCCTT
GAAGAGTTAGGCGTATTTTACAGGTGTCCGGATCCTAATCAAAATACCACATCGTACTGT
ATTAAAATCGATTCTTGGAATAGAATCAAGGCCATTCGTGAAAACTGGCTAAGAAAGGGC
TATCAGTCATTAAAACCAGAAAACTTAATTTGGAAGCCACTAATATATGGCCCATCAGGT
GGTGTAAATGCGTTAGGAATGGTAGAACCCCCCAGTTTGCCAGAAGATCGTAAAACAAGT
ATATCCAGTGAGCCGAATTTCCAGAACAATCCAGTTGACTTCTTTGGTAGCCATATAACC
ATGGTGAAGAAGTTTATGACTGATGATATCGAAGATCCAAGGGACTTAGAAATTCTGACA
ATAGACAATATTAAGAAGAGAAAACTATCTGTTGGCAAAAACAACATGTTGCAACAGAGT
TGGGAGATTGCATATCAAGATCCAAGGCCTGTTGATAAAAAGGATACTACTGCACGTAAA
GCTCCCTCTCTTCTTTCTTCAAAAACTACTCGAAAGGAATCCTTGATGTCTGCTGAGACC
GAAGATGTACTCCCTTATGAAAACCAAGAAATGGATGATACTTCAGTAGTATTAGAAACT
GAAGAAAGCGACCCTGACGACAATGATTACGATGAGGAGGATATTAACGAGAAAGTCATA
TCTTCTGATTCATCATCTTTAGTGGTGTCTTCTGAGGAGGAAAATACAGATGAGACAATA
CCAGTCAGAAGATTTCCGCGAAGGCATGCTTCTGGGTTAGATGATGCATTGGATGACGAT
AGAGACGAATTGATAGATCTTACTACCTCAAGACAGAAAAGACAACTGAGGAGGATGTGA

Predicted translation product

>CAGL0A04235g.aa
MNRATRSRKPDQLPSQPQRYDYNTINDLGATHMTNGGTNVASITTNLSTNPNVEIHEEVF
ENNARKLGPLKIRYDSKKLLNFKRLLEVRSENAIKDSEASIREFDPNGDSDALQIPEVDD
DIPYRGVVVGKKNYSTHRTIPSSTDREFFRRLFLESSTAAFYNGNVLLGNHDINDNDTSQ
PPNKKLKNVKAVKENPKTIEYVYIRDSEVKTWYTAPYPEEFNKNKILYVCEYCLKYMNSR
FVYYRHTLKCKDHRPPGNEIYRDENVSVWEIDGRENVVYCQNLCLLAKLFLNSKTLYYDV
EPFVFYVLTEREVSEDGRTVKNHFVGYFSKEKLNSSGYNLSCIITLPLYQRRGYGHFLMD
FSYLLSKREFSQGTPEKPLSDLGLITYRNFWKLKCAETLLYLKNELNLEDSESDDKFPLV
SIEDLANLTGMLPTDVILGLEELGVFYRCPDPNQNTTSYCIKIDSWNRIKAIRENWLRKG
YQSLKPENLIWKPLIYGPSGGVNALGMVEPPSLPEDRKTSISSEPNFQNNPVDFFGSHIT
MVKKFMTDDIEDPRDLEILTIDNIKKRKLSVGKNNMLQQSWEIAYQDPRPVDKKDTTARK
APSLLSSKTTRKESLMSAETEDVLPYENQEMDDTSVVLETEESDPDDNDYDEEDINEKVI
SSDSSSLVVSSEEENTDETIPVRRFPRRHASGLDDALDDDRDELIDLTTSRQKRQLRRM*





Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites