CAGL0J03696g


highly similar to uniprot|Q08649 Saccharomyces cerevisiae YOR244w ESA1 Histone acetyltransferase catalytic subunit of the native multisubunit complex (NuA4) that acetylates four conserved internal lysines of histone H4 N-terminal tail

Genomic environment map

Element type: CDS
Element length: 1341 nucleotides, on anti-sense strand of Cagl0J: complement(354372..355712).
Other names:
CAGL-CDS2434.1
CAGL-IPF6886
Coding sequence: 447 codons.
Database cross references:
EMBL: CR380956
GeneID: 2889685
HOGENOM: Q6FPH9
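
The element length and codon count above follow from the location string, since the coordinates are inclusive. The sketch below shows that arithmetic. It assumes the location uses the EMBL/GenBank-style `start..end` and `complement(start..end)` notation seen in this record; the function name `parse_location` is hypothetical and is not part of any Genolevures tool.

```python
import re

def parse_location(location):
    """Parse 'start..end' or 'complement(start..end)' into (start, end, strand)."""
    text = location.strip()
    # Reverse-strand features are wrapped in complement(...)
    match = re.fullmatch(r"complement\((\d+)\.\.(\d+)\)", text)
    strand = "-"
    if match is None:
        match = re.fullmatch(r"(\d+)\.\.(\d+)", text)
        strand = "+"
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    return int(match.group(1)), int(match.group(2)), strand

start, end, strand = parse_location("complement(354372..355712)")
length_nt = end - start + 1   # both endpoints are counted
codons = length_nt // 3       # this count includes the stop codon
print(strand, length_nt, codons)  # - 1341 447
```

Only these two simple location forms are handled; joined or partial locations would need a fuller parser.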

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0136 GL3C0136.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0J03696p  


highly similar to uniprot|Q08649 Saccharomyces cerevisiae YOR244w ESA1 histone acetyltransferase

Protein domain map

Protein length: 446 amino acids
Protein family: GL3C0136
Database cross references:
Gene3D: G3DSA:1.10.10.10
Gene3D: G3DSA:3.40.630.30
InterPro: IPR000953
InterPro: IPR002717
InterPro: IPR011991
InterPro: IPR016181
KEGG: cgr:CAGL0J03696g
Pfam: PF01853
RefSeq: XP_447865.1
SMART: SM00298
SMR: Q6FPH9
UniProtKB/Swiss-Prot: Q6FPH9
UniProtKB: ESA1_CANGA

Computed results for CAGL0J03696p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>CAGL0J03696g.nt
ATGGCTGGTGCAGAAGTTGAAGAGGAAGCTGGTATTCCTAAGAAGATTGAGAGCACTGAG
GAAGTACTGATTAAGTGTCAGTGCTGGGTGCGTAAGGATGAGGAGGAGCGGCTGGCTGAG
ATACTGTCGATAAACGCGCGCGTGAGCCCATCGAAATTCTATGTGCACTATGTCAATTTC
AACAAGCGTTTGGATGAATGGGTTACTGGGGATCGTATAAACCTGGATAAGGAAGTTATA
TTTCCGAGACCCAAGAGACAGTTGGAAGAGGACACGAACAAGAAGCAAAAGAAAAAGAAG
AAGTTTCCACAGAAAGCTGCAGTGGTGGAGTCTGATGCGAAGAGCTCAGAGATGGGTGAA
GGTAGTGATGTTATGGATCTGGATAACTTAAATGTGCGAGGTCTGAAAGACGAAGAGATA
TCTAGGGAAGATGAGATCAAAAAGTTAAGGACTTCCGGTTCCATGATTCAAAATCCACAT
GAAGTGGCTCACGTTAGAAATTTGTCCAAGATCATTATGGGGAAATTTGAAATCGAGCCT
TGGTATTTTTCTCCATATCCAATTGAGCTCACAGATCTTGATGTGGTATACATCGATGAC
TTCACGCTGCAATACTTTGGTTCCAGAAAACAATACGAACGTTATAGAAAGAAGTGTACA
CTACGGCATCCACCTGGAAATGAAATTTACAGAGACGATTATGTTTCATTCTTTGAGATA
GATGGTCGAAAGCAGAGAACGTGGTGTAGAAATCTGTGTTTGTTATCTAAACTCTTTTTA
GACCATAAGACGTTATATTACGATGTTGATCCATTTTTATTTTATTGTATGACTAGAAGA
GATGAGATGGGCCATCATTTTGTTGGCTACTTTTCTAAGGAAAAGGAATCTGCTGATGGT
TATAATGTTGCATGTATCCTAACATTACCACAATATCAGCGTATGGGTTATGGTAGATTA
TTAATCGAATTTTCTTATGAACTATCCAAAAAGGAAGGAAAAGTAGGTTCGCCGGAAAAG
CCATTGTCGGATTTGGGTTTGCTATCTTATAGAGCTTATTGGTCGGACGTTTTAATCACA
CTTCTAGTGGAGCATGGAAAAGAAGTTACCATTGATGAAATAAGTTCAATGACATCAATG
ACAACAACTGATATCCTTCACACTTTAAAAACATTAAACATTCTACGATACTACAAGGGA
CAGCATATTATATTTTTAAATGACGATATATTAGAAAGGTACAACCAGTTAAAAACCAAA
AAGAGAAGACATATCGATGCAGAGAAGTTACTATGGAAACCTCCTGTATTTACTGCATCG
CAATTGAGATTTGCCTGGTAG

Coding sequence    

>CAGL0J03696g.cds
ATGGCTGGTGCAGAAGTTGAAGAGGAAGCTGGTATTCCTAAGAAGATTGAGAGCACTGAG
GAAGTACTGATTAAGTGTCAGTGCTGGGTGCGTAAGGATGAGGAGGAGCGGCTGGCTGAG
ATACTGTCGATAAACGCGCGCGTGAGCCCATCGAAATTCTATGTGCACTATGTCAATTTC
AACAAGCGTTTGGATGAATGGGTTACTGGGGATCGTATAAACCTGGATAAGGAAGTTATA
TTTCCGAGACCCAAGAGACAGTTGGAAGAGGACACGAACAAGAAGCAAAAGAAAAAGAAG
AAGTTTCCACAGAAAGCTGCAGTGGTGGAGTCTGATGCGAAGAGCTCAGAGATGGGTGAA
GGTAGTGATGTTATGGATCTGGATAACTTAAATGTGCGAGGTCTGAAAGACGAAGAGATA
TCTAGGGAAGATGAGATCAAAAAGTTAAGGACTTCCGGTTCCATGATTCAAAATCCACAT
GAAGTGGCTCACGTTAGAAATTTGTCCAAGATCATTATGGGGAAATTTGAAATCGAGCCT
TGGTATTTTTCTCCATATCCAATTGAGCTCACAGATCTTGATGTGGTATACATCGATGAC
TTCACGCTGCAATACTTTGGTTCCAGAAAACAATACGAACGTTATAGAAAGAAGTGTACA
CTACGGCATCCACCTGGAAATGAAATTTACAGAGACGATTATGTTTCATTCTTTGAGATA
GATGGTCGAAAGCAGAGAACGTGGTGTAGAAATCTGTGTTTGTTATCTAAACTCTTTTTA
GACCATAAGACGTTATATTACGATGTTGATCCATTTTTATTTTATTGTATGACTAGAAGA
GATGAGATGGGCCATCATTTTGTTGGCTACTTTTCTAAGGAAAAGGAATCTGCTGATGGT
TATAATGTTGCATGTATCCTAACATTACCACAATATCAGCGTATGGGTTATGGTAGATTA
TTAATCGAATTTTCTTATGAACTATCCAAAAAGGAAGGAAAAGTAGGTTCGCCGGAAAAG
CCATTGTCGGATTTGGGTTTGCTATCTTATAGAGCTTATTGGTCGGACGTTTTAATCACA
CTTCTAGTGGAGCATGGAAAAGAAGTTACCATTGATGAAATAAGTTCAATGACATCAATG
ACAACAACTGATATCCTTCACACTTTAAAAACATTAAACATTCTACGATACTACAAGGGA
CAGCATATTATATTTTTAAATGACGATATATTAGAAAGGTACAACCAGTTAAAAACCAAA
AAGAGAAGACATATCGATGCAGAGAAGTTACTATGGAAACCTCCTGTATTTACTGCATCG
CAATTGAGATTTGCCTGGTAG

Predicted translation product    

>CAGL0J03696g.aa
MAGAEVEEEAGIPKKIESTEEVLIKCQCWVRKDEEERLAEILSINARVSPSKFYVHYVNF
NKRLDEWVTGDRINLDKEVIFPRPKRQLEEDTNKKQKKKKKFPQKAAVVESDAKSSEMGE
GSDVMDLDNLNVRGLKDEEISREDEIKKLRTSGSMIQNPHEVAHVRNLSKIIMGKFEIEP
WYFSPYPIELTDLDVVYIDDFTLQYFGSRKQYERYRKKCTLRHPPGNEIYRDDYVSFFEI
DGRKQRTWCRNLCLLSKLFLDHKTLYYDVDPFLFYCMTRRDEMGHHFVGYFSKEKESADG
YNVACILTLPQYQRMGYGRLLIEFSYELSKKEGKVGSPEKPLSDLGLLSYRAYWSDVLIT
LLVEHGKEVTIDEISSMTSMTTTDILHTLKTLNILRYYKGQHIIFLNDDILERYNQLKTK
KRRHIDAEKLLWKPPVFTASQLRFAW*
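
The predicted translation can be reproduced from the coding sequence by reading it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which is consistent with the sequences shown on this page; the names `CODON_TABLE` and `translate` are illustrative only. It translates the first and last lines of the coding sequence, which are codon-aligned because every full FASTA line holds 60 nucleotides.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: NCBI translation table 1 applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame nucleotide string; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

first_line = "ATGGCTGGTGCAGAAGTTGAAGAGGAAGCTGGTATTCCTAAGAAGATTGAGAGCACTGAG"
last_line = "CAATTGAGATTTGCCTGGTAG"
print(translate(first_line))  # MAGAEVEEEAGIPKKIESTE
print(translate(last_line))   # QLRFAW*
```

The two results match the start and end of the `.aa` record above, including the trailing `*` for the TAG stop codon.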




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
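
As a quick check of this rule, the sketch below tests whether a coding sequence length and a protein length agree. The function name `lengths_consistent` is hypothetical; the values passed in are the ones reported on this page.

```python
def lengths_consistent(cds_length_nt, protein_length_aa):
    """Return True if the CDS is a whole number of codons and has exactly
    one more codon than the protein has residues (the stop codon)."""
    codons, remainder = divmod(cds_length_nt, 3)
    return remainder == 0 and codons == protein_length_aa + 1

# 1341 nucleotides = 447 codons = 446 amino acids + 1 stop codon
print(lengths_consistent(1341, 446))  # True
```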

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites