CAGL0L09042g
similar to uniprot|Q12341 Saccharomyces cerevisiae YPL001w HAT1 Catalytic subunit of the Hat1p-Hat2p histone acetyltransferase complex that uses the cofactor acetyl coenzyme A, to acetylate free nuclear and cytoplasmic histone H4
Element type: CDS
Element length: 1167 nucleotides,
on sense strand of
Cagl0L: 985819..986985.
Other names:
CAGL-CDS2840.1
CAGL-IPF7485
Coding sequence: 389 codons.
Element length: 1167 nucleotides,
on sense strand of
Cagl0L: 985819..986985.
Other names:
CAGL-CDS2840.1
CAGL-IPF7485
Coding sequence: 389 codons.
Database cross references:
EMBL: CR380958
GeneID: 2891100
HOGENOM: Q6FKS5
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380958
GeneID: 2891100
HOGENOM: Q6FKS5
Homologs and Orthologs
Homologs in protein families: GL3R2212 GL3R2212.N1Orthologs: strict determination not possible; homologs must be refined manually
Protein CAGL0L09042p 
similar to uniprot|Q12341 Saccharomyces cerevisiae YPL001w HAT1 histone acetyltransferase subunit
Protein domain map
Database cross references:
Gene3D: G3DSA:1.10.10.390
Gene3D: G3DSA:3.40.630.30
InterPro: IPR000182
InterPro: IPR013523
InterPro: IPR016181
InterPro: IPR017380
KEGG: cgr:CAGL0L09042g
PIRSF: PIRSF038084
PROSITE: PS51186
Pfam: PF00583
RefSeq: XP_449169.1
UniProtKB/Swiss-Prot: Q6FKS5
UniprotKB: HAT1_CANGA
Gene3D: G3DSA:1.10.10.390
Gene3D: G3DSA:3.40.630.30
InterPro: IPR000182
InterPro: IPR013523
InterPro: IPR016181
InterPro: IPR017380
KEGG: cgr:CAGL0L09042g
PIRSF: PIRSF038084
PROSITE: PS51186
Pfam: PF00583
RefSeq: XP_449169.1
UniProtKB/Swiss-Prot: Q6FKS5
UniprotKB: HAT1_CANGA
Sequence data 
>CAGL0L09042g.nt TAGAATCTCATTTCCGGCTTGATTATCGATCCATAACATACTATTCGCCGCCATATCTTC CAATACTGACCGACCTCTTACAGCATCCCACTGAAAATTTGCCCTGAGAAGCGATATACT AGCATAACCCATAATAGAGCAAACCTCCAGAATTTTTGTCTGATCTCCTGTTATTTCATT TGGTACACTTCTCAGAAATTTTTTTCCTCGAATGCTGAATACTTTAAATCCACCATCTAG CACTTGAATCATATCTATTGATTTTTCTAAATCTTCCATGGTAACATTTAGATTCTTAAA ATAACCATTTCGCAGTTCATCAAAAGATATTACTCCACCATTCATGTCTTTAGTTTCCCT ACACACTTCTATAATTTTAACCGATATTTCATAATAAAAATCATCAACATTAAATAAATG TTGGTCTTTCTCAAATATTTCTAGAGGGTCTATACCGATAGATGAGCACATACGTAAAAA CCTTGATCTAAATTCAGGATTAGCCTTCAGATCTTTATTGTGCTTTTTGGCAAAGTCAAT TAACCTGCCCTGGAAAACTTCTAATTGCTGCTCGAGTTCCAATGATTGCCTTTCCAGAAT ATTTGAACTTACATTTTTATAGCTTTCATCATTAAGATGATCAAAAGCAGACAAACCAAA GTTTGGCATCAACAATTCCTCAGATAGTATTTTATTCTTTATAACTCGTTAGTCTGTAGC TACTAAATACTTTGTAGCTTAAAGATATTTCAATTTAAGGATCTAGCTAGTAGCGTAGTT CAAACAACCACATATAAACCTTTCTATATACTATCATTATTTTATCTCTGTTTTTATGTT TGACCTTTTTTAAACAAGATCCTTGTACGGCTCATTTCAACGTTCGAAGTTGCGAAACAC GTAAAATAAGCTTACCAAATTCTTATCATCTAATATAGTGTTATCTTAGTCTTCATTACA GTTACCAGACTAGATTTATTAAGAAACACGATAGAAAGCAATGTCAATTGATGATTTCAA GCCTGAAAAGTGGACGATATCTTCAAATGAAGCTTTGAAGTTGTCGCTAGTTTCTGAGGA TAACGCGATACAGTTCTCGCCAACCTTTACCTATCCTATATTTGGCACGGAAGAACAAAT ATTTGGCTATAAGGACTTAGTTATTCATCTAGCATTCGATGCTATCACATTCAAACCATT TTTGAATGTGAAGTTTTCATCTAAGTTCGAGGGAAGTGAAGAAGAGTTAGTTAACATAAA AGAGAAGCTATTGGAATATTTACCGATTGACGATACCATTTATAAGGATGAAGAGAAATG GATTGACTCTTTTAAGAAGGAACAGGAATCTATAGAGGCCTACAAAAATGACCAAAATAT TGATGAGTATAAAATCGACAACGCTGACTTTGAGATTTACAAAGTAAACTTACAAGACCC CAAAATGAAAAGATTTCACAGGAGAATTCAAATATTTTCCTTACTTTTCATTGAAGCAGC TTCATATATTGATGAAGATGATCCAAAATGGGAAATATTTATAGTACAGACCAAGAAAGA CAAAAAGTTTGTTGGTTATGCCACCGCATACAATTACTGGTACTACCCTGGTGCTAATAA TTTTGATAGCGAGAGTAAATACCGCTACAGAGGCAAAATTTCACAATTTCTAATTTTACC ACCATATCAAGGTAGGGGTCATGGTTCACATCTGTATAACAGCATTGTAAAAAATTGGCG AAACGATTCCTCAATTTTGGAAATCGTTGTCGAAGACCCAAATGAAAGTTTCGATGACTT AAGAGATGTTAATGACCTTGAAATGTTATATAAAGATGGATTCTTCAACAAACTACCACA AGAGAGGCCAATCCCTAATGCATGGATAGAATCTACGAGATTAAAATATAAAATAGAAAA GCGACAATTCAGTAGGCTACTTGAAATGATATTGCTATCAACTGGTTCAAATAATTTTGA GTACCAAGTCAAACAGAGACTGCTGATAAAAAATAAAGATGGATTGGAAGGTATGGAAGT TTCTGATATCAAGGATGCCTTAAATAAATCCTTTGAGTCACTGCGTGAGGATTATGATCG CATTCTTGGTAAATGTCAATTTTCCAACGACGCGGATGGACCATCAAAAAAGAAAATAAA GACTTGAAATAGATTTATAGACTACAAAAGCTAACCATCTATATAATTATTGTATTCTCG ATAAAAAAAATCTTTATATCGATTCTCAATTAGCCTTACAAGCCATTCCAACTGTTATAA TTAGATATTATAATTTTTCTACTCCATTGTCATGTGATTGAAAAACAGTTAAAATGATTT TAATAAAAAAAAGTTAATAATACTAATTCTAATATTATATATGACGAAGTATATGGCTTT GATTTCCGAAATAAGAAAAGCAGTCTATAAATCGAAGTTTGGTTTTGATATACTATTTTT TCAGGTC
>CAGL0L09042g.cds ATGTCAATTGATGATTTCAAGCCTGAAAAGTGGACGATATCTTCAAATGAAGCTTTGAAG TTGTCGCTAGTTTCTGAGGATAACGCGATACAGTTCTCGCCAACCTTTACCTATCCTATA TTTGGCACGGAAGAACAAATATTTGGCTATAAGGACTTAGTTATTCATCTAGCATTCGAT GCTATCACATTCAAACCATTTTTGAATGTGAAGTTTTCATCTAAGTTCGAGGGAAGTGAA GAAGAGTTAGTTAACATAAAAGAGAAGCTATTGGAATATTTACCGATTGACGATACCATT TATAAGGATGAAGAGAAATGGATTGACTCTTTTAAGAAGGAACAGGAATCTATAGAGGCC TACAAAAATGACCAAAATATTGATGAGTATAAAATCGACAACGCTGACTTTGAGATTTAC AAAGTAAACTTACAAGACCCCAAAATGAAAAGATTTCACAGGAGAATTCAAATATTTTCC TTACTTTTCATTGAAGCAGCTTCATATATTGATGAAGATGATCCAAAATGGGAAATATTT ATAGTACAGACCAAGAAAGACAAAAAGTTTGTTGGTTATGCCACCGCATACAATTACTGG TACTACCCTGGTGCTAATAATTTTGATAGCGAGAGTAAATACCGCTACAGAGGCAAAATT TCACAATTTCTAATTTTACCACCATATCAAGGTAGGGGTCATGGTTCACATCTGTATAAC AGCATTGTAAAAAATTGGCGAAACGATTCCTCAATTTTGGAAATCGTTGTCGAAGACCCA AATGAAAGTTTCGATGACTTAAGAGATGTTAATGACCTTGAAATGTTATATAAAGATGGA TTCTTCAACAAACTACCACAAGAGAGGCCAATCCCTAATGCATGGATAGAATCTACGAGA TTAAAATATAAAATAGAAAAGCGACAATTCAGTAGGCTACTTGAAATGATATTGCTATCA ACTGGTTCAAATAATTTTGAGTACCAAGTCAAACAGAGACTGCTGATAAAAAATAAAGAT GGATTGGAAGGTATGGAAGTTTCTGATATCAAGGATGCCTTAAATAAATCCTTTGAGTCA CTGCGTGAGGATTATGATCGCATTCTTGGTAAATGTCAATTTTCCAACGACGCGGATGGA CCATCAAAAAAGAAAATAAAGACTTGA
>CAGL0L09042g.aa MSIDDFKPEKWTISSNEALKLSLVSEDNAIQFSPTFTYPIFGTEEQIFGYKDLVIHLAFD AITFKPFLNVKFSSKFEGSEEELVNIKEKLLEYLPIDDTIYKDEEKWIDSFKKEQESIEA YKNDQNIDEYKIDNADFEIYKVNLQDPKMKRFHRRIQIFSLLFIEAASYIDEDDPKWEIF IVQTKKDKKFVGYATAYNYWYYPGANNFDSESKYRYRGKISQFLILPPYQGRGHGSHLYN SIVKNWRNDSSILEIVVEDPNESFDDLRDVNDLEMLYKDGFFNKLPQERPIPNAWIESTR LKYKIEKRQFSRLLEMILLSTGSNNFEYQVKQRLLIKNKDGLEGMEVSDIKDALNKSFES LREDYDRILGKCQFSNDADGPSKKKIKT*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0L09042p