YALI0F32153g


similar to uniprot|P46677 Saccharomyces cerevisiae YGR274c TAF145 TFIID subunit (TBP-associated factor)

Genomic environment map

Element type: CDS
Element length: 3243 nucleotides, on anti-sense strand of Yali0F: complement(3985832..3989074).
Other names:
YALI-CDS0355.1
YALI-IPF1765
Coding sequence: 1081 codons.
Database cross references:
EMBL: CR382132
GeneID: 2907745
GenomeReviews: CR382132_GR
HOGENOM: HBG398361

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3R1454
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0F32153p  


similar to uniprot|P46677 Saccharomyces cerevisiae YGR274c TAF145 TFIID subunit (TBP-associated factor); SubName: Full=YALI0F32153p;

Protein domain map

Protein length: 1080 amino acids
Protein family: GL3R1454
Database cross references:
InterPro: IPR001878
InterPro: IPR022591
KEGG: yli:YALI0F32153g
Pfam: PF12157
RefSeq: XP_506122.1
SMART: SM00343
UniProtKB/TrEMBL: Q6BZP0
UniProtKB: Q6BZP0_YARLI

Phylogeny  

PhylomeDB:YALI0F32153g

Computed results for YALI0F32153p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>YALI0F32153g.nt
ATGGCCGACGAAGACGCTGACTGGGCCAGAGCACTCGGTGGCGACCCTACCAACGTTGGA
GGTGTTGTCCCATCGCTTGATGGTGCAGATGCCATCAAACACTCCGCAGACGCCATCGAT
TTCGAGGACGAAGACGAGCTGGCAGATGATGAGCTGCCCGAAGAGGAGGAAGCAACCGGA
AACAACGTGATTGAGGAGGATGAGGAGATGATTGTCGAGGAGGAAGCTGGTCTATCCATG
TCTGGGAACGATAGCCTGGGTGGCTTCCAGTTCCACATTGGCGAGGATGAGCTTGGAGGA
TTTGATCTGGGAGATGACGAACATTGGAGCCAGAGCTGGGGAGGCAACGAAATGGACGAG
CTGCCTGTGATGGACGAGTACGATGATGGTGGTGTTGTCCATGCTGGAATGTTGAATAAC
AATGCCAACAGCAACAACCATGTCGTGCATGCCGGTGAACCTGGAGAGTCAGACGAAGAG
ATGGAGATGGAGGCTGCTGGAGGCGACGGGCTGGACATGAAGCTTGACGCAGATGAGCTA
GGCGAACTGGGAAAGTTAGGGCCTGCCGACAAGATGGACGCCTCTGGATTTGATGACATC
CAACACATGAACATCCAGGAGCTCAATCTGGAGGAAATTGCACAGATGGCGCGTCAGGCT
GATCTTGACATGCTTGCAATGTACTTCCCCGGGTTCCAAAAGGGCAAGCCTATGAAGATG
AACTCGCTCTTCTACAACAAGTTTGCTGTTCTGAGTCTGCCCAAGCCTAACACTGCTCGT
CCGTGTGTCCCCACAAAGACTAAGCTGGAGGTGGCGCCAGATGAGCGTAAGATGACACGA
GGAACAGTGTGGGTGGACCATGCTAAGCGGCGACCGGGTGTTGTAACAGTCACCGAGGCC
GAGACGGGGGACGATAAGCAGGAACTGGCCCGAGAAAGCAGAACCACACAATCAGCAACA
GACCAGCAGCTGCATTTCGACATTACTGACTGGAAGATCACGTGGGGAGATGAGGATGAT
GGGGATGGAGACGTGGATATGGATTGCGAGGGCGAAAAGGTCAAGGAGGAACAAAAAGAG
GTTGTTCCACAGCTCGGCGTGCACAGAAACGAAGAGGCCTGGAACGACGAGGCCTTGTTT
GAAGGCGATGCTCTCAGTCAGGTGAAGAAGGTCAACCTCGATATGAACGACCCCAACCTA
CTGTTCATCAACACAGATGGTCGTGTATCGAAGAATGTGACTCCTGCCATTCCCTCCACC
GAACTACAGCTTCGAAACAGATTTAACCTCAGTAACGACCGAGCCTACGATATGCTCAAG
GAGAACACACAGAGCAAGGTACGAGCCACTATTGGCAATCTGTCTATCGATCACTCCATG
CCAGCTCTCCGACTGCAAAGTCCATTCTACAAAGTGCGAATCTCGAAGCCTCAGGCTCGT
TCGTTCCATCGGCCATCATTTGTGGTGAAACCCAACACTACTGTTCACTTTTCGCGAATG
AAGATCCGAAAGAAGAAGCGAGACAGGGGAAAGCCTATCAAGGATCTTTTGGCCAAGACT
ACGGACTTGTCGTTGGGTGACTCTGCTCAGTACTTCCTCATGGAGTACGCTGAGCAATTC
CCCATGACCCTCTCCAACTACGGAATGGGCTCTAAGATGATCAACTATTACAGAAAAGCT
TCTCCTGAGGATACTTCCCGACCTAAGCTCCCCGTTGGAGAGACACATGTTCTGTCGGTC
CAGGACAAGTCTCCCTTTTGGAACTTTGGATTCGTCGAGCCGGGTAAGATTGTGCCTACT
CTGTACAACAAGATGATCCGAGCCCCCGTATTCAAGCATACACCACGTGACACAGATTTT
CTTATGATCCGATCAACTGGAGGCGATGTGACTGGGGCTGGCCAAAAGTACTTTCTGAGA
AATATTCCACATGTCTTCACTGTCGGTCAAACATACCCCGTGACTGACGTACCCGGGCCA
CACTCTCGAAAGGTAACCACGGCTTCCAAAAACCGGCTCAAGATGATTGTCTATCGAGTG
CTAAACGCATCACCTTATCATAGAATCAACGTGAAGGATATCGCCGAACATTTCCCTGAT
CAGATTGACACTCAAAACCGACAGAGGCTCAAGGAGTTTATGGAGTATCAGCGAACAGGT
GAGGATCAGGGTTACTGGAAGGTCAAGCCTACCGATACTCTTCCTGGCGAGGACGGCATC
AGAACCATGATCACACCCGAGGACATCACTCTCTTGGAGGCCATGCAGGTTGGTGTTCAG
AACCTTGAGGATGCTGGATATGGTCGAACTGACGACATTGAGAGCGACCATGAGAATGGC
GAGGAGTCTGGTCTTTCTCTGGAGGAGCAGTCTGCTCCATGGAACCTGACCAGAAACTTC
ATCAACGCTACCCAGGGCAAGGCCATGCTGCAGCTGCATGGCGAGGGTGATCCTTCTGGT
CGAGGCGAGGGTTTCTCATTCCTCAAGACTTCCATGAAGGGTGGTTTTCAGGCCGCTGGT
GAGTCTGTCAATGAGAAGCTTGACAAGAGCAAGTTTGGCGGACACAAGTACAACGTGGCC
CACCAGCAGCGTGCATACGACGATGAGATCAGCCGAATCTGGTATGCCCAGTGTCGAGCT
CTCAGCAACACCAAGGTACCTGAAGAGAACGACGAAGATTCAAAATGGGCCGATGCTGAG
GAGCAGCGAGAACAACAAGAGCGGGTGTCTACGCCTGGTTTCACTAGCGCAGCCTTCCCC
GACGATGATAACATGTCGCTCATGTCAGGCGACTCGGCCATGCAGCAGCGCAACAAGGTG
CTGCGAATCACGCGAATGGTCAAGGACGAGCACGGTATCATCCAGCGAAAGGTCGAGACC
ATCAAGGACCCCAGTGTCATTCGAGCATACATTCGACGACGAAAGCTGATGGACGAGGCC
AAGCTAACACTCGACGACTTAGACCCCACCAACGACGAGGAGGCCAACAAGCGAAACAAG
CGGCTGCTGGAGGAGCGGCTGGAGGATCTGCGAAAGAAGGGCGAGCGACGAAAGCAGCGA
CAGGCCCAGAAGCAGGGTCTCAACACAATCAACCTCGGCACCGACACTCCTCCCCCTTCA
GGCAAGGGTGTCGGCAAAGGCAAGGGTCCGCGACAGTGCAAGAACTGTGGTGCATATGGC
CACATTCGAACCAACAAGAGTTGTCCCATGTACAACCAACTCGAGGGACCTGCCAACCTG
TAG

Coding sequence    

>YALI0F32153g.cds
ATGGCCGACGAAGACGCTGACTGGGCCAGAGCACTCGGTGGCGACCCTACCAACGTTGGA
GGTGTTGTCCCATCGCTTGATGGTGCAGATGCCATCAAACACTCCGCAGACGCCATCGAT
TTCGAGGACGAAGACGAGCTGGCAGATGATGAGCTGCCCGAAGAGGAGGAAGCAACCGGA
AACAACGTGATTGAGGAGGATGAGGAGATGATTGTCGAGGAGGAAGCTGGTCTATCCATG
TCTGGGAACGATAGCCTGGGTGGCTTCCAGTTCCACATTGGCGAGGATGAGCTTGGAGGA
TTTGATCTGGGAGATGACGAACATTGGAGCCAGAGCTGGGGAGGCAACGAAATGGACGAG
CTGCCTGTGATGGACGAGTACGATGATGGTGGTGTTGTCCATGCTGGAATGTTGAATAAC
AATGCCAACAGCAACAACCATGTCGTGCATGCCGGTGAACCTGGAGAGTCAGACGAAGAG
ATGGAGATGGAGGCTGCTGGAGGCGACGGGCTGGACATGAAGCTTGACGCAGATGAGCTA
GGCGAACTGGGAAAGTTAGGGCCTGCCGACAAGATGGACGCCTCTGGATTTGATGACATC
CAACACATGAACATCCAGGAGCTCAATCTGGAGGAAATTGCACAGATGGCGCGTCAGGCT
GATCTTGACATGCTTGCAATGTACTTCCCCGGGTTCCAAAAGGGCAAGCCTATGAAGATG
AACTCGCTCTTCTACAACAAGTTTGCTGTTCTGAGTCTGCCCAAGCCTAACACTGCTCGT
CCGTGTGTCCCCACAAAGACTAAGCTGGAGGTGGCGCCAGATGAGCGTAAGATGACACGA
GGAACAGTGTGGGTGGACCATGCTAAGCGGCGACCGGGTGTTGTAACAGTCACCGAGGCC
GAGACGGGGGACGATAAGCAGGAACTGGCCCGAGAAAGCAGAACCACACAATCAGCAACA
GACCAGCAGCTGCATTTCGACATTACTGACTGGAAGATCACGTGGGGAGATGAGGATGAT
GGGGATGGAGACGTGGATATGGATTGCGAGGGCGAAAAGGTCAAGGAGGAACAAAAAGAG
GTTGTTCCACAGCTCGGCGTGCACAGAAACGAAGAGGCCTGGAACGACGAGGCCTTGTTT
GAAGGCGATGCTCTCAGTCAGGTGAAGAAGGTCAACCTCGATATGAACGACCCCAACCTA
CTGTTCATCAACACAGATGGTCGTGTATCGAAGAATGTGACTCCTGCCATTCCCTCCACC
GAACTACAGCTTCGAAACAGATTTAACCTCAGTAACGACCGAGCCTACGATATGCTCAAG
GAGAACACACAGAGCAAGGTACGAGCCACTATTGGCAATCTGTCTATCGATCACTCCATG
CCAGCTCTCCGACTGCAAAGTCCATTCTACAAAGTGCGAATCTCGAAGCCTCAGGCTCGT
TCGTTCCATCGGCCATCATTTGTGGTGAAACCCAACACTACTGTTCACTTTTCGCGAATG
AAGATCCGAAAGAAGAAGCGAGACAGGGGAAAGCCTATCAAGGATCTTTTGGCCAAGACT
ACGGACTTGTCGTTGGGTGACTCTGCTCAGTACTTCCTCATGGAGTACGCTGAGCAATTC
CCCATGACCCTCTCCAACTACGGAATGGGCTCTAAGATGATCAACTATTACAGAAAAGCT
TCTCCTGAGGATACTTCCCGACCTAAGCTCCCCGTTGGAGAGACACATGTTCTGTCGGTC
CAGGACAAGTCTCCCTTTTGGAACTTTGGATTCGTCGAGCCGGGTAAGATTGTGCCTACT
CTGTACAACAAGATGATCCGAGCCCCCGTATTCAAGCATACACCACGTGACACAGATTTT
CTTATGATCCGATCAACTGGAGGCGATGTGACTGGGGCTGGCCAAAAGTACTTTCTGAGA
AATATTCCACATGTCTTCACTGTCGGTCAAACATACCCCGTGACTGACGTACCCGGGCCA
CACTCTCGAAAGGTAACCACGGCTTCCAAAAACCGGCTCAAGATGATTGTCTATCGAGTG
CTAAACGCATCACCTTATCATAGAATCAACGTGAAGGATATCGCCGAACATTTCCCTGAT
CAGATTGACACTCAAAACCGACAGAGGCTCAAGGAGTTTATGGAGTATCAGCGAACAGGT
GAGGATCAGGGTTACTGGAAGGTCAAGCCTACCGATACTCTTCCTGGCGAGGACGGCATC
AGAACCATGATCACACCCGAGGACATCACTCTCTTGGAGGCCATGCAGGTTGGTGTTCAG
AACCTTGAGGATGCTGGATATGGTCGAACTGACGACATTGAGAGCGACCATGAGAATGGC
GAGGAGTCTGGTCTTTCTCTGGAGGAGCAGTCTGCTCCATGGAACCTGACCAGAAACTTC
ATCAACGCTACCCAGGGCAAGGCCATGCTGCAGCTGCATGGCGAGGGTGATCCTTCTGGT
CGAGGCGAGGGTTTCTCATTCCTCAAGACTTCCATGAAGGGTGGTTTTCAGGCCGCTGGT
GAGTCTGTCAATGAGAAGCTTGACAAGAGCAAGTTTGGCGGACACAAGTACAACGTGGCC
CACCAGCAGCGTGCATACGACGATGAGATCAGCCGAATCTGGTATGCCCAGTGTCGAGCT
CTCAGCAACACCAAGGTACCTGAAGAGAACGACGAAGATTCAAAATGGGCCGATGCTGAG
GAGCAGCGAGAACAACAAGAGCGGGTGTCTACGCCTGGTTTCACTAGCGCAGCCTTCCCC
GACGATGATAACATGTCGCTCATGTCAGGCGACTCGGCCATGCAGCAGCGCAACAAGGTG
CTGCGAATCACGCGAATGGTCAAGGACGAGCACGGTATCATCCAGCGAAAGGTCGAGACC
ATCAAGGACCCCAGTGTCATTCGAGCATACATTCGACGACGAAAGCTGATGGACGAGGCC
AAGCTAACACTCGACGACTTAGACCCCACCAACGACGAGGAGGCCAACAAGCGAAACAAG
CGGCTGCTGGAGGAGCGGCTGGAGGATCTGCGAAAGAAGGGCGAGCGACGAAAGCAGCGA
CAGGCCCAGAAGCAGGGTCTCAACACAATCAACCTCGGCACCGACACTCCTCCCCCTTCA
GGCAAGGGTGTCGGCAAAGGCAAGGGTCCGCGACAGTGCAAGAACTGTGGTGCATATGGC
CACATTCGAACCAACAAGAGTTGTCCCATGTACAACCAACTCGAGGGACCTGCCAACCTG
TAG

Predicted translation product    

>YALI0F32153g.aa
MADEDADWARALGGDPTNVGGVVPSLDGADAIKHSADAIDFEDEDELADDELPEEEEATG
NNVIEEDEEMIVEEEAGLSMSGNDSLGGFQFHIGEDELGGFDLGDDEHWSQSWGGNEMDE
LPVMDEYDDGGVVHAGMLNNNANSNNHVVHAGEPGESDEEMEMEAAGGDGLDMKLDADEL
GELGKLGPADKMDASGFDDIQHMNIQELNLEEIAQMARQADLDMLAMYFPGFQKGKPMKM
NSLFYNKFAVLSLPKPNTARPCVPTKTKLEVAPDERKMTRGTVWVDHAKRRPGVVTVTEA
ETGDDKQELARESRTTQSATDQQLHFDITDWKITWGDEDDGDGDVDMDCEGEKVKEEQKE
VVPQLGVHRNEEAWNDEALFEGDALSQVKKVNLDMNDPNLLFINTDGRVSKNVTPAIPST
ELQLRNRFNLSNDRAYDMLKENTQSKVRATIGNLSIDHSMPALRLQSPFYKVRISKPQAR
SFHRPSFVVKPNTTVHFSRMKIRKKKRDRGKPIKDLLAKTTDLSLGDSAQYFLMEYAEQF
PMTLSNYGMGSKMINYYRKASPEDTSRPKLPVGETHVLSVQDKSPFWNFGFVEPGKIVPT
LYNKMIRAPVFKHTPRDTDFLMIRSTGGDVTGAGQKYFLRNIPHVFTVGQTYPVTDVPGP
HSRKVTTASKNRLKMIVYRVLNASPYHRINVKDIAEHFPDQIDTQNRQRLKEFMEYQRTG
EDQGYWKVKPTDTLPGEDGIRTMITPEDITLLEAMQVGVQNLEDAGYGRTDDIESDHENG
EESGLSLEEQSAPWNLTRNFINATQGKAMLQLHGEGDPSGRGEGFSFLKTSMKGGFQAAG
ESVNEKLDKSKFGGHKYNVAHQQRAYDDEISRIWYAQCRALSNTKVPEENDEDSKWADAE
EQREQQERVSTPGFTSAAFPDDDNMSLMSGDSAMQQRNKVLRITRMVKDEHGIIQRKVET
IKDPSVIRAYIRRRKLMDEAKLTLDDLDPTNDEEANKRNKRLLEERLEDLRKKGERRKQR
QAQKQGLNTINLGTDTPPPSGKGVGKGKGPRQCKNCGAYGHIRTNKSCPMYNQLEGPANL
*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
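
As a quick check of this convention against the values on this page, the minimal Python sketch below recomputes the element length from the coordinates 3985832..3989074 and derives the codon and protein counts from it. It assumes the coordinates are 1-based and inclusive, as is usual for EMBL-style feature locations; the function names are illustrative only and are not part of any database tooling.

```python
from typing import Tuple


def element_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def codon_and_protein_lengths(cds_nt: int) -> Tuple[int, int]:
    """Return (codons including the stop codon, amino acids) for a CDS length in nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    # The stop codon is counted as a codon but encodes no amino acid.
    return codons, codons - 1


nt = element_length(3985832, 3989074)
codons, residues = codon_and_protein_lengths(nt)
print(nt, codons, residues)  # 3243 1081 1080
```

The three printed values match the element length, the coding sequence length in codons, and the protein length reported above.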

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, such that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears in frame #3.

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |