SACE0E04114g


YER086W ILV1, Threonine deaminase, catalyzes the first step in isoleucine biosynthesis; expression is under general amino acid control; ILV1 locus exhibits highly positioned nucleosomes whose organization is independent of known ILV1 regulation

Genomic environment map

Element type: CDS
Element length: 1731 nucleotides, on sense strand of Sace0E: 328473..330203.
Other names:
ILV1
YER086W
Coding sequence: 577 codons.
Database cross references:
ArrayExpress: P00927
CYGD: YER086w
EMBL: AAA34705.1
EMBL: AAB64641.1
EMBL: BK006939
EMBL: CAA25696.1
EMBL: M36383
EMBL: U18839
EMBL: X01466
GeneID: 856819
HOGENOM: HBG714501
NMPDR: fig|4932.3.peg.2080

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0161
Orthologs: strict determination not possible; homologs must be refined manually

Protein SACE0E04114p  


Protein domain map

Protein length: 576 amino acids
Protein family: GL3C0161
Database cross references:
DIP: DIP-4029N
GermOnline: YER086W
HSSP: 1TDJ
IntAct: P00927
InterPro: IPR000634
InterPro: IPR001721
InterPro: IPR001926
InterPro: IPR005787
KEGG: sce:YER086W
NextBio: 983097
PIR: S50589
PROSITE: PS00165
PeptideAtlas: P00927
Pfam: PF00291
Pfam: PF00585
RefSeq: NP_011009.1
SGD: S000000888
SMR: P00927
TIGRFAMs: TIGR01124
UniProtKB/Swiss-Prot: P00927
UniProtKB: THDH_YEAST

Phylogeny  

PhylomeDB:SACE0E04114g

Computed results for SACE0E04114p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>SACE0E04114g.nt
ATGTCAGCTACTCTACTAAAGCAACCATTATGTACGGTTGTTCGGCAAGGTAAACAGTCC
AAAGTGTCTGGATTGAACCTTTTGAGACTAAAGGCTCATTTGCACAGACAACACCTGTCA
CCTTCCTTGATAAAACTACACTCTGAATTGAAATTGGATGAGCTGCAAACTGATAACACC
CCTGATTACGTCCGTTTAGTTTTAAGGTCCTCTGTATACGATGTTATTAATGAATCTCCA
ATCTCTCAAGGTGTAGGTTTGTCTTCCCGTCTAAACACGAATGTCATCTTGAAAAGAGAA
GATCTATTGCCTGTTTTCTCTTTCAAGCTTCGTGGTGCCTATAACATGATTGCCAAGTTG
GACGATTCTCAAAGAAACCAGGGTGTTATTGCCTGTTCAGCTGGGAATCATGCCCAAGGT
GTGGCCTTTGCTGCTAAACACTTGAAAATACCTGCTACTATCGTTATGCCTGTTTGTACA
CCATCTATTAAGTATCAAAATGTCTCGAGATTAGGGTCTCAAGTCGTCCTATATGGTAAC
GATTTTGACGAGGCTAAGGCTGAATGTGCCAAATTGGCTGAAGAGCGTGGCTTGACGAAC
ATTCCTCCTTTCGATCATCCTTATGTCATTGCCGGTCAAGGTACTGTAGCTATGGAAATC
CTAAGACAAGTACGTACCGCTAATAAGATCGGTGCTGTCTTTGTTCCCGTCGGCGGTGGT
GGTTTAATTGCTGGTATTGGTGCTTATTTGAAAAGGGTTGCTCCTCATATCAAAATCATT
GGTGTTGAAACTTACGATGCGGCCACTTTACATAATTCCTTGCAACGCAACCAGAGAACT
CCTTTACCTGTGGTGGGTACTTTTGCCGATGGTACGTCTGTGCGTATGATTGGTGAAGAA
ACATTTAGAGTCGCCCAACAAGTGGTTGATGAAGTTGTTCTTGTTAACACTGACGAAATC
TGTGCTGCAGTAAAGGATATTTTTGAAGATACTAGAAGTATTGTAGAACCATCTGGTGCC
CTTTCAGTAGCCGGTATGAAGAAATACATCTCTACCGTACATCCAGAAATTGACCACACT
AAAAACACCTATGTTCCCATCCTTTCTGGTGCTAACATGAACTTTGATAGATTAAGATTT
GTTTCCGAACGTGCTGTTCTTGGTGAAGGAAAGGAAGTCTTCATGTTAGTTACTTTACCC
GACGTCCCTGGTGCGTTCAAGAAAATGCAAAAGATCATCCACCCAAGATCTGTCACTGAA
TTCTCTTACCGTTACAATGAACATCGTCATGAGTCCTCTAGTGAAGTGCCCAAGGCTTAC
ATTTACACTTCTTTCAGCGTCGTTGACAGAGAAAAGGAAATCAAGCAAGTTATGCAACAG
TTGAATGCTTTAGGTTTTGAAGCTGTGGATATCTCCGATAACGAATTGGCTAAATCTCAT
GGTAGATACTTGGTTGGTGGTGCTTCTAAGGTTCCTAATGAAAGAATTATTTCATTTGAA
TTCCCTGAAAGACCAGGTGCCTTGACTAGGTTCCTTGGAGGCCTAAGCGATTCTTGGAAT
CTTACTTTATTCCATTATAGAAACCATGGTGCCGATATCGGTAAGGTTTTAGCTGGTATT
TCCGTTCCTCCAAGGGAAAACTTAACCTTCCAAAAATTCTTGGAAGATTTAGGCTACACT
TATCATGATGAAACTGATAACACTGTTTATCAAAAATTCTTGAAATATTAA

Coding sequence    

>SACE0E04114g.cds
ATGTCAGCTACTCTACTAAAGCAACCATTATGTACGGTTGTTCGGCAAGGTAAACAGTCC
AAAGTGTCTGGATTGAACCTTTTGAGACTAAAGGCTCATTTGCACAGACAACACCTGTCA
CCTTCCTTGATAAAACTACACTCTGAATTGAAATTGGATGAGCTGCAAACTGATAACACC
CCTGATTACGTCCGTTTAGTTTTAAGGTCCTCTGTATACGATGTTATTAATGAATCTCCA
ATCTCTCAAGGTGTAGGTTTGTCTTCCCGTCTAAACACGAATGTCATCTTGAAAAGAGAA
GATCTATTGCCTGTTTTCTCTTTCAAGCTTCGTGGTGCCTATAACATGATTGCCAAGTTG
GACGATTCTCAAAGAAACCAGGGTGTTATTGCCTGTTCAGCTGGGAATCATGCCCAAGGT
GTGGCCTTTGCTGCTAAACACTTGAAAATACCTGCTACTATCGTTATGCCTGTTTGTACA
CCATCTATTAAGTATCAAAATGTCTCGAGATTAGGGTCTCAAGTCGTCCTATATGGTAAC
GATTTTGACGAGGCTAAGGCTGAATGTGCCAAATTGGCTGAAGAGCGTGGCTTGACGAAC
ATTCCTCCTTTCGATCATCCTTATGTCATTGCCGGTCAAGGTACTGTAGCTATGGAAATC
CTAAGACAAGTACGTACCGCTAATAAGATCGGTGCTGTCTTTGTTCCCGTCGGCGGTGGT
GGTTTAATTGCTGGTATTGGTGCTTATTTGAAAAGGGTTGCTCCTCATATCAAAATCATT
GGTGTTGAAACTTACGATGCGGCCACTTTACATAATTCCTTGCAACGCAACCAGAGAACT
CCTTTACCTGTGGTGGGTACTTTTGCCGATGGTACGTCTGTGCGTATGATTGGTGAAGAA
ACATTTAGAGTCGCCCAACAAGTGGTTGATGAAGTTGTTCTTGTTAACACTGACGAAATC
TGTGCTGCAGTAAAGGATATTTTTGAAGATACTAGAAGTATTGTAGAACCATCTGGTGCC
CTTTCAGTAGCCGGTATGAAGAAATACATCTCTACCGTACATCCAGAAATTGACCACACT
AAAAACACCTATGTTCCCATCCTTTCTGGTGCTAACATGAACTTTGATAGATTAAGATTT
GTTTCCGAACGTGCTGTTCTTGGTGAAGGAAAGGAAGTCTTCATGTTAGTTACTTTACCC
GACGTCCCTGGTGCGTTCAAGAAAATGCAAAAGATCATCCACCCAAGATCTGTCACTGAA
TTCTCTTACCGTTACAATGAACATCGTCATGAGTCCTCTAGTGAAGTGCCCAAGGCTTAC
ATTTACACTTCTTTCAGCGTCGTTGACAGAGAAAAGGAAATCAAGCAAGTTATGCAACAG
TTGAATGCTTTAGGTTTTGAAGCTGTGGATATCTCCGATAACGAATTGGCTAAATCTCAT
GGTAGATACTTGGTTGGTGGTGCTTCTAAGGTTCCTAATGAAAGAATTATTTCATTTGAA
TTCCCTGAAAGACCAGGTGCCTTGACTAGGTTCCTTGGAGGCCTAAGCGATTCTTGGAAT
CTTACTTTATTCCATTATAGAAACCATGGTGCCGATATCGGTAAGGTTTTAGCTGGTATT
TCCGTTCCTCCAAGGGAAAACTTAACCTTCCAAAAATTCTTGGAAGATTTAGGCTACACT
TATCATGATGAAACTGATAACACTGTTTATCAAAAATTCTTGAAATATTAA

Predicted translation product    

>SACE0E04114g.aa
MSATLLKQPLCTVVRQGKQSKVSGLNLLRLKAHLHRQHLSPSLIKLHSELKLDELQTDNT
PDYVRLVLRSSVYDVINESPISQGVGLSSRLNTNVILKREDLLPVFSFKLRGAYNMIAKL
DDSQRNQGVIACSAGNHAQGVAFAAKHLKIPATIVMPVCTPSIKYQNVSRLGSQVVLYGN
DFDEAKAECAKLAEERGLTNIPPFDHPYVIAGQGTVAMEILRQVRTANKIGAVFVPVGGG
GLIAGIGAYLKRVAPHIKIIGVETYDAATLHNSLQRNQRTPLPVVGTFADGTSVRMIGEE
TFRVAQQVVDEVVLVNTDEICAAVKDIFEDTRSIVEPSGALSVAGMKKYISTVHPEIDHT
KNTYVPILSGANMNFDRLRFVSERAVLGEGKEVFMLVTLPDVPGAFKKMQKIIHPRSVTE
FSYRYNEHRHESSSEVPKAYIYTSFSVVDREKEIKQVMQQLNALGFEAVDISDNELAKSH
GRYLVGGASKVPNERIISFEFPERPGALTRFLGGLSDSWNLTLFHYRNHGADIGKVLAGI
SVPPRENLTFQKFLEDLGYTYHDETDNTVYQKFLKY*
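The predicted translation above can be reproduced from the coding sequence with the standard genetic code, which applies to nuclear genes of S. cerevisiae. The Python sketch below is illustrative only: the `translate` helper and the way the codon table is built are assumptions of this example, not part of the database's own tooling. It is applied here to the first and last 21 nucleotides of the coding sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last 21 nucleotides of the coding sequence shown on this page.
first = translate("ATGTCAGCTACTCTACTAAAG")
last = translate("CAAAAATTCTTGAAATATTAA")
print(first)  # MSATLLK
print(last)   # QKFLKY*
```

Both fragments match the start and the end of the predicted translation product, including the terminal `*` for the TAA stop codon.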




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
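The figures reported for this element are consistent with that rule, as the short Python sketch below shows. The function names are hypothetical and chosen for illustration; the sketch also assumes that the coordinates are 1-based and inclusive, which is what the reported element length implies.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def codon_count(nucleotides: int) -> int:
    """Number of codons in a coding sequence, stop codon included."""
    if nucleotides % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nucleotides // 3


def protein_length(codons: int) -> int:
    """Protein length: the stop codon does not encode an amino acid."""
    return codons - 1


nt = span_length(328473, 330203)   # coordinates on Sace0E
codons = codon_count(nt)
print(nt, codons, protein_length(codons))  # 1731 577 576
```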

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |