YALI0E02332g
similar to DEHA0C09174g Debaryomyces hansenii and uniprot|Q08773 Saccharomyces cerevisiae YOR304w ISW2
Element type: CDS
Element length: 3146 nucleotides, on anti-sense strand of Yali0E: complement(join(256357..259398,259458..259502)).
Other names:
YALI-IPF40269
Coding sequence: 1029 codons.
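The element length and codon count can be cross-checked against the location string. The following is a minimal Python sketch, assuming EMBL/GenBank-style coordinates that are 1-based and inclusive; the helper name `parse_location` is hypothetical and not part of any Génolevures tool.

```python
import re

def parse_location(location):
    """Return (is_complement, [(start, end), ...]) for an EMBL-style location string."""
    is_complement = location.startswith("complement(")
    # Each "a..b" pair is one segment of the join().
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments

location = "complement(join(256357..259398,259458..259502))"
is_complement, segments = parse_location(location)

# Assumption: coordinates are 1-based and inclusive, so a..b covers b - a + 1 nucleotides.
exon_lengths = [end - start + 1 for start, end in segments]
coding_nt = sum(exon_lengths)
element_nt = max(e for _, e in segments) - min(s for s, _ in segments) + 1
intron_nt = element_nt - coding_nt

print(exon_lengths)    # [3042, 45]
print(element_nt)      # 3146
print(coding_nt // 3)  # 1029
print(intron_nt)       # 59
```

Under these assumptions the two joined segments sum to 3087 nucleotides (1029 codons), and the 59 nucleotides between them account for the difference from the 3146-nucleotide element length.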
Database cross references:
EMBL: CR382131
GeneID: 2912094
GenomeReviews: CR382131_GR
HOGENOM: HBG717285
Homologs and Orthologs
Homologs in protein family: GL3M4588
Orthologs: strict determination not possible; homologs must be refined manually
Protein YALI0E02332p 
similar to DEHA0C09174g Debaryomyces hansenii and uniprot|Q08773 Saccharomyces cerevisiae YOR304w ISW2; SubName: Full=YALI0E02332p;
Protein domain map
Database cross references:
InterPro: IPR000330
InterPro: IPR001005
InterPro: IPR001650
InterPro: IPR009057
InterPro: IPR014001
InterPro: IPR014021
InterPro: IPR015194
InterPro: IPR015195
InterPro: IPR017884
KEGG: yli:YALI0E02332g
PROSITE: PS51192
PROSITE: PS51194
PROSITE: PS51293
Pfam: PF00176
Pfam: PF00271
Pfam: PF09110
Pfam: PF09111
RefSeq: XP_503455.1
SMART: SM00487
SMART: SM00490
SMART: SM00717
UniProtKB/TrEMBL: Q6C7A7
UniProtKB: Q6C7A7_YARLI
Phylogeny
PhylomeDB: YALI0E02332g
Sequence data
>YALI0E02332g.nt ATGTCTGGACCAGCACCCATAGACTTGGATAAGTCCAATCCAGGAGTGAGTACCGTTGCA AAACTGCCCCTTGATCCTCTCATCACGACTTTTACTAACCCCAGTCTTCCATGGTTTCCA CCGTGACGACCCCAAACAGGGACACGCCTCCAGTCCAGCTGTCGACCGAAGATCTGCTCA ACATCCGATCGGGCTATCTGCTTCCCGACAAGGGTCTTGGTAAGCCCAAGCAGAAGTCCA TTCCCGAACTCGTCAAGAAAATCAAACATCTTTACGGGCTGAGCGACGTTTTTGAGTACT TCCTGAAGTCCAAGGGAGAGTCCGACCCCAATTACAGAAAGGCCATGGAACAGGCAAAGG AGGAGCTCGGTCTTAACAAGACCACCGAGAAACCCACCGGTGGCGCGCGACACAGAAAGA CCGAGAAGGAAGAAGACGCTGAGCTGATGCAGGGAGAGGAGGAGGCCGAGAACTCGGTCG AAACCGTCTTCGAGACTTCTCCTGGCTACATTCAGGGAACTCTGCGAGAGTACCAAGTTC AGGGCCTCAACTGGATGGTCTCTCTTTATGAGCATGGTCTTTCTGGCATTCTGGCCGACG AAATGGGTCTTGGTAAGACTCTGCAGACCATCTCTTTCCTGGGATACCTTCGATACTTCC GTGGAATTCCCGGTCCTCATTTGGTCTGTGTTCCCAAGTCTACCCTGGACAACTGGGCCC GAGAGTTTGCCAAATGGACTCCTGAGGTCAACGTTCTTGTTCTTCAGGGAGACAAGGAGG GCCGAGCTCAGCTTATCCAGGATCGGCTTCTGACCTGCGACTTTGACGTATGCATTACCT CCTACGAGATGGTTCTTCGAGAGAAGGGATATCTGCGACGGTTTGCTTGGCAATACATTG TGATTGATGAGGCTCATCGAATCAAGAACGAGGAGTCTTCTCTGTCACAGATCATCCGTC TGTTCCACACCGAAAACCGACTGTTGATCACCGGTACTCCTTTGCAAAACAACCTGCACG AGCTGTGGGCTCTTCTGAACTATATTCTCCCCGACGTGTTCCAGGACTCTGCGGCTTTTG ATGCCTGGTTCGGTGAAGACCAGAGTGGAGACCAGGACGCAGCCGTGAACCAGCTGCATA AGATTCTGCGGCCATTCCTACTCCGACGAGTCAAGGCCGATGTCGAGAAGTCTCTTTTGC CCAAGAAGGAGATCAACCTCTACGTTGGCATGTCCGACATGCAGGTTAAATGGTACCAGA AAATTCTCGAGAAGGACATTGACGCTGTCAACGGTCAGATTGGCAAGCGTGAGGGCAAAA CCCGGCTTCTCAACATTGTGATGCAGCTGCGAAAGTGCTGCAACCATCCCTATTTGTTCG AGGGTGCCGAGCCTGGACCTCCTTACACCACTGATGAGCATCTTGTCTACAATGCTGGCA AGATGGTGATTCTCGACAAGCTTCTGAAGCGAATTCAGGAGCAGGGATCTCGTGTTCTTA TCTTCTCCCAGATGAGTCGAGTCCTTGATATTCTTGAGGATTACTGTCTGTTCCGAGGTT ACAAGTATTGTCGAATTGATGGTCAGACTGCGCATGAGGACCGAATCAACGCCATTGATG CCTACAACAAGGAGGGATCCGAGAAGTTTGTGTTCTTGCTGACAACTCGAGCTGGTGGTC TCGGTATCAATTTGACTACTGCCGACCAAGTTGTCCTTTACGACTCTGACTGGAACCCTC AAGCTGATCTTCAGGCCATGGATCGAGCTCATCGAATTGGACAAACCAAGCAGGTTTATG TCTACCGATTCATTACTGAAAACGCTGTCGAGGAGAAGGTCATCGAGCGAGCTACACAGA AGCTGCGTCTGGACAAGCTGGTTATTCAGCAAGGCCGATCGCAATCCAAAGTCAACAACA 
ACGCCCAGAACAAGGATGATCTGCTTAATATGATTCAATTTGGTGCCGAGAAGGTGTTCA ACCGAGGAAAGGGCGAGGAACAGGAGGAGGCTGATCTTGATATTGACGACATTCTCAAGC GAGGCCAGCAGAAGACTATGGAGCTCAACTCCCGTTACGACTCTCTTGGTCTTGATGATC TGCAAAAGTTCACTTCCGATTCTGCCTACGAATGGAACGGAGAGGATTTCAAGAAGAAGG ACACCAGCAAGACTAACAAGCCCTTCGTGTGGATTGCACCCAACAAGCGTGAGCGAAAGG AAAACTACTCCATCGACTCGTACTACCGAGAAATGCTTAACCAGGGCGCTGCTGCGGGCA AGAACGCTGCGCCCAAGGCCCCCAAGGCCCCCAAGCAGCTGAACATCCAGGACCATCAGT TCTACCCCCAGCGACTTATCGACATTCAGGAGAAGGAGACTTCTTACTACCGAAAGCAAA TTGGTTACAAGGTGCCTCTTCCCGATGGAGCTGAAGAAGAGCTTGAGATGCGAGAGAGCG AGCGAGATCTTGAGCAGCAGGAGATTGACAACGCCGAGCCTCTCACCGAGGAGGAAGAGG AGGAGAAGGTAGACCTGTCCACTAAGGGTTTCTCCAACTGGAACCGACGAGATTTCACAA ATTTCATCCAGGCTTCGGCGCGTCATGGTCGAAACAACTTTGCTGCCATTGCCACAGAGT TCGAGGACAAGACCCAAGCCGATATCAAGAAGTACGCTGGCGTGTTCTGGCAACGATACA CTGAGATCAACGGCTTTGACAGATACATTCAGCAGATCGAGGCAGGTGAGGACAAGGCCA AGAAGCAGAACCATCAAAACACTCTTCTAACCCGAAAGGTGGAGGGCTATGAGGCCCCTC TACAGCAGATGGTCATTGTCTACCCTGCAGGCCAGAAGAAAATCTACTCTGAGGACGAGG ATCGGTATATTCTGGTTCAACTCTACAGGTACGGCCTTGAGACTGAGGGTGTTTATGAGA TGATCCGAGACGCCATTCGAGCATCACCCGTGTTCCGATTCGACTGGTTCTTCCTGTCGA GGACTCCTGCAGAGCTTGCGCGACGAGGCCAGACTCTGCTTTCTTACGTTGGTAAAGAGT ACGATGGTGCTGGCGAGAAGCGAAAATCGTCTTCCACCCCCGACGTCGAGACTCCCAAAA AGTCTGCCAAGAAGAAGAAGGCGTAG
>YALI0E02332g.cds ATGTCTGGACCAGCACCCATAGACTTGGATAAGTCCAATCCAGGATCTTCCATGGTTTCC ACCGTGACGACCCCAAACAGGGACACGCCTCCAGTCCAGCTGTCGACCGAAGATCTGCTC AACATCCGATCGGGCTATCTGCTTCCCGACAAGGGTCTTGGTAAGCCCAAGCAGAAGTCC ATTCCCGAACTCGTCAAGAAAATCAAACATCTTTACGGGCTGAGCGACGTTTTTGAGTAC TTCCTGAAGTCCAAGGGAGAGTCCGACCCCAATTACAGAAAGGCCATGGAACAGGCAAAG GAGGAGCTCGGTCTTAACAAGACCACCGAGAAACCCACCGGTGGCGCGCGACACAGAAAG ACCGAGAAGGAAGAAGACGCTGAGCTGATGCAGGGAGAGGAGGAGGCCGAGAACTCGGTC GAAACCGTCTTCGAGACTTCTCCTGGCTACATTCAGGGAACTCTGCGAGAGTACCAAGTT CAGGGCCTCAACTGGATGGTCTCTCTTTATGAGCATGGTCTTTCTGGCATTCTGGCCGAC GAAATGGGTCTTGGTAAGACTCTGCAGACCATCTCTTTCCTGGGATACCTTCGATACTTC CGTGGAATTCCCGGTCCTCATTTGGTCTGTGTTCCCAAGTCTACCCTGGACAACTGGGCC CGAGAGTTTGCCAAATGGACTCCTGAGGTCAACGTTCTTGTTCTTCAGGGAGACAAGGAG GGCCGAGCTCAGCTTATCCAGGATCGGCTTCTGACCTGCGACTTTGACGTATGCATTACC TCCTACGAGATGGTTCTTCGAGAGAAGGGATATCTGCGACGGTTTGCTTGGCAATACATT GTGATTGATGAGGCTCATCGAATCAAGAACGAGGAGTCTTCTCTGTCACAGATCATCCGT CTGTTCCACACCGAAAACCGACTGTTGATCACCGGTACTCCTTTGCAAAACAACCTGCAC GAGCTGTGGGCTCTTCTGAACTATATTCTCCCCGACGTGTTCCAGGACTCTGCGGCTTTT GATGCCTGGTTCGGTGAAGACCAGAGTGGAGACCAGGACGCAGCCGTGAACCAGCTGCAT AAGATTCTGCGGCCATTCCTACTCCGACGAGTCAAGGCCGATGTCGAGAAGTCTCTTTTG CCCAAGAAGGAGATCAACCTCTACGTTGGCATGTCCGACATGCAGGTTAAATGGTACCAG AAAATTCTCGAGAAGGACATTGACGCTGTCAACGGTCAGATTGGCAAGCGTGAGGGCAAA ACCCGGCTTCTCAACATTGTGATGCAGCTGCGAAAGTGCTGCAACCATCCCTATTTGTTC GAGGGTGCCGAGCCTGGACCTCCTTACACCACTGATGAGCATCTTGTCTACAATGCTGGC AAGATGGTGATTCTCGACAAGCTTCTGAAGCGAATTCAGGAGCAGGGATCTCGTGTTCTT ATCTTCTCCCAGATGAGTCGAGTCCTTGATATTCTTGAGGATTACTGTCTGTTCCGAGGT TACAAGTATTGTCGAATTGATGGTCAGACTGCGCATGAGGACCGAATCAACGCCATTGAT GCCTACAACAAGGAGGGATCCGAGAAGTTTGTGTTCTTGCTGACAACTCGAGCTGGTGGT CTCGGTATCAATTTGACTACTGCCGACCAAGTTGTCCTTTACGACTCTGACTGGAACCCT CAAGCTGATCTTCAGGCCATGGATCGAGCTCATCGAATTGGACAAACCAAGCAGGTTTAT GTCTACCGATTCATTACTGAAAACGCTGTCGAGGAGAAGGTCATCGAGCGAGCTACACAG AAGCTGCGTCTGGACAAGCTGGTTATTCAGCAAGGCCGATCGCAATCCAAAGTCAACAAC AACGCCCAGAACAAGGATGATCTGCTTAATATGATTCAATTTGGTGCCGAGAAGGTGTTC 
AACCGAGGAAAGGGCGAGGAACAGGAGGAGGCTGATCTTGATATTGACGACATTCTCAAG CGAGGCCAGCAGAAGACTATGGAGCTCAACTCCCGTTACGACTCTCTTGGTCTTGATGAT CTGCAAAAGTTCACTTCCGATTCTGCCTACGAATGGAACGGAGAGGATTTCAAGAAGAAG GACACCAGCAAGACTAACAAGCCCTTCGTGTGGATTGCACCCAACAAGCGTGAGCGAAAG GAAAACTACTCCATCGACTCGTACTACCGAGAAATGCTTAACCAGGGCGCTGCTGCGGGC AAGAACGCTGCGCCCAAGGCCCCCAAGGCCCCCAAGCAGCTGAACATCCAGGACCATCAG TTCTACCCCCAGCGACTTATCGACATTCAGGAGAAGGAGACTTCTTACTACCGAAAGCAA ATTGGTTACAAGGTGCCTCTTCCCGATGGAGCTGAAGAAGAGCTTGAGATGCGAGAGAGC GAGCGAGATCTTGAGCAGCAGGAGATTGACAACGCCGAGCCTCTCACCGAGGAGGAAGAG GAGGAGAAGGTAGACCTGTCCACTAAGGGTTTCTCCAACTGGAACCGACGAGATTTCACA AATTTCATCCAGGCTTCGGCGCGTCATGGTCGAAACAACTTTGCTGCCATTGCCACAGAG TTCGAGGACAAGACCCAAGCCGATATCAAGAAGTACGCTGGCGTGTTCTGGCAACGATAC ACTGAGATCAACGGCTTTGACAGATACATTCAGCAGATCGAGGCAGGTGAGGACAAGGCC AAGAAGCAGAACCATCAAAACACTCTTCTAACCCGAAAGGTGGAGGGCTATGAGGCCCCT CTACAGCAGATGGTCATTGTCTACCCTGCAGGCCAGAAGAAAATCTACTCTGAGGACGAG GATCGGTATATTCTGGTTCAACTCTACAGGTACGGCCTTGAGACTGAGGGTGTTTATGAG ATGATCCGAGACGCCATTCGAGCATCACCCGTGTTCCGATTCGACTGGTTCTTCCTGTCG AGGACTCCTGCAGAGCTTGCGCGACGAGGCCAGACTCTGCTTTCTTACGTTGGTAAAGAG TACGATGGTGCTGGCGAGAAGCGAAAATCGTCTTCCACCCCCGACGTCGAGACTCCCAAA AAGTCTGCCAAGAAGAAGAAGGCGTAG
>YALI0E02332g.aa MSGPAPIDLDKSNPGSSMVSTVTTPNRDTPPVQLSTEDLLNIRSGYLLPDKGLGKPKQKS IPELVKKIKHLYGLSDVFEYFLKSKGESDPNYRKAMEQAKEELGLNKTTEKPTGGARHRK TEKEEDAELMQGEEEAENSVETVFETSPGYIQGTLREYQVQGLNWMVSLYEHGLSGILAD EMGLGKTLQTISFLGYLRYFRGIPGPHLVCVPKSTLDNWAREFAKWTPEVNVLVLQGDKE GRAQLIQDRLLTCDFDVCITSYEMVLREKGYLRRFAWQYIVIDEAHRIKNEESSLSQIIR LFHTENRLLITGTPLQNNLHELWALLNYILPDVFQDSAAFDAWFGEDQSGDQDAAVNQLH KILRPFLLRRVKADVEKSLLPKKEINLYVGMSDMQVKWYQKILEKDIDAVNGQIGKREGK TRLLNIVMQLRKCCNHPYLFEGAEPGPPYTTDEHLVYNAGKMVILDKLLKRIQEQGSRVL IFSQMSRVLDILEDYCLFRGYKYCRIDGQTAHEDRINAIDAYNKEGSEKFVFLLTTRAGG LGINLTTADQVVLYDSDWNPQADLQAMDRAHRIGQTKQVYVYRFITENAVEEKVIERATQ KLRLDKLVIQQGRSQSKVNNNAQNKDDLLNMIQFGAEKVFNRGKGEEQEEADLDIDDILK RGQQKTMELNSRYDSLGLDDLQKFTSDSAYEWNGEDFKKKDTSKTNKPFVWIAPNKRERK ENYSIDSYYREMLNQGAAAGKNAAPKAPKAPKQLNIQDHQFYPQRLIDIQEKETSYYRKQ IGYKVPLPDGAEEELEMRESERDLEQQEIDNAEPLTEEEEEEKVDLSTKGFSNWNRRDFT NFIQASARHGRNNFAAIATEFEDKTQADIKKYAGVFWQRYTEINGFDRYIQQIEAGEDKA KKQNHQNTLLTRKVEGYEAPLQQMVIVYPAGQKKIYSEDEDRYILVQLYRYGLETEGVYE MIRDAIRASPVFRFDWFFLSRTPAELARRGQTLLSYVGKEYDGAGEKRKSSSTPDVETPK KSAKKKKA*
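The `.nt` record covers the whole element, intron included, while the `.cds` record is the spliced coding sequence. The following is a minimal Python sketch that checks this on the first 120 nucleotides of the `.nt` record. It assumes a 45-nucleotide first exon followed by a 59-nucleotide intron, as implied by the join coordinates (on the complement strand the 259458..259502 segment is read first); `splice` is a hypothetical helper.

```python
# First two 60-nt blocks of the .nt record (element sequence, intron included).
nt_head = (
    "ATGTCTGGACCAGCACCCATAGACTTGGATAAGTCCAATCCAGGAGTGAGTACCGTTGCA"
    "AAACTGCCCCTTGATCCTCTCATCACGACTTTTACTAACCCCAGTCTTCCATGGTTTCCA"
)
# First 60-nt block of the .cds record (spliced coding sequence).
cds_head = "ATGTCTGGACCAGCACCCATAGACTTGGATAAGTCCAATCCAGGATCTTCCATGGTTTCC"

FIRST_EXON_NT = 45  # assumed from segment 259458..259502
INTRON_NT = 59      # assumed from the gap between the two joined segments

def splice(seq, exon_len, intron_len):
    """Remove one intron of intron_len nucleotides that starts right after exon_len."""
    return seq[:exon_len] + seq[exon_len + intron_len:]

intron = nt_head[FIRST_EXON_NT:FIRST_EXON_NT + INTRON_NT]
spliced = splice(nt_head, FIRST_EXON_NT, INTRON_NT)

print(intron[:2], intron[-2:])      # GT AG
print(spliced[:60] == cds_head)     # True
```

Under these assumptions the excised segment begins with GT and ends with AG, and the spliced result reproduces the start of the `.cds` record.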
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
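As a worked check of this convention against the figures on this page, here is a small Python sketch. The variable names are illustrative, and the residue count assumes the `.aa` record is laid out as 17 full blocks of 60 residues followed by a final block of 8 residues and the `*` terminator.

```python
CODONS = 1029  # "Coding sequence: 1029 codons", stop codon included

coding_nt = CODONS * 3    # nucleotides in the spliced coding sequence
protein_len = CODONS - 1  # residues: the stop codon adds no amino acid

# Residues counted from the .aa record layout (assumption stated above);
# the trailing '*' marks the stop and is not counted as a residue.
aa_residues = 17 * 60 + len("KSAKKKKA")

print(coding_nt)                   # 3087
print(protein_len)                 # 1028
print(aa_residues == protein_len)  # True
```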
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/YALI/YALI0E02332g