Element type: CDS
Element length: 2763 nucleotides,
on sense strand of
Yali0F: 3661782..3664544.
Other names:
YALI-CDS0578.1
YALI-IPF2112
Coding sequence: 921 codons.
Element length: 2763 nucleotides,
on sense strand of
Yali0F: 3661782..3664544.
Other names:
YALI-CDS0578.1
YALI-IPF2112
Coding sequence: 921 codons.
Database cross references:
EMBL: CR382132
GeneID: 2907888
HOGENOM: Q6C008
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR382132
GeneID: 2907888
HOGENOM: Q6C008
Homologs and Orthologs
Homologs in protein family: GL3M4588Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0F28831g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_506004.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C008
UniprotKB: Q6C008_YARLI
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0F28831g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_506004.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C008
UniprotKB: Q6C008_YARLI
Sequence data 
>YALI0F28831g.nt AAAAAAAAAAAGGGTCTAGTATAAACGAAAACCAACGTCTGCCCATGCTAAATAAACACG CTGACGACTCTGATGAATGAAACTATGAAATATATATAAAAAAACCACCTCTTCGCGGTC GCTACCCTTTGAATATAAAAAAACAAAGAAAAAAATAAAATAGAGGAGGAATAAGAGGAC ATCCCAAGCACTACTGTCATACGCTCTTCTGCAAACACCCGAATCCGAGTCCGATGTGTG GGTTTGACATTGCTGAGATGTATTTCCCTTGAACCCTTGAACAAAAAAAAAACAAAACAA AACAAGAAAAACAGGCCATACACCCGTCATATAGGCGGGATCGCAGTAAGAATTGCCGCT GATATAGCATATTCCAATAACAACCTTGAAAGAACATGTTTTGAGGGCGTTGTCTTGCAC ATTTGACTAATCAACTCGATACAGTGCCGTGAAATTTCGAATTGTACCATACTGACAGTA CCAAAATTGTGATGCAAGTTTGTACGAGAACGAGTACTGTAGCTGTAGCATGGTTTATAC CACGAGTTGCTGGACCCACTGTGATTACTAAAACATAGCCAACCATTCATATCCACATGA TACAGCAGCAGTTAGCAGCAGTTGGAACTCATGCGCACCGTCTTGAATATGAGCACGGCT GAGTTGAACAATCAAGGTGAGAGGGTGGTAAATAAGTATAAGAGGAATTCCAGAGGTAGC TGCCAAATCTATATACCTCTAAGGTATCTGCCACCACCTTTGCAACCTGACAGCTCGCCA CGAGAGAAAAATTTCGCAAGCTACCACTCCCTACCCTTGCCCAGCTCATTACTGATTAGG TTCGTCCAGCAGGTAAAAAAATCTTGTGATAACGTTACGTGATTAGAGACGACACGACTA CAAGTAGACGCAAAGGATTCCTCTAGGAGCTGACGGAAACGATAGACGAACATTCACCCA GCAACTTGTTAAACACCGCGTCGACCACCTCACATTCAAAATGAAAGACCAGCGAGTCCA GGTGCCTGCATCGCCCTCGCAGTCCCAAGCAAACGCGTTTTCGGCTCTCTTTGCACCTTC TGACAAGAACTCCCTCTCCAGCTTTGCCTACAACCCTTCGAAACCGGCATCCAATGGACA GAGACCCTCGGCAGCAACTCCCGCAGCCTTCACTGAAGAGCAGAAGCAGCGCCTGCAACG ACTGAAGCCCATGTTCAACCAGCTGCCCCTCGTCAAGCTGAGAGAAGCAGTTGCTAACAG CACCGATTTCGACCGAGCTGTGGCATACCTTCGACGACAGGGTAGCATTCCATCTTCTTC TATAAACTCCGCGCCAACCCATCGAAAGCCTGCTTCACTGGTCAATCCGTTGGCTTTCCT CAACCCTTCTGTCACCACCAAGCCTGCCCCTCTGTCAACCAAACGGGAGGTTTCCAAGTC CAAGAGTATCAGAGAGAAATGGAAAAAGGAGGAGCCTAAAGCTGAGGACGATCTCGTTGT CATGGAAGACAAGCCTAGTGCACCCAAGAGAAGGCTGGTGCAGGCAAAAGACCGAGCAGG ATATACCCCTCAGCCCACACGGTCTCCCACTCCATTGTCTCCTGTGTCCATTTCTCGAGA CACTCCCATGGATATCGATGAAGCAGAGGCCGACGAATCTGACGACTTCTCTGAGAAGGA AGATGTCGACCCTGGCTTCGATCTCAGAGTGTTGGAGTTCCTCAACACCTCTTCTCAGAG AGACCTCATCGACCTGTCGTCGTGTTCCCAGAAGGTCGGTGAGCTCATGGTCAAGAACCG ACCCTACAAGAACCTGCTGCAGGCCCAAAACGTTGAGTTTTCTGACTCGGAATCCTCATC TACTGAGAAGAAGCGGGGACGAGGAGGCCGAAAGCGAAACGCCGGAGAACGTATTGTTGA TGCCGTCTCCGCTACTCTTCGAGGCTACGAGGCGGTGGACTCGTTGATCCAGAAGTGTGA CGTCTTGGGTAACCAGGTGGCTGCCGATATCAAATCTTGGGGTGTGGATATTTTTGGAGC TAAGGACGGAGAGGGTGTTGACATTACAGACATAGACGAGGACGTTGTCAAGAAGTCGAC GGTCAAGTTCCTCACCGAGAAGCCCTCTATTCTCAGTGACGATCTTGTTCTGAAGGACTA CCAGCAGGTTGGAATCAACTGGCTGTATCTGCTCTACAAGAAACGGCTATCTTGCATTCT GGCCGATGAGATGGGTCTCGGAAAGACCTGCCAGGTCATCTCTTTCATGGCTCTGCTGAA GGAGCAAGGAGAACACGAGGGTCCCCATCTTGTCGTGGTGCCTTCTTCGACACTGGAAAA CTGGCTGCGAGAGTTCCAGAAATTCGCACCCTCTCTAGTTGTCGAGCCTTACTATGGCTC CCAGAACGAACGCGCCGAGATGCGAGAGACTCTGTCCGACCCCGAGAATAAATACGACGT CATTGTGACCACCTACAACCTGGCTTGTGGCACCAAATTCGATGTGTCCTTCCTAAAGTC CATCAAATTTAATTGCTGTGTTTACGATGAGGGCCATATGCTGAAGAACTCTCAGACTGA CCGATACAACAAGCTAATGCGACTCAAGGCCAATTTCCGACTTTTGCTGACAGGTACTCC TCTGCAGAATAACCTCCGGGAGCTTGTGTCTCTGCTTGCATTTATCATTCCGTCTCTATT TGATGGTTGTAAGGACGATTTGGCAGAGATATTCAAGCACAAGGCCACCACTAATGACGC CTCTTCACACATGCCTCTCTTGTCACAGCAGCGAGTTAACCGTGCAAAGACAATGATGAC TCCATTCATTCTGCGACGAAAGAAGGAGCAGGTGCTGAAGCATCTGCCTCCCAAGACTCA CGAAGTCGCATACTGCCATCTTTCTCCTGATCAACAGGCCATTTACGACGAGCAGATGGA GCGAATGAGACAGATGCGACGAGACAGAGCTGCTGGCAAGCCTTCCTCTCGAGTTGGAAA CCCCCTCATGCTTCTACGTAAGGCGGCTCTTCATCATCTGCTTTTCCGACGAAAGTTCGA CGATGACACTCTCAAGAGCATGTCCAAGGAGATCATGAAGGAGGAGCGGTACTACGATGC TAACCGAGACTACATCCGAGAGGATATGGAGGTCATGAGCGACTTTGAGCTAAACAGATT AGCTCTCCAGTTCCCCAGCATCGAGAAGTATGCTCTGGAAGAGGAGCCATGGATGGACGC TGCCAAGGTTAAGAAGTTGGCCGAGATGCTGCCTATCATGAAGGAAAACAATGATAGAGT CCTCATTTTCTCGCAGTTCACCCAGTGCTTGGACATTCTTGAGTCTGTTCTTAACACTCT GGGAATTGCATTTCTCCGCCTGGACGGCCAAACCCCCGTCGAGGCCCGGCAAGACATGAT CGACAAGTACTATGAGGAGACTGATATCACCGTCTTCCTGCTCTCTACCAAAGCTGGAGG GTTTGGTATTAACTTGGCATGTGCTAACACTGTTATCATTTTCGATCTTTCGTTCAACCC TCATGACGATAAGCAGGCCGAGGATCGAGCTCATCGAGTCGGTCAGACTCGGGACGTCCG AGTCATTAGACTGGTGTGTAAGGGTACTGTGGAGGAGAAGATTCTTGAGCTCAACAATAC CAAGCTGGCTCTGGACAAAAGTGTGTCTGGTGAGGGTGGTGAAGAGGAGGCCAAGAAGAA CGAGTCCAAGATCGAGGAGATGCTGATGGCGGATGACGAGTAGCTATGTAACGGACGATA AATAAACGCATAAGATCTGAACTTAGAAACATTAATAGATTATTTACACTTTTTTGGGTA CTTTTACTCGCGCGAGTATTGGGATGTATGTACTGTACTTGTAGGTAAGACCTTCAAAAG AAACACATCATGACTTTGCAGGACGAGATACCAGCTACCCAAGGAAAAAGAGAGCAAGCC AGAAATAAAAAGTGATCATGCCGAGGATCGAACTCGGAGCCTCCCCGGTGTGAGCGGGAC GTGATAACCATTACACCACACGACCTTTTTCACGTGACATAAT
>YALI0F28831g.cds ATGAAAGACCAGCGAGTCCAGGTGCCTGCATCGCCCTCGCAGTCCCAAGCAAACGCGTTT TCGGCTCTCTTTGCACCTTCTGACAAGAACTCCCTCTCCAGCTTTGCCTACAACCCTTCG AAACCGGCATCCAATGGACAGAGACCCTCGGCAGCAACTCCCGCAGCCTTCACTGAAGAG CAGAAGCAGCGCCTGCAACGACTGAAGCCCATGTTCAACCAGCTGCCCCTCGTCAAGCTG AGAGAAGCAGTTGCTAACAGCACCGATTTCGACCGAGCTGTGGCATACCTTCGACGACAG GGTAGCATTCCATCTTCTTCTATAAACTCCGCGCCAACCCATCGAAAGCCTGCTTCACTG GTCAATCCGTTGGCTTTCCTCAACCCTTCTGTCACCACCAAGCCTGCCCCTCTGTCAACC AAACGGGAGGTTTCCAAGTCCAAGAGTATCAGAGAGAAATGGAAAAAGGAGGAGCCTAAA GCTGAGGACGATCTCGTTGTCATGGAAGACAAGCCTAGTGCACCCAAGAGAAGGCTGGTG CAGGCAAAAGACCGAGCAGGATATACCCCTCAGCCCACACGGTCTCCCACTCCATTGTCT CCTGTGTCCATTTCTCGAGACACTCCCATGGATATCGATGAAGCAGAGGCCGACGAATCT GACGACTTCTCTGAGAAGGAAGATGTCGACCCTGGCTTCGATCTCAGAGTGTTGGAGTTC CTCAACACCTCTTCTCAGAGAGACCTCATCGACCTGTCGTCGTGTTCCCAGAAGGTCGGT GAGCTCATGGTCAAGAACCGACCCTACAAGAACCTGCTGCAGGCCCAAAACGTTGAGTTT TCTGACTCGGAATCCTCATCTACTGAGAAGAAGCGGGGACGAGGAGGCCGAAAGCGAAAC GCCGGAGAACGTATTGTTGATGCCGTCTCCGCTACTCTTCGAGGCTACGAGGCGGTGGAC TCGTTGATCCAGAAGTGTGACGTCTTGGGTAACCAGGTGGCTGCCGATATCAAATCTTGG GGTGTGGATATTTTTGGAGCTAAGGACGGAGAGGGTGTTGACATTACAGACATAGACGAG GACGTTGTCAAGAAGTCGACGGTCAAGTTCCTCACCGAGAAGCCCTCTATTCTCAGTGAC GATCTTGTTCTGAAGGACTACCAGCAGGTTGGAATCAACTGGCTGTATCTGCTCTACAAG AAACGGCTATCTTGCATTCTGGCCGATGAGATGGGTCTCGGAAAGACCTGCCAGGTCATC TCTTTCATGGCTCTGCTGAAGGAGCAAGGAGAACACGAGGGTCCCCATCTTGTCGTGGTG CCTTCTTCGACACTGGAAAACTGGCTGCGAGAGTTCCAGAAATTCGCACCCTCTCTAGTT GTCGAGCCTTACTATGGCTCCCAGAACGAACGCGCCGAGATGCGAGAGACTCTGTCCGAC CCCGAGAATAAATACGACGTCATTGTGACCACCTACAACCTGGCTTGTGGCACCAAATTC GATGTGTCCTTCCTAAAGTCCATCAAATTTAATTGCTGTGTTTACGATGAGGGCCATATG CTGAAGAACTCTCAGACTGACCGATACAACAAGCTAATGCGACTCAAGGCCAATTTCCGA CTTTTGCTGACAGGTACTCCTCTGCAGAATAACCTCCGGGAGCTTGTGTCTCTGCTTGCA TTTATCATTCCGTCTCTATTTGATGGTTGTAAGGACGATTTGGCAGAGATATTCAAGCAC AAGGCCACCACTAATGACGCCTCTTCACACATGCCTCTCTTGTCACAGCAGCGAGTTAAC CGTGCAAAGACAATGATGACTCCATTCATTCTGCGACGAAAGAAGGAGCAGGTGCTGAAG CATCTGCCTCCCAAGACTCACGAAGTCGCATACTGCCATCTTTCTCCTGATCAACAGGCC ATTTACGACGAGCAGATGGAGCGAATGAGACAGATGCGACGAGACAGAGCTGCTGGCAAG CCTTCCTCTCGAGTTGGAAACCCCCTCATGCTTCTACGTAAGGCGGCTCTTCATCATCTG CTTTTCCGACGAAAGTTCGACGATGACACTCTCAAGAGCATGTCCAAGGAGATCATGAAG GAGGAGCGGTACTACGATGCTAACCGAGACTACATCCGAGAGGATATGGAGGTCATGAGC GACTTTGAGCTAAACAGATTAGCTCTCCAGTTCCCCAGCATCGAGAAGTATGCTCTGGAA GAGGAGCCATGGATGGACGCTGCCAAGGTTAAGAAGTTGGCCGAGATGCTGCCTATCATG AAGGAAAACAATGATAGAGTCCTCATTTTCTCGCAGTTCACCCAGTGCTTGGACATTCTT GAGTCTGTTCTTAACACTCTGGGAATTGCATTTCTCCGCCTGGACGGCCAAACCCCCGTC GAGGCCCGGCAAGACATGATCGACAAGTACTATGAGGAGACTGATATCACCGTCTTCCTG CTCTCTACCAAAGCTGGAGGGTTTGGTATTAACTTGGCATGTGCTAACACTGTTATCATT TTCGATCTTTCGTTCAACCCTCATGACGATAAGCAGGCCGAGGATCGAGCTCATCGAGTC GGTCAGACTCGGGACGTCCGAGTCATTAGACTGGTGTGTAAGGGTACTGTGGAGGAGAAG ATTCTTGAGCTCAACAATACCAAGCTGGCTCTGGACAAAAGTGTGTCTGGTGAGGGTGGT GAAGAGGAGGCCAAGAAGAACGAGTCCAAGATCGAGGAGATGCTGATGGCGGATGACGAG TAG
>YALI0F28831g.aa MKDQRVQVPASPSQSQANAFSALFAPSDKNSLSSFAYNPSKPASNGQRPSAATPAAFTEE QKQRLQRLKPMFNQLPLVKLREAVANSTDFDRAVAYLRRQGSIPSSSINSAPTHRKPASL VNPLAFLNPSVTTKPAPLSTKREVSKSKSIREKWKKEEPKAEDDLVVMEDKPSAPKRRLV QAKDRAGYTPQPTRSPTPLSPVSISRDTPMDIDEAEADESDDFSEKEDVDPGFDLRVLEF LNTSSQRDLIDLSSCSQKVGELMVKNRPYKNLLQAQNVEFSDSESSSTEKKRGRGGRKRN AGERIVDAVSATLRGYEAVDSLIQKCDVLGNQVAADIKSWGVDIFGAKDGEGVDITDIDE DVVKKSTVKFLTEKPSILSDDLVLKDYQQVGINWLYLLYKKRLSCILADEMGLGKTCQVI SFMALLKEQGEHEGPHLVVVPSSTLENWLREFQKFAPSLVVEPYYGSQNERAEMRETLSD PENKYDVIVTTYNLACGTKFDVSFLKSIKFNCCVYDEGHMLKNSQTDRYNKLMRLKANFR LLLTGTPLQNNLRELVSLLAFIIPSLFDGCKDDLAEIFKHKATTNDASSHMPLLSQQRVN RAKTMMTPFILRRKKEQVLKHLPPKTHEVAYCHLSPDQQAIYDEQMERMRQMRRDRAAGK PSSRVGNPLMLLRKAALHHLLFRRKFDDDTLKSMSKEIMKEERYYDANRDYIREDMEVMS DFELNRLALQFPSIEKYALEEEPWMDAAKVKKLAEMLPIMKENNDRVLIFSQFTQCLDIL ESVLNTLGIAFLRLDGQTPVEARQDMIDKYYEETDITVFLLSTKAGGFGINLACANTVII FDLSFNPHDDKQAEDRAHRVGQTRDVRVIRLVCKGTVEEKILELNNTKLALDKSVSGEGG EEEAKKNESKIEEMLMADDE*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/YALI/YALI0F28831g