YALI0A11649g


similar to uniprot|Q6CH30 Yarrowia lipolytica YALI0A13233g threonine-rich protein

Genomic environment map

Element type: CDS
Element length: 2262 nucleotides,
on anti-sense strand of
Yali0A: complement(1217592..1219853).
Other names:
YALI-CDS0960.1
YALI-IPF10578
Coding sequence: 754 codons.
Database cross references:
EMBL: CR382127
GeneID: 2906178
GenomeReviews: CR382127_GR

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C5301
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0A11649p  


similar to uniprot|Q6CH30 Yarrowia lipolytica YALI0A13233g threonine-rich protein; SubName: Full=YALI0A11649p;

Protein domain map

Protein length: 753 amino acids
Protein family: GL3C5301
Database cross references:
KEGG: yli:YALI0A11649g
RefSeq: XP_499987.1
UniProtKB/TrEMBL: Q6CH75
UniProtKB: Q6CH75_YARLI

Computed results for YALI0A11649p  

None available yet

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>YALI0A11649g.nt
ATGCGTGATCCGTATGCCATTAAATACTGGCTTTCGAGATCCGGCTCTCCCCCGTACACT
CTTGCGCAGTACAACGAAGCCATCAACCAGGTGTACAAGATGTACGCCGCCCAGCAGGCT
GCTAGATGGAAGGAGACCCAAGAACAAGTCACGGTGGTCAACAACTGGCCCCCCGCTGGA
AAGTGTATCCCCCTTGCCTCGATCACCGCCACAACTGTCGTTCCTGGTACTAAAGCTGGA
ACCACCACAATCTCCAGCGGTGGTCGCTACCATGTGGAGATTACCAAAGTTCCTGTTACC
ACCAACACTATCCCTGGATTGTCTGCTGGAACACGAACCCTGGTAAATTCTCAGGGAAAC
TTCTACAATGAGGTTGTAGTTGTTCCTGCCTCGACAACCACCGTGCCAGGCCACACAGCT
GGAACTGCGACTCTTACGGATGCTGACGGCGGATTTTATGAAGTGGTCACGCTCGTTCCT
GTGTCAACCATCACTGTTCCGGGTTTAGTCACTGGCACGGAAACCCTCGTCAATCGCAAT
GGGAGTTTTTATGAGCAGGTAACCGTCGTCCCCACGTCAACAACGGTCGTTCCAGGTCAT
ATTGTTGGAACTGCCACTATTCGAAACAGCGATGGCGGCTTCAACGAGCGAGTCACGAGT
GTTCCTGTCTCCACCAACACCGTTCCTGGATTGATACCAGGAACTGCCACCCTTGTCAAC
CACAATGGGAGTTTTTATGAGCAGGTAACCGTCGTCCCCACGTCAACAACGGTCGTGCCA
GGCCATACAGCAGGAACTGAAACTCTGCGAAACACTGATGGTGGGTTCTATGTTCGGGCT
ACTGTCATCCCTGGGTCAACAACAGTGGTTCCGGGCCACACGGCTGGAACCGCGACTCTT
CAGAACACTGATGGCAGTTTCTATGAAGTAGTTACCAATGTTCCCGTCTCCACAAACATT
GTTCTAGGTCTGACTGCTGGCACAGAAACCCTCGTCGATAGTAAAGGCAACTTCTATGAG
CAGGTAACGCGCACTGTGGTCTCGACAAACACTGTCTTAGGAGAATCTTCTGGTACTGAG
ACACTCACCAATCCCCACGGTGACTTCTATGAGCAGGTAACCAGTGTTTCAAGCCCACCT
ATCTTTTCAACTCGTTCCTCCGGTAGCATTGTTTCCACAAACATTGTCCCAGGATACAGA
GAAGGTACGGAAATCCTCACTGGCACCAATGGAAACCTGTTTGTTCAAGTGACACGCACT
CCAATCTCCACCAACACTATTTCTGGATCCATTGTTGGCACAGAGATCCTTATCAACTCG
GACGGAGATTTTTTCGAGCAAGTAACCAGCATTCGAGCCTCCTCCATCGTTGATGCAACC
TCCTCAGCCTCTGCCTCCACTGATAGCTTCTCGTCAACTGGAAGCACCTCGTCAACTGGC
GTTCCTTCCTTGACACGTATTAGTGGTACATTCACTGTTTTTGCCACGAAAGATTCCTCT
CCATCTGCTTTCGTCGCCACAACCATCTCTTTTGGACACTCCAAAGTAGCTGAAACCATC
AGAGACACCACTGGAGACGCTATGGAGCAGGCGACTAGTACTCCATTCTCCACCAGCCCT
CTTGTAGGATCCGTTATCGACACGGAAATACTGACCAAGTCGAACGGAGATGTCAGCATC
CCGGTCCCAACAGCTGCTAATACCAAGTCCTCTGCAACTTCCTCGACTGATATCATCACA
GTCGTCCCTGACTCCGTGCATTTTGGATCCACAAAAGAAGTCAAGTCTCCCACTGACACC
ATCAGTAGTTCCTTTGTGCATGAGACGGGCGCCACCGTCGCCACCAAGACTTCTACTGGA
ATCACCGTTGCGATAGAAATACATGCGAGCCCCAGCAGTGTCCAGTATAAGCAGACTACC
GGTTTCCCGAAAGTTGCTTCCACTATGGCCATCGCCTCCAACAGTGTTGTCTCCGCTCTC
GTGGCTCCTAGACCTACAACACACACCAGTCTCAGAAACTCCGAAGATGAGTTCACTGGG
CTGAGTACCAGCGTTAATGTCTCGAAAAACATGGTGTCGACTGTATCTGGACCCAACGAA
TCTCAGACAAGCAAGGGACCTGGGGACGGCTCAATCAGCGTTGGCGGAGTTCCTCTCTCT
GCCCAGCCTCCCGGCCTTGAGTTTCCCATCCAGGCCAACAGTGCTTCCCGACTGGCTATC
GGTGCCGTTCTCGTCCTTCCAATGGCCCTTACTCTTCTGTGA

Coding sequence    

>YALI0A11649g.cds
ATGCGTGATCCGTATGCCATTAAATACTGGCTTTCGAGATCCGGCTCTCCCCCGTACACT
CTTGCGCAGTACAACGAAGCCATCAACCAGGTGTACAAGATGTACGCCGCCCAGCAGGCT
GCTAGATGGAAGGAGACCCAAGAACAAGTCACGGTGGTCAACAACTGGCCCCCCGCTGGA
AAGTGTATCCCCCTTGCCTCGATCACCGCCACAACTGTCGTTCCTGGTACTAAAGCTGGA
ACCACCACAATCTCCAGCGGTGGTCGCTACCATGTGGAGATTACCAAAGTTCCTGTTACC
ACCAACACTATCCCTGGATTGTCTGCTGGAACACGAACCCTGGTAAATTCTCAGGGAAAC
TTCTACAATGAGGTTGTAGTTGTTCCTGCCTCGACAACCACCGTGCCAGGCCACACAGCT
GGAACTGCGACTCTTACGGATGCTGACGGCGGATTTTATGAAGTGGTCACGCTCGTTCCT
GTGTCAACCATCACTGTTCCGGGTTTAGTCACTGGCACGGAAACCCTCGTCAATCGCAAT
GGGAGTTTTTATGAGCAGGTAACCGTCGTCCCCACGTCAACAACGGTCGTTCCAGGTCAT
ATTGTTGGAACTGCCACTATTCGAAACAGCGATGGCGGCTTCAACGAGCGAGTCACGAGT
GTTCCTGTCTCCACCAACACCGTTCCTGGATTGATACCAGGAACTGCCACCCTTGTCAAC
CACAATGGGAGTTTTTATGAGCAGGTAACCGTCGTCCCCACGTCAACAACGGTCGTGCCA
GGCCATACAGCAGGAACTGAAACTCTGCGAAACACTGATGGTGGGTTCTATGTTCGGGCT
ACTGTCATCCCTGGGTCAACAACAGTGGTTCCGGGCCACACGGCTGGAACCGCGACTCTT
CAGAACACTGATGGCAGTTTCTATGAAGTAGTTACCAATGTTCCCGTCTCCACAAACATT
GTTCTAGGTCTGACTGCTGGCACAGAAACCCTCGTCGATAGTAAAGGCAACTTCTATGAG
CAGGTAACGCGCACTGTGGTCTCGACAAACACTGTCTTAGGAGAATCTTCTGGTACTGAG
ACACTCACCAATCCCCACGGTGACTTCTATGAGCAGGTAACCAGTGTTTCAAGCCCACCT
ATCTTTTCAACTCGTTCCTCCGGTAGCATTGTTTCCACAAACATTGTCCCAGGATACAGA
GAAGGTACGGAAATCCTCACTGGCACCAATGGAAACCTGTTTGTTCAAGTGACACGCACT
CCAATCTCCACCAACACTATTTCTGGATCCATTGTTGGCACAGAGATCCTTATCAACTCG
GACGGAGATTTTTTCGAGCAAGTAACCAGCATTCGAGCCTCCTCCATCGTTGATGCAACC
TCCTCAGCCTCTGCCTCCACTGATAGCTTCTCGTCAACTGGAAGCACCTCGTCAACTGGC
GTTCCTTCCTTGACACGTATTAGTGGTACATTCACTGTTTTTGCCACGAAAGATTCCTCT
CCATCTGCTTTCGTCGCCACAACCATCTCTTTTGGACACTCCAAAGTAGCTGAAACCATC
AGAGACACCACTGGAGACGCTATGGAGCAGGCGACTAGTACTCCATTCTCCACCAGCCCT
CTTGTAGGATCCGTTATCGACACGGAAATACTGACCAAGTCGAACGGAGATGTCAGCATC
CCGGTCCCAACAGCTGCTAATACCAAGTCCTCTGCAACTTCCTCGACTGATATCATCACA
GTCGTCCCTGACTCCGTGCATTTTGGATCCACAAAAGAAGTCAAGTCTCCCACTGACACC
ATCAGTAGTTCCTTTGTGCATGAGACGGGCGCCACCGTCGCCACCAAGACTTCTACTGGA
ATCACCGTTGCGATAGAAATACATGCGAGCCCCAGCAGTGTCCAGTATAAGCAGACTACC
GGTTTCCCGAAAGTTGCTTCCACTATGGCCATCGCCTCCAACAGTGTTGTCTCCGCTCTC
GTGGCTCCTAGACCTACAACACACACCAGTCTCAGAAACTCCGAAGATGAGTTCACTGGG
CTGAGTACCAGCGTTAATGTCTCGAAAAACATGGTGTCGACTGTATCTGGACCCAACGAA
TCTCAGACAAGCAAGGGACCTGGGGACGGCTCAATCAGCGTTGGCGGAGTTCCTCTCTCT
GCCCAGCCTCCCGGCCTTGAGTTTCCCATCCAGGCCAACAGTGCTTCCCGACTGGCTATC
GGTGCCGTTCTCGTCCTTCCAATGGCCCTTACTCTTCTGTGA

Predicted translation product    

>YALI0A11649g.aa
MRDPYAIKYWLSRSGSPPYTLAQYNEAINQVYKMYAAQQAARWKETQEQVTVVNNWPPAG
KCIPLASITATTVVPGTKAGTTTISSGGRYHVEITKVPVTTNTIPGLSAGTRTLVNSQGN
FYNEVVVVPASTTTVPGHTAGTATLTDADGGFYEVVTLVPVSTITVPGLVTGTETLVNRN
GSFYEQVTVVPTSTTVVPGHIVGTATIRNSDGGFNERVTSVPVSTNTVPGLIPGTATLVN
HNGSFYEQVTVVPTSTTVVPGHTAGTETLRNTDGGFYVRATVIPGSTTVVPGHTAGTATL
QNTDGSFYEVVTNVPVSTNIVLGLTAGTETLVDSKGNFYEQVTRTVVSTNTVLGESSGTE
TLTNPHGDFYEQVTSVSSPPIFSTRSSGSIVSTNIVPGYREGTEILTGTNGNLFVQVTRT
PISTNTISGSIVGTEILINSDGDFFEQVTSIRASSIVDATSSASASTDSFSSTGSTSSTG
VPSLTRISGTFTVFATKDSSPSAFVATTISFGHSKVAETIRDTTGDAMEQATSTPFSTSP
LVGSVIDTEILTKSNGDVSIPVPTAANTKSSATSSTDIITVVPDSVHFGSTKEVKSPTDT
ISSSFVHETGATVATKTSTGITVAIEIHASPSSVQYKQTTGFPKVASTMAIASNSVVSAL
VAPRPTTHTSLRNSEDEFTGLSTSVNVSKNMVSTVSGPNESQTSKGPGDGSISVGGVPLS
AQPPGLEFPIQANSASRLAIGAVLVLPMALTLL*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites