YALI0E32835g


uniprot|O74934 Yarrowia lipolytica YALI0E32835g POX1 Acyl-CoA oxidase 1 (EC 1.3.3.6), peroxisomal, specific toward dicarboxilic acid

Genomic environment map

Element type: CDS
Element length: 2034 nucleotides,
on sense strand of
Yali0E: 3897102..3899135.
Other names:
YALI-CDS1242.1
YALI-IPF2836
Coding sequence: 678 codons.
Database cross references:
EMBL: AJ001299
EMBL: CR382131
GeneID: 2912003
HOGENOM: O74934

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0456 GL3C0456.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0E32835p  


uniprot|O74934 Yarrowia lipolytica POX1 Acyl-CoA oxidase 1 (EC 1.3.3.6), peroxisomal, specific toward dicarboxilic acid

Protein domain map

Protein length: 677 amino acids
Protein family: GL3C0456
Database cross references:
Gene3D: G3DSA:1.10.540.10
Gene3D: G3DSA:1.20.140.10
Gene3D: G3DSA:2.40.110.10
HSSP: P07872
InterPro: IPR002655
InterPro: IPR006090
InterPro: IPR006091
InterPro: IPR012258
InterPro: IPR013764
InterPro: IPR013786
KEGG: yli:YALI0E32835g
PANTHER: PTHR10909:SF11
PIRSF: PIRSF000168
Pfam: PF00441
Pfam: PF01756
Pfam: PF02770
RefSeq: XP_504703.1
UniProtKB/Swiss-Prot: O74934
UniprotKB: ACOX1_YARLI

Computed results for YALI0E32835p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>YALI0E32835g.nt
TGAGTATAATCCGTCTCCGTTCCCACTTGAGGGGAGGACTTTGGACGGGTTTCGGTGCAA
CCTCGTCAGGTGAATAATCATGATTCATTTGCGTGATCTGGCTGGGACAAAATACAAGTG
CTTGTATATGGTATACCAGGAGGACCACTTATGAGAATGCCAAAGAAAATTGAAGGGGAA
AATAAATGCTGCCACCCGATACTCGAAATATACCATTGGACGTGTTTTCCACATAGGTAA
TCGCCACTCCATACCCGAGATCTGAAGTGAAACATTCGTGTTCTGTAGCCGTTAGCCCAC
CCCCACACACTGAGACACTCGGCTGATGTTTCGAAATTCATGCTACTGTACGGTACGGAC
TTAACACTTATCCCCATTGTCCTGGGACTACTACATTGTAGTATACTCTACACATCCAAC
AAGCAATACTTTTTCGTACAAGTAGGTAGTTGTGCGAGTACTCGTACAATGGAAAAACAT
TATACATGAAGCAGGGCGTGCGTCGGTCTGTAGGGTGAAATAGGGAGCCGACAGTGCATT
TCCAGCATTCTAGATTGATCTGAATCAATAATTTGTGGGGGAAATTGGCACTAGTGCTTT
GTTTATTAGCTATTTGACAGTCTGATGATGAGAAAGTCGTTCTTCGGCTTAGCCCAACCC
CACACTCACTACGCACCTGGCACAGGGTGGTGCATATAATATGCAAGTACGGAGATGTAC
TAACGCTAGCCAGGAGAGAGTGACGAAACGTAGCCATCCGTTGAGCGGGCAGACGTGGGA
GTAATATAGTCGTTTAGGGCACGTGATTGCGACCCCAGTGGTATCATCTTTGTCTCCCAA
ACTCTTCGTAATGCCCACCCAGGACGATCCTCTCTGAACTTTACTACCCCACATTCCCGA
GACTATAAAGTATAGACGGCTCCTACAAAAGAGAGCAGGTGGAATAACACACTATTGACC
GACAATGACAACCAACACATTCACCGATCCTCCTGTGGAGATGGCCAAGGAGCGAGGTAA
GACTCAATTCACTGTCCGAGATGTGACCAACTTCCTCAATGGTGGAGAAGAAGAGACCCA
GATTGTCGAGAAGATCATGAGCAGTATTGAACGTGATCCAGTACTGTCTGTCACTGCTGA
CTACGACTGCAACCTTCAGCAGGCCCGAAAACAGACCATGGAGCGGGTGGCTGCTCTGTC
GCCTTATCTGGTCACCGATACTGAGAAGCTATCTCTGTGGCGTGCGCAACTGCATGGAAT
GGTTGATATGTCTACTCGTACGCGGTTGTCGATCCACAACAACCTGTTCATTGGTTCCAT
CAGGGGATCTGGTACTCCTGAACAGTTCAAGTACTGGGTCAAGAAGGGAGCGGTGGCTGT
TAAGCAGTTCTATGGATGCTTTGCCATGACAGAGTTGGGCCATGGAAGCAACCTCAAGGG
ACTAGAGACAACCGCCACTTATGACCAGGACAGTGACCAGTTCATTATCAACACTCCTCA
TATTGGTGCTACCAAGTGGTGGATTGGCGGTGCAGCCCACACTTCCACCCATTGTGTTTG
TTTCGCGAAACTGATTGTGCATGGCAAGGACTATGGTACTCGAAACTTTGTGGTACCTCT
CCGAAATGTCCACGATCACAGTCTCAAGGTCGGTGTTTCAATTGGAGACATTGGAAAGAA
GATGGGCAGAGATGGTGTTGACAATGGCTGGATCCAGTTCACCAATGTTCGAATCCCCAG
ACAGAACATGCTAATGAGATATGCCAAGGTGTCTGATACTGGAGTGGTAACCAAACCCGC
TCTTGACCAACTCACTTATGGAGCCCTCATTCGAGGTCGAGTGTCCATGATTGCCGACTC
GTTCCACGTCTCCAAACGATTCCTCACAATTGCTCTTCGGTACGCTTGTGTCCGACGACA
GTTTGGAACCTCTGGAGACACTAAGGAGACCAAGATCATCGACTACCCTTACCACCAGCG
ACGATTGCTGCCTCTTCTGGCCTACTGCTACGCTATGAAGATGGGTGCTGATGAGGCTCA
GAAGACTTGGATTGAGACCACCGATCGAATTCTGGCTCTCAATCCCAACGACCCCGCCCA
GAAGAACGATCTGGAGAAGGCCGTCACCGACACAAAGGAGCTGTTTGCTGCGTCTGCAGG
AATGAAGGCATTTACCACGTGGGGATGTGCCAAAATCATTGATGAGTGCCGACAGGCCTG
TGGAGGTCATGGATACTCTGGATATAACGGATTTGGCCAGGGCTACGCTGACTGGGTTGT
CCAGTGTACCTGGGAAGGAGACAACAACGTTCTGTGTCTGTCAATGGGCCGAGGGCTGGT
TCAGTCAGCTCTACAGATTTTGGCTGGAAAGCACGTCGGTGCTTCTATTCAGTACGTAGG
AGACAAGTCTAAAATCTCCCAGAACGGCCAGGGTACCCCCAGAGAGCAACTTCTGTCCCC
CGAGTTTCTAGTAGAAGCTTTCAGAACGGCTTCTCGAAACAACATTCTCAGAACCACCGA
TAAATACCAAGAGCTTGTCAAAACTCTCAATCCCGACCAGGCCTTTGAGGAGCTGTCTCA
GCAGAGATTCCAGTGTGCTCGAATCCACACACGACAGCATCTTATCTCTTCATTCTATGC
CCGAATTGCCACTGCCAAAGACGATATCAAGCCCCATCTGCTGAAACTGGCCAATCTGTT
TGCCCTCTGGTCAATTGAGGAGGACACTGGAATCTTCCTGCGGGAGAACATCCTCACCCC
TGGAGACATTGACCTGATCAACAGTCTTGTGGACGAGCTCTGTGTTGCAGTTCGAGATCA
GGTAATTGGACTCACTGATGCCTTTGGTCTCTCTGACTTCTTCATTAACGCTCCCATCGG
CTCCTACGATGGTAATGTTTACGAAAAGTACTTTGCCAAGGTCAACCAGCAAAACCCCGC
TACTAACCCTCGTCCTCCCTACTACGAGTCGACTCTCAAGCCCTTCTTGTTCCGAGAAGA
GGAGGACGATGAAATTTGCGATCTCGATGAGTGATGTTAGTAATTATGAGATACATTCTT
TGCAATGATGAGGGCTCTTGTAGAGTATTCTGCTGTAAAGGAGATCTTGAATTTTGTTGT
TGAATTTGTGTAATTGTCTTATGATTATTGTCTTGTGTAGCGCTTTTCAAGCCTGCAGGA
AAAGTGAAATGAACATGTTATTGCCATATCATCCGTTATTGGTGTTTAGTTCCTATTGGG
CTTGATTACCAACGATCCGGGTCGAAAGACCCCTCCGTTTAGGGGTGGTAAGTCTCAAGT
GGCTGAACACGTGCATGCACTGTACATCGCCGAG

Coding sequence    

>YALI0E32835g.cds
ATGGCCAAGGAGCGAGGTAAGACTCAATTCACTGTCCGAGATGTGACCAACTTCCTCAAT
GGTGGAGAAGAAGAGACCCAGATTGTCGAGAAGATCATGAGCAGTATTGAACGTGATCCA
GTACTGTCTGTCACTGCTGACTACGACTGCAACCTTCAGCAGGCCCGAAAACAGACCATG
GAGCGGGTGGCTGCTCTGTCGCCTTATCTGGTCACCGATACTGAGAAGCTATCTCTGTGG
CGTGCGCAACTGCATGGAATGGTTGATATGTCTACTCGTACGCGGTTGTCGATCCACAAC
AACCTGTTCATTGGTTCCATCAGGGGATCTGGTACTCCTGAACAGTTCAAGTACTGGGTC
AAGAAGGGAGCGGTGGCTGTTAAGCAGTTCTATGGATGCTTTGCCATGACAGAGTTGGGC
CATGGAAGCAACCTCAAGGGACTAGAGACAACCGCCACTTATGACCAGGACAGTGACCAG
TTCATTATCAACACTCCTCATATTGGTGCTACCAAGTGGTGGATTGGCGGTGCAGCCCAC
ACTTCCACCCATTGTGTTTGTTTCGCGAAACTGATTGTGCATGGCAAGGACTATGGTACT
CGAAACTTTGTGGTACCTCTCCGAAATGTCCACGATCACAGTCTCAAGGTCGGTGTTTCA
ATTGGAGACATTGGAAAGAAGATGGGCAGAGATGGTGTTGACAATGGCTGGATCCAGTTC
ACCAATGTTCGAATCCCCAGACAGAACATGCTAATGAGATATGCCAAGGTGTCTGATACT
GGAGTGGTAACCAAACCCGCTCTTGACCAACTCACTTATGGAGCCCTCATTCGAGGTCGA
GTGTCCATGATTGCCGACTCGTTCCACGTCTCCAAACGATTCCTCACAATTGCTCTTCGG
TACGCTTGTGTCCGACGACAGTTTGGAACCTCTGGAGACACTAAGGAGACCAAGATCATC
GACTACCCTTACCACCAGCGACGATTGCTGCCTCTTCTGGCCTACTGCTACGCTATGAAG
ATGGGTGCTGATGAGGCTCAGAAGACTTGGATTGAGACCACCGATCGAATTCTGGCTCTC
AATCCCAACGACCCCGCCCAGAAGAACGATCTGGAGAAGGCCGTCACCGACACAAAGGAG
CTGTTTGCTGCGTCTGCAGGAATGAAGGCATTTACCACGTGGGGATGTGCCAAAATCATT
GATGAGTGCCGACAGGCCTGTGGAGGTCATGGATACTCTGGATATAACGGATTTGGCCAG
GGCTACGCTGACTGGGTTGTCCAGTGTACCTGGGAAGGAGACAACAACGTTCTGTGTCTG
TCAATGGGCCGAGGGCTGGTTCAGTCAGCTCTACAGATTTTGGCTGGAAAGCACGTCGGT
GCTTCTATTCAGTACGTAGGAGACAAGTCTAAAATCTCCCAGAACGGCCAGGGTACCCCC
AGAGAGCAACTTCTGTCCCCCGAGTTTCTAGTAGAAGCTTTCAGAACGGCTTCTCGAAAC
AACATTCTCAGAACCACCGATAAATACCAAGAGCTTGTCAAAACTCTCAATCCCGACCAG
GCCTTTGAGGAGCTGTCTCAGCAGAGATTCCAGTGTGCTCGAATCCACACACGACAGCAT
CTTATCTCTTCATTCTATGCCCGAATTGCCACTGCCAAAGACGATATCAAGCCCCATCTG
CTGAAACTGGCCAATCTGTTTGCCCTCTGGTCAATTGAGGAGGACACTGGAATCTTCCTG
CGGGAGAACATCCTCACCCCTGGAGACATTGACCTGATCAACAGTCTTGTGGACGAGCTC
TGTGTTGCAGTTCGAGATCAGGTAATTGGACTCACTGATGCCTTTGGTCTCTCTGACTTC
TTCATTAACGCTCCCATCGGCTCCTACGATGGTAATGTTTACGAAAAGTACTTTGCCAAG
GTCAACCAGCAAAACCCCGCTACTAACCCTCGTCCTCCCTACTACGAGTCGACTCTCAAG
CCCTTCTTGTTCCGAGAAGAGGAGGACGATGAAATTTGCGATCTCGATGAGTGA

Predicted translation product    

>YALI0E32835g.aa
MAKERGKTQFTVRDVTNFLNGGEEETQIVEKIMSSIERDPVLSVTADYDCNLQQARKQTM
ERVAALSPYLVTDTEKLSLWRAQLHGMVDMSTRTRLSIHNNLFIGSIRGSGTPEQFKYWV
KKGAVAVKQFYGCFAMTELGHGSNLKGLETTATYDQDSDQFIINTPHIGATKWWIGGAAH
TSTHCVCFAKLIVHGKDYGTRNFVVPLRNVHDHSLKVGVSIGDIGKKMGRDGVDNGWIQF
TNVRIPRQNMLMRYAKVSDTGVVTKPALDQLTYGALIRGRVSMIADSFHVSKRFLTIALR
YACVRRQFGTSGDTKETKIIDYPYHQRRLLPLLAYCYAMKMGADEAQKTWIETTDRILAL
NPNDPAQKNDLEKAVTDTKELFAASAGMKAFTTWGCAKIIDECRQACGGHGYSGYNGFGQ
GYADWVVQCTWEGDNNVLCLSMGRGLVQSALQILAGKHVGASIQYVGDKSKISQNGQGTP
REQLLSPEFLVEAFRTASRNNILRTTDKYQELVKTLNPDQAFEELSQQRFQCARIHTRQH
LISSFYARIATAKDDIKPHLLKLANLFALWSIEEDTGIFLRENILTPGDIDLINSLVDEL
CVAVRDQVIGLTDAFGLSDFFINAPIGSYDGNVYEKYFAKVNQQNPATNPRPPYYESTLK
PFLFREEEDDEICDLDE*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites