CAGL0H04125g


highly similar to uniprot|P54783 Saccharomyces cerevisiae YML086c ALO1 D-Arabinono-1,4-lactone oxidase, catalyzes the final step in biosynthesis of D-erythroascorbic acid, which is protective against oxidative stress

Genomic environment map

Element type: CDS
Element length: 1578 nucleotides,
on anti-sense strand of
Cagl0H: complement(390841..392418).
Other names:
CAGL-CDS1932.1
CAGL-IPF1871
Coding sequence: 526 codons.
Database cross references:
EMBL: CR380954
GeneID: 2888523
HOGENOM: Q6FS20

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1598 GL3R1598.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0H04125p  


highly similar to uniprot|P54783 Saccharomyces cerevisiae YML086c ALO D-arabinono-1 4-lactone oxidase

Protein domain map

Protein length: 525 amino acids
Protein family: GL3R1598
Database cross references:
InterPro: IPR006093
InterPro: IPR006094
InterPro: IPR007173
InterPro: IPR010031
KEGG: cgr:CAGL0H04125g
PROSITE: PS00862
PROSITE: PS51387
Pfam: PF01565
Pfam: PF04030
RefSeq: XP_446974.1
TIGRFAMs: TIGR01678
UniProtKB/Swiss-Prot: Q6FS20
UniprotKB: ALO_CANGA

Computed results for CAGL0H04125p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0031966 mitochondrial membrane

Sequence data  

Nucleotide sequence

>CAGL0H04125g.nt
AGTCAGATGCAGCCGTTTACGTAATTGAGGCACCTGTCTTACGTGGTCGGCATCTAACCC
ACACGCTATAGTCTCCAAGCTAGGGATCGCTTCCAGTAGCTTGTAGACGTGCATGGCACT
GACACTGGTCGGCACAACGAGCCATTTCAATTGAGTCCATTTGCCCTTCAAGTTCTTCAG
CCGCGTGATCCAGTATGAAAGTATAAGGTCACCTACGGGGCACATCACGGATAACCCATG
AAGACACTGTAGATCAACTAATTGACGCATATCATCGACGTCTCTACATGCATGCACTTG
TAGATCAAGTGGTAGCTTCGTTGCGGATCCGCCAAGTAGATGACTCAATTCAACTACGCT
GACATCGCGGATCGAGAGCTTTTGTGAGAAGTGCTGTGGCATATCTCGAGCCTCCAATTG
TAAGAACAATAAATACAGCTCCAAGGTATCATTGCCCATGTTCACCACATAGCGATACAC
TTCCCACAGCAATTGGTGCGGTACTTGGGCTAGAACTTCCTTGTGACCGAAACGTGACCG
CGAAACCCATCCAGCTATCGTGTAGCTGCAACGACCAGTCAGCGTTTCATTCATATATAG
TCCTTACAAAAATAGCCTCCCTTCGAGACATTCTAAATTACAGGAAAATATAAGATTTAA
TAATACTCTGCCACTTAGCCAATTGATTATATAATTTCGTGGATTTCTTTAATATATGGA
CTATGTTCAACCTCATCGCATCACGAAGAACTGAAAAATTTCTGAAAATTTGGCAGCAAA
GGGAGAAGGGTTAACCCGAAATTGAGTCAAATTGTCATATAAAGCAAGGATACGGGGGGG
CATATTAACTCCTTGTTTTTATTGTTCAAACTTATATAAATCATATTTGTTTGCTTTTGT
TATTATCCCAAAGATAATCTCTTTGCAAGATCTAGCAATTGGAGTACTGGTATTTTTTTT
AAGTAAAGAGTTTCTCTAAACCAAAAAAAAGGACAATAACATGGATTTGAAAACATTTGG
TGGTCGGCGGAACTTTGTGTTTCGCAATTGGGCTGGTATCTATTCCTCAAGACCAGAATG
GTACTTTCAGCCTTCCTCAGTCGACGAAGTTGTCGAAATTGTCAAAGCTGCTAAACTAAA
GAACAAGACTATTGTTACAGTTGGCTCAGGCCACTCCCCTAGTAACATGTGTGTTACCGA
CGAATGGATGATGAACTTGGACAAGATGAACAAGTTACTAGATTTTGTGGAAAACGAGGA
TAAGACATACGCAGACGTTACTATTCAAGGTGGTACTAGGTTGTATAAGATCCACAAAAT
ATTAAGGGAAAAGGGATACGCCATGCAAAGTTTGGGGTCCATTTCTGAACAAAGTATTGG
TGGTATTATCTCTACTGGTACTCATGGTTCGTCTCCATTCCATGGTCTGGTATCTTCTAC
TATTGTCAACTTGACTGTTGTCAATGGTAAAGGTGAAGTATTATTTTTAGACGAAAAGTC
TAACCCTGAAGTTTTCAGGGCTGCTACTTTGTCGCTTGGTAAGATTGGTATTATTGTGGG
TGCAACTGTTCGTGTTGTTCCAGCTTTCAACATTAAATCAACCCAAGAAGTTATCAAATT
TGAAACTCTTTTGGAAAAATGGGACTCTCTCTGGACTTCTTCAGAGTTTATCAGAATTTG
GTGGTACCCATATACACGTAAGTGTATCCTTTGGAGAGGTGTAAAGACTAATGAACCACA
GACTAAGTCAAGATATTCGTGGTGGGGTTCTACTCTAGGTAGATTTTTCTACCAGACTTT
GTTGTTCATATCTACCAAGATTTACCCACCTTTGACTCCATATGTTGAAAGATTTGTCTT
CAGAAGACAATATGGTGAAGTTGAAACCCTAGGTAAAGGTGATGTGGCAATTGAGGATTC
TGTTACTGGGTTCAACATGGACTGTTTGTTTTCTCAGTTCGTTGATGAATGGGGTTGTCC
AATGGACAATGGTTTAGAAGTCTTGCGTTCTCTTGACCACTCAATTGCTCAAGCCGCTGC
TAACAAGGATTTCTATGTACATGTCCCAGTTGAGGTTCGTTGTGCAAACACAACTTTGCC
AAAGGAACAACCTGAAACTTCTTTCCGTTCTAACACTAGTAGAGGTCCAGTTTACGGTAA
CCTATTACGTCCTTACTTGGATAACACCCCATCTCAATGCTCTTATGCTCCTATCCACAG
TGTTACTAATAGTCAGTTGACGCTATACATTAACGCGACAATTTACAGACCATTCCACAC
CAATGCCCCAATTCACAAGTGGTTCACCTTGTTTGAAGATACAATGTCTGCTGCTGGCGG
TAAACCACATTGGGCTAAGAACTTCTTGGGCTCTACCTCTTTTGCTCAAGGTCAAGTTAA
GGCTGAAGGTCAATACCAAGACTATGAGATGAGAGGTATGGCTACAAGAGTCAAGGAATG
GTATGGTTCAGACTTGGAAACCTTTAAGAAGGTTAGAAGAGAACAAGACCCAGACAACAT
TTTCTTGGCAAACAAACAGTGGGCCTTGATCAACGGTATCATTGATGAAAACGAATAAAT
TTTTGAATACATTCCTTATAGCTAAGATATATTAATGTTATGGTTTCGTTTAGAGTAAAT
GTTCGATATATGTATTTCGAATTTTTCTCTAGATCTAGATGACTTGGTATATACTTGTCT
CTATTCCTGTCCTATAGTTACAAATTTACTGTCTTTATTTTTCAATTATGTCTCAATATA
CTAATAACTCTCAGTTTATTATTCGTGATATGATTTTATGTTTTTGGTCATTATTCTGAA
CTTACGAAATAGTGAAAACCATAGCATAATAATAAACAAAAAAATACAAAAAATCAAA

Coding sequence

>CAGL0H04125g.cds
ATGGATTTGAAAACATTTGGTGGTCGGCGGAACTTTGTGTTTCGCAATTGGGCTGGTATC
TATTCCTCAAGACCAGAATGGTACTTTCAGCCTTCCTCAGTCGACGAAGTTGTCGAAATT
GTCAAAGCTGCTAAACTAAAGAACAAGACTATTGTTACAGTTGGCTCAGGCCACTCCCCT
AGTAACATGTGTGTTACCGACGAATGGATGATGAACTTGGACAAGATGAACAAGTTACTA
GATTTTGTGGAAAACGAGGATAAGACATACGCAGACGTTACTATTCAAGGTGGTACTAGG
TTGTATAAGATCCACAAAATATTAAGGGAAAAGGGATACGCCATGCAAAGTTTGGGGTCC
ATTTCTGAACAAAGTATTGGTGGTATTATCTCTACTGGTACTCATGGTTCGTCTCCATTC
CATGGTCTGGTATCTTCTACTATTGTCAACTTGACTGTTGTCAATGGTAAAGGTGAAGTA
TTATTTTTAGACGAAAAGTCTAACCCTGAAGTTTTCAGGGCTGCTACTTTGTCGCTTGGT
AAGATTGGTATTATTGTGGGTGCAACTGTTCGTGTTGTTCCAGCTTTCAACATTAAATCA
ACCCAAGAAGTTATCAAATTTGAAACTCTTTTGGAAAAATGGGACTCTCTCTGGACTTCT
TCAGAGTTTATCAGAATTTGGTGGTACCCATATACACGTAAGTGTATCCTTTGGAGAGGT
GTAAAGACTAATGAACCACAGACTAAGTCAAGATATTCGTGGTGGGGTTCTACTCTAGGT
AGATTTTTCTACCAGACTTTGTTGTTCATATCTACCAAGATTTACCCACCTTTGACTCCA
TATGTTGAAAGATTTGTCTTCAGAAGACAATATGGTGAAGTTGAAACCCTAGGTAAAGGT
GATGTGGCAATTGAGGATTCTGTTACTGGGTTCAACATGGACTGTTTGTTTTCTCAGTTC
GTTGATGAATGGGGTTGTCCAATGGACAATGGTTTAGAAGTCTTGCGTTCTCTTGACCAC
TCAATTGCTCAAGCCGCTGCTAACAAGGATTTCTATGTACATGTCCCAGTTGAGGTTCGT
TGTGCAAACACAACTTTGCCAAAGGAACAACCTGAAACTTCTTTCCGTTCTAACACTAGT
AGAGGTCCAGTTTACGGTAACCTATTACGTCCTTACTTGGATAACACCCCATCTCAATGC
TCTTATGCTCCTATCCACAGTGTTACTAATAGTCAGTTGACGCTATACATTAACGCGACA
ATTTACAGACCATTCCACACCAATGCCCCAATTCACAAGTGGTTCACCTTGTTTGAAGAT
ACAATGTCTGCTGCTGGCGGTAAACCACATTGGGCTAAGAACTTCTTGGGCTCTACCTCT
TTTGCTCAAGGTCAAGTTAAGGCTGAAGGTCAATACCAAGACTATGAGATGAGAGGTATG
GCTACAAGAGTCAAGGAATGGTATGGTTCAGACTTGGAAACCTTTAAGAAGGTTAGAAGA
GAACAAGACCCAGACAACATTTTCTTGGCAAACAAACAGTGGGCCTTGATCAACGGTATC
ATTGATGAAAACGAATAA

Predicted translation product

>CAGL0H04125g.aa
MDLKTFGGRRNFVFRNWAGIYSSRPEWYFQPSSVDEVVEIVKAAKLKNKTIVTVGSGHSP
SNMCVTDEWMMNLDKMNKLLDFVENEDKTYADVTIQGGTRLYKIHKILREKGYAMQSLGS
ISEQSIGGIISTGTHGSSPFHGLVSSTIVNLTVVNGKGEVLFLDEKSNPEVFRAATLSLG
KIGIIVGATVRVVPAFNIKSTQEVIKFETLLEKWDSLWTSSEFIRIWWYPYTRKCILWRG
VKTNEPQTKSRYSWWGSTLGRFFYQTLLFISTKIYPPLTPYVERFVFRRQYGEVETLGKG
DVAIEDSVTGFNMDCLFSQFVDEWGCPMDNGLEVLRSLDHSIAQAAANKDFYVHVPVEVR
CANTTLPKEQPETSFRSNTSRGPVYGNLLRPYLDNTPSQCSYAPIHSVTNSQLTLYINAT
IYRPFHTNAPIHKWFTLFEDTMSAAGGKPHWAKNFLGSTSFAQGQVKAEGQYQDYEMRGM
ATRVKEWYGSDLETFKKVRREQDPDNIFLANKQWALINGIIDENE*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites