Element type: CDS
Element length: 2034 nucleotides,
on anti-sense strand of
Ergo0F: complement(1697786..1699819).
Other names:
AFR683C
AGOS_AFR683C
Coding sequence: 678 codons.
Element length: 2034 nucleotides,
on anti-sense strand of
Ergo0F: complement(1697786..1699819).
Other names:
AFR683C
AGOS_AFR683C
Coding sequence: 678 codons.
Database cross references:
EMBL: AE016819
GeneID: 4622520
GenomeReviews: AE016819_GR
HOGENOM: HBG758042
NMPDR: fig|33169.1.peg.3876
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: AE016819
GeneID: 4622520
GenomeReviews: AE016819_GR
HOGENOM: HBG758042
NMPDR: fig|33169.1.peg.3876
Homologs and Orthologs
Homologs in protein family: GL3C0060Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
AGD: AFR683C
InterPro: IPR001140
InterPro: IPR003439
InterPro: IPR003593
InterPro: IPR011527
InterPro: IPR017871
InterPro: IPR017940
KEGG: ago:AGOS_AFR683C
PROSITE: PS00211
PROSITE: PS50893
PROSITE: PS50929
Pfam: PF00005
Pfam: PF00664
RefSeq: NP_986231.1
SMART: SM00382
UniProtKB/TrEMBL: Q751Z2
UniProtKB: Q751Z2_ASHGO
Phylogeny
PhylomeDB:Q751Z2
AGD: AFR683C
InterPro: IPR001140
InterPro: IPR003439
InterPro: IPR003593
InterPro: IPR011527
InterPro: IPR017871
InterPro: IPR017940
KEGG: ago:AGOS_AFR683C
PROSITE: PS00211
PROSITE: PS50893
PROSITE: PS50929
Pfam: PF00005
Pfam: PF00664
RefSeq: NP_986231.1
SMART: SM00382
UniProtKB/TrEMBL: Q751Z2
UniProtKB: Q751Z2_ASHGO
Phylogeny 
PhylomeDB:Q751Z2Sequence data 
>ERGO0F21802g.nt ATGACCTCGAGGAGAACATATATGTGGCTATCGCTAGCTGTTAAGCAGGTGCCGAGGCCT GTTATTGGGATGCAGCGATGGCCGCCGTTCAATCATCTACATCAACGCACCCGCTTTTCG TTCAGCGCATCGACAAGAATTCAAACGAGGCTAAACTCAACAGCAAACCCTCGGGCAAGT ACGGAAGAGAAACCCCACCAAGAAAGACTCGAGCTGTCGACGGCGACGGGCAACGCATCA GGTGCCAAGGATGTGCGTAGACTGTTCCAATTGGCGCGGCCTGAGCTCAAGTCTCTAGTA TGCGCCTTGGTGCTGATCGTGATATCCGGAGCCGTGAGCATGACCATCCCCAGTGTTATC GGGAAACTGCTAGATGTTGCACGAGAGGACGTGGAGCAGGAGAGCGGGGATGAGGAGGAG GCGGTACTGTATGGGCTCCCCGAGAAACAGTTCTACTTGGCATTGAGCGGGATCTTCGCC GTGGGCGCACTTGCGAACATGGGCCGCATTGTAGTACTTAAAACGACTGGTGAGCGCCTG GTTGCCCGGCTGCGGACTCGGACAATGAAGGCCGCGCTAGAGCAGGAAGGGGCCTTTCTT GACAGCAACCGTGTGGGCGATCTGATCTCGCGACTTTCTTCAGATGCCAGCATTGTATCG AAGTCTATCACACAAAATACATCGGACGGTGCACGCGCCGTTATCCAGGGAGCTGTGGGA TTTGGAATGATGAGCTACATATCCTGGCAGTTGACTGCGGTTATGACGCTGCTGGCACCT CCGCTAGTGCTGATGGCTGCCTTCTATGGGCGGCGCGTGCGAAATTTGTCTCGAGAACTA CAGACAAAGGTCGGCGGGCTGACTAAGGTGGCAGAGGAACAGCTAAATGCTACGCGGACG GTTCAGGCCTACTGTGGTGAGCGCCGTGAGATCCGTCGTTACGCTACTGAGGTTAGGAAT GTGTTTGATGTCGGGCTCAAGGAGGCGCTGATATCTGGCTCCTTCTTCGGAGCAACAGGC TTTGTCGGTAATGCGACACTGTTGGCGCTGCTACTTACCGGTACCTCTATGATCAAGGGC GGGGGGATCTCTGTAGGCGAACTTTCAAGCTTCATGATGTACGCTGTCTACACCGGCAGC TCGTTGTTCAATCTGTCGTCATTCTATTCGGAGCTTATGAAGGGTGCAGGCGCAGCTGTG CGTGTCTTCGAGTTGAACGACCGCAAACCGCTTATCCATCCAACCATTGGTAAGGATCCC GTATCACTGACGGGCAAGACAATTGCTTTCAATAGCGTGAACTTCGCATACCCTACCAGA CCGCACCATCAGGTTTTCGCAGGCATGGACCTCAGCATCTGCCCAGGGGAACATGTCTGT ATTGTTGGACCGTCGGGAGGCGGCAAATCTACGGTGGCATCGCTGCTTCTGCGTTTCTAC GATCCCATCAGCGGGTCCATAACCATTGGCGGTGAAGACATCCGCTTATTCAACCTGAGC AAATACCGCCGCATGATGGGTATTGTTCAGCAGGAACCCGTCCTTTTCAACGCGAGCATC CTCGAGAACATCACATACGCATTACCCTTACACCTGACGAAAGATCCCGCTCGCATAGAC CGTGCGCTGCGGCTTTCAAACTGCTCGGCTTTTGTTGGAAGCTTCCCTGAGGGCCTTCAG ACCGCTGTAGGCCCTCGAGGTACCCAGCTATCGGGCGGGCAAAAGCAGCGAGTCGCACTA GCCCGCGCCTTCCTTCAGGATCCTGCGATCCTGATCCTAGACGAAGCTACGAGCGCGCTC GACTCCAAGAGCGAGGATATCGTTGCGAGCACTCTACTTCAGCGCTGCCAAGAGGCCAAG ATTACTATTTCTATCGCCCATAGGAAGAGCACCATCCAGCATAGCACCAGAGTCATAGTG CTGGATAAGCTTGGCCATGTTTTGGAGACAGGCACCTACCAACAACTAATTGGTGACCCC GGCTCCAGCCTCAGCGGCCTCCTATCGAAGGAACATAGCTCCGACGCTGAATGA
>ERGO0F21802g.cds ATGACCTCGAGGAGAACATATATGTGGCTATCGCTAGCTGTTAAGCAGGTGCCGAGGCCT GTTATTGGGATGCAGCGATGGCCGCCGTTCAATCATCTACATCAACGCACCCGCTTTTCG TTCAGCGCATCGACAAGAATTCAAACGAGGCTAAACTCAACAGCAAACCCTCGGGCAAGT ACGGAAGAGAAACCCCACCAAGAAAGACTCGAGCTGTCGACGGCGACGGGCAACGCATCA GGTGCCAAGGATGTGCGTAGACTGTTCCAATTGGCGCGGCCTGAGCTCAAGTCTCTAGTA TGCGCCTTGGTGCTGATCGTGATATCCGGAGCCGTGAGCATGACCATCCCCAGTGTTATC GGGAAACTGCTAGATGTTGCACGAGAGGACGTGGAGCAGGAGAGCGGGGATGAGGAGGAG GCGGTACTGTATGGGCTCCCCGAGAAACAGTTCTACTTGGCATTGAGCGGGATCTTCGCC GTGGGCGCACTTGCGAACATGGGCCGCATTGTAGTACTTAAAACGACTGGTGAGCGCCTG GTTGCCCGGCTGCGGACTCGGACAATGAAGGCCGCGCTAGAGCAGGAAGGGGCCTTTCTT GACAGCAACCGTGTGGGCGATCTGATCTCGCGACTTTCTTCAGATGCCAGCATTGTATCG AAGTCTATCACACAAAATACATCGGACGGTGCACGCGCCGTTATCCAGGGAGCTGTGGGA TTTGGAATGATGAGCTACATATCCTGGCAGTTGACTGCGGTTATGACGCTGCTGGCACCT CCGCTAGTGCTGATGGCTGCCTTCTATGGGCGGCGCGTGCGAAATTTGTCTCGAGAACTA CAGACAAAGGTCGGCGGGCTGACTAAGGTGGCAGAGGAACAGCTAAATGCTACGCGGACG GTTCAGGCCTACTGTGGTGAGCGCCGTGAGATCCGTCGTTACGCTACTGAGGTTAGGAAT GTGTTTGATGTCGGGCTCAAGGAGGCGCTGATATCTGGCTCCTTCTTCGGAGCAACAGGC TTTGTCGGTAATGCGACACTGTTGGCGCTGCTACTTACCGGTACCTCTATGATCAAGGGC GGGGGGATCTCTGTAGGCGAACTTTCAAGCTTCATGATGTACGCTGTCTACACCGGCAGC TCGTTGTTCAATCTGTCGTCATTCTATTCGGAGCTTATGAAGGGTGCAGGCGCAGCTGTG CGTGTCTTCGAGTTGAACGACCGCAAACCGCTTATCCATCCAACCATTGGTAAGGATCCC GTATCACTGACGGGCAAGACAATTGCTTTCAATAGCGTGAACTTCGCATACCCTACCAGA CCGCACCATCAGGTTTTCGCAGGCATGGACCTCAGCATCTGCCCAGGGGAACATGTCTGT ATTGTTGGACCGTCGGGAGGCGGCAAATCTACGGTGGCATCGCTGCTTCTGCGTTTCTAC GATCCCATCAGCGGGTCCATAACCATTGGCGGTGAAGACATCCGCTTATTCAACCTGAGC AAATACCGCCGCATGATGGGTATTGTTCAGCAGGAACCCGTCCTTTTCAACGCGAGCATC CTCGAGAACATCACATACGCATTACCCTTACACCTGACGAAAGATCCCGCTCGCATAGAC CGTGCGCTGCGGCTTTCAAACTGCTCGGCTTTTGTTGGAAGCTTCCCTGAGGGCCTTCAG ACCGCTGTAGGCCCTCGAGGTACCCAGCTATCGGGCGGGCAAAAGCAGCGAGTCGCACTA GCCCGCGCCTTCCTTCAGGATCCTGCGATCCTGATCCTAGACGAAGCTACGAGCGCGCTC GACTCCAAGAGCGAGGATATCGTTGCGAGCACTCTACTTCAGCGCTGCCAAGAGGCCAAG ATTACTATTTCTATCGCCCATAGGAAGAGCACCATCCAGCATAGCACCAGAGTCATAGTG CTGGATAAGCTTGGCCATGTTTTGGAGACAGGCACCTACCAACAACTAATTGGTGACCCC GGCTCCAGCCTCAGCGGCCTCCTATCGAAGGAACATAGCTCCGACGCTGAATGA
>ERGO0F21802g.aa MTSRRTYMWLSLAVKQVPRPVIGMQRWPPFNHLHQRTRFSFSASTRIQTRLNSTANPRAS TEEKPHQERLELSTATGNASGAKDVRRLFQLARPELKSLVCALVLIVISGAVSMTIPSVI GKLLDVAREDVEQESGDEEEAVLYGLPEKQFYLALSGIFAVGALANMGRIVVLKTTGERL VARLRTRTMKAALEQEGAFLDSNRVGDLISRLSSDASIVSKSITQNTSDGARAVIQGAVG FGMMSYISWQLTAVMTLLAPPLVLMAAFYGRRVRNLSRELQTKVGGLTKVAEEQLNATRT VQAYCGERREIRRYATEVRNVFDVGLKEALISGSFFGATGFVGNATLLALLLTGTSMIKG GGISVGELSSFMMYAVYTGSSLFNLSSFYSELMKGAGAAVRVFELNDRKPLIHPTIGKDP VSLTGKTIAFNSVNFAYPTRPHHQVFAGMDLSICPGEHVCIVGPSGGGKSTVASLLLRFY DPISGSITIGGEDIRLFNLSKYRRMMGIVQQEPVLFNASILENITYALPLHLTKDPARID RALRLSNCSAFVGSFPEGLQTAVGPRGTQLSGGQKQRVALARAFLQDPAILILDEATSAL DSKSEDIVASTLLQRCQEAKITISIAHRKSTIQHSTRVIVLDKLGHVLETGTYQQLIGDP GSSLSGLLSKEHSSDAE*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://192.168.122.177/elt/ERGO/ERGO0F21802g