CAGL0J02662g
similar to uniprot|P43610 Saccharomyces cerevisiae YFR038w IRC5 Putative ATPase containing the DEAD/H helicase-related sequence motif
Element type: CDS
Element length: 2535 nucleotides,
on sense strand of
Cagl0J: 264512..267046.
Other names:
CAGL-CDS0700.1
CAGL-IPF6535
Coding sequence: 845 codons.
Element length: 2535 nucleotides,
on sense strand of
Cagl0J: 264512..267046.
Other names:
CAGL-CDS0700.1
CAGL-IPF6535
Coding sequence: 845 codons.
Database cross references:
EMBL: CR380956
GeneID: 2889751
HOGENOM: Q6FPM4
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380956
GeneID: 2889751
HOGENOM: Q6FPM4
Homologs and Orthologs
Homologs in protein family: GL3M4588Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: cgr:CAGL0J02662g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_447820.1
SMART: SM00487
SMART: SM00490
UniprotKB: Q6FPM4_CANGA
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: cgr:CAGL0J02662g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_447820.1
SMART: SM00487
SMART: SM00490
UniprotKB: Q6FPM4_CANGA
Sequence data 
>CAGL0J02662g.nt ATGAATACAAGGAAGAGGTTTTTGAGGAGTGCCAGGAAGGCCGCTGCGGTTACTGTCAAC TATGCGGAACCTGACGATAATGATCTGAGTGCTAATACTTTTGCGAGTGATGATGAGAAT GCTGACGTCAGTGTAGACGCTGAGGATACGCCGCCTGTGTGGCTGCGAGATGATGTACGT CCGAATGAGGACGTTCAGCTCGACTCAGACGACGAAAACGGTGGTGATAAAGACGATGAT AGGGATCTTGACGATCTTTTACTGAAGGAAAAGGAAAAACACAGGCTGGAAGCACAGACT AAAGAGGAAGATAGTGATGACGAAGTAAATGAGTTGGATAAAGCCAGTGTCACGTCCAAA TTAAAGAAACTAGATGAGTTCGTAAAGCAAAGTCAGGTTTACTCAAGTATCATCGCTGAT ACACTGTTGAAGAGGACTTTAGAGAGAACAGAGGAATCTGATGCGTCAAATAATGACCCT GTGAAAGAACCTCCTGCGAAAAAGGCTAAAAAATCAAAATCTATACTAGATTTTTTTACT AGAAGACAAAATACAGATGACAATAGGGTTGAGGAAATGGCTGTTGATGTTAAGAAGGAA ACGGAGGAAGAGCGTATAGCCAAAGAGCAACCATCTTACTTGAAGAATTGTGTCCTAAAG CCTTACCAAATGGAAGGGCTGAATTGGTTAATTACACTTTATGAAAATGGGCTAAACGGT ATCCTAGCCGATGAGATGGGTTTGGGTAAGACTATTCAGAGTATCGCTTTATTGTCTTTC ATTTATGAAATGGACACAAAAGGCCCCTTCTTAATTGCTGCACCATTAAGCACCGTCGAT AATTGGATGAACGAATTCGCAAAATTCGCTCCAGAAATTCCGATTCTTAAATATTATAGT CAAAATGGACAAGATGCAAGACAGAAATTGTTGAAAAAATTCTTCAAAAATAATAATAGG GAAGGTGTTATTGTTACCTCATATGAGATGATAATTAGGGATGCAAATATAATAATGGGT GAACAATGGAAGTTTCTAATTGTAGATGAAGGTCATCGTTTGAAGAATATTAACTGTCGT TTGATACAGGAATTGAAGCGAATCAATACTTCTAACAGACTTTTACTAACAGGTACGCCC TTACAGAATAACTTGTCAGAGTTGTGGTCTCTATTGAACTTTATTCTGCCTGACATTTTT GCTGATTTTGAAATATTCAATAAATGGTTTGATTTCAAAGATCTTGATTTGCAAAGTAAT TCTGCTAAATTAAATAAGTTGATTAACGATGAACTAGAAAAAAATTTGATATCCAACCTG CATACAATCTTAAAACCATTTTTGTTAAGAAGATTGAAAAGTGTGGTCTTAAAGGACGTT CTTCCGCCAAAGAGAGAGTACATAGTCAACTGCCCTTTATCACCAATCCAAACAAAGTTT TACAGAATGGCTTTATCTGGAAAGCTTAAGGTGACTGTTTTCAAAGAACTCGTCAAAGCA TTTTTCACTCTGAATCAAGAGTACATAGGGACTGTTTCAAATAAATCCATTCGAGACTTC ATTGATTATAAATTAAGTGAAGAGCCAGATGAAGACAAAGTAACCGCCGTTATCAAGCAA ATGGATGACATATATATGGAACACCTAAACACGTTTACCAAGAATCAGAGACTACAAAAT ATGATGATGCAGCTACGTCAAGTTGTAGACTCTACTCTACTATTTTTCTTCCCATACATG GAACCTGAGGACATCACATTGGATTATCTGCTTGCATCATCTGGAAAACTACAAATGTTG CAGAAATTGGCAATACCTCTAATAAAAAAAGGTCACAAAATATTGATATTCTCTCAGTTT GTTGGTATGTTAGATTTATTGGAGGATTGGTCTGAACTAAACTCCTTCAATTCATTGAGA ATTGATGGTGGTGTTGATAACGAATCAAGGAAAGAATATATCGATGAATTTAACAAAAAA GGTGACGACCATCAAATTTTCTTGCTTTCGACAAGAGCAGCTGGTCTTGGTATTAACCTT GTAGCTGCAGATACTGTGATTATTTTTGATAGTGATTGGAACCCTCAGGTTGATCTCCAA GCAATGGATAGATGTCACAGAATTGGCCAGACAAAACCAGTAATAGTATACAGATTTTGC TGTGACAATACTATTGAGCACGTCATACTAACCAGAGCTGTTAATAAGAGAAAGTTAGAA CGAATGGTTATTCAAATGGGTAAGTTCAGTAATTTGAAGAAGTTAGCCTTGAATGAGAGA TCTTTCCTCCAACAAAGCACTGGTATGAATCCAAACAAGACCAGCAATAAAGAACTTGTA CAGGAACTATCACAGCTTCTAATGAGTAAAGAATCAAGTATTGGATTCGAGACGTCTAAA AAACCCAAACAAGATGATATACTCACTGAAGCTGAATTAAAGGAGTTATCAGATAGATCG CTCAAATTTTACTCACCAGATAGAGAAGTCGAGTTCCCTCATGTAAGGCTATTTGAGACA ACATCTGGATTTTAA
>CAGL0J02662g.cds ATGAATACAAGGAAGAGGTTTTTGAGGAGTGCCAGGAAGGCCGCTGCGGTTACTGTCAAC TATGCGGAACCTGACGATAATGATCTGAGTGCTAATACTTTTGCGAGTGATGATGAGAAT GCTGACGTCAGTGTAGACGCTGAGGATACGCCGCCTGTGTGGCTGCGAGATGATGTACGT CCGAATGAGGACGTTCAGCTCGACTCAGACGACGAAAACGGTGGTGATAAAGACGATGAT AGGGATCTTGACGATCTTTTACTGAAGGAAAAGGAAAAACACAGGCTGGAAGCACAGACT AAAGAGGAAGATAGTGATGACGAAGTAAATGAGTTGGATAAAGCCAGTGTCACGTCCAAA TTAAAGAAACTAGATGAGTTCGTAAAGCAAAGTCAGGTTTACTCAAGTATCATCGCTGAT ACACTGTTGAAGAGGACTTTAGAGAGAACAGAGGAATCTGATGCGTCAAATAATGACCCT GTGAAAGAACCTCCTGCGAAAAAGGCTAAAAAATCAAAATCTATACTAGATTTTTTTACT AGAAGACAAAATACAGATGACAATAGGGTTGAGGAAATGGCTGTTGATGTTAAGAAGGAA ACGGAGGAAGAGCGTATAGCCAAAGAGCAACCATCTTACTTGAAGAATTGTGTCCTAAAG CCTTACCAAATGGAAGGGCTGAATTGGTTAATTACACTTTATGAAAATGGGCTAAACGGT ATCCTAGCCGATGAGATGGGTTTGGGTAAGACTATTCAGAGTATCGCTTTATTGTCTTTC ATTTATGAAATGGACACAAAAGGCCCCTTCTTAATTGCTGCACCATTAAGCACCGTCGAT AATTGGATGAACGAATTCGCAAAATTCGCTCCAGAAATTCCGATTCTTAAATATTATAGT CAAAATGGACAAGATGCAAGACAGAAATTGTTGAAAAAATTCTTCAAAAATAATAATAGG GAAGGTGTTATTGTTACCTCATATGAGATGATAATTAGGGATGCAAATATAATAATGGGT GAACAATGGAAGTTTCTAATTGTAGATGAAGGTCATCGTTTGAAGAATATTAACTGTCGT TTGATACAGGAATTGAAGCGAATCAATACTTCTAACAGACTTTTACTAACAGGTACGCCC TTACAGAATAACTTGTCAGAGTTGTGGTCTCTATTGAACTTTATTCTGCCTGACATTTTT GCTGATTTTGAAATATTCAATAAATGGTTTGATTTCAAAGATCTTGATTTGCAAAGTAAT TCTGCTAAATTAAATAAGTTGATTAACGATGAACTAGAAAAAAATTTGATATCCAACCTG CATACAATCTTAAAACCATTTTTGTTAAGAAGATTGAAAAGTGTGGTCTTAAAGGACGTT CTTCCGCCAAAGAGAGAGTACATAGTCAACTGCCCTTTATCACCAATCCAAACAAAGTTT TACAGAATGGCTTTATCTGGAAAGCTTAAGGTGACTGTTTTCAAAGAACTCGTCAAAGCA TTTTTCACTCTGAATCAAGAGTACATAGGGACTGTTTCAAATAAATCCATTCGAGACTTC ATTGATTATAAATTAAGTGAAGAGCCAGATGAAGACAAAGTAACCGCCGTTATCAAGCAA ATGGATGACATATATATGGAACACCTAAACACGTTTACCAAGAATCAGAGACTACAAAAT ATGATGATGCAGCTACGTCAAGTTGTAGACTCTACTCTACTATTTTTCTTCCCATACATG GAACCTGAGGACATCACATTGGATTATCTGCTTGCATCATCTGGAAAACTACAAATGTTG CAGAAATTGGCAATACCTCTAATAAAAAAAGGTCACAAAATATTGATATTCTCTCAGTTT GTTGGTATGTTAGATTTATTGGAGGATTGGTCTGAACTAAACTCCTTCAATTCATTGAGA ATTGATGGTGGTGTTGATAACGAATCAAGGAAAGAATATATCGATGAATTTAACAAAAAA GGTGACGACCATCAAATTTTCTTGCTTTCGACAAGAGCAGCTGGTCTTGGTATTAACCTT GTAGCTGCAGATACTGTGATTATTTTTGATAGTGATTGGAACCCTCAGGTTGATCTCCAA GCAATGGATAGATGTCACAGAATTGGCCAGACAAAACCAGTAATAGTATACAGATTTTGC TGTGACAATACTATTGAGCACGTCATACTAACCAGAGCTGTTAATAAGAGAAAGTTAGAA CGAATGGTTATTCAAATGGGTAAGTTCAGTAATTTGAAGAAGTTAGCCTTGAATGAGAGA TCTTTCCTCCAACAAAGCACTGGTATGAATCCAAACAAGACCAGCAATAAAGAACTTGTA CAGGAACTATCACAGCTTCTAATGAGTAAAGAATCAAGTATTGGATTCGAGACGTCTAAA AAACCCAAACAAGATGATATACTCACTGAAGCTGAATTAAAGGAGTTATCAGATAGATCG CTCAAATTTTACTCACCAGATAGAGAAGTCGAGTTCCCTCATGTAAGGCTATTTGAGACA ACATCTGGATTTTAA
>CAGL0J02662g.aa MNTRKRFLRSARKAAAVTVNYAEPDDNDLSANTFASDDENADVSVDAEDTPPVWLRDDVR PNEDVQLDSDDENGGDKDDDRDLDDLLLKEKEKHRLEAQTKEEDSDDEVNELDKASVTSK LKKLDEFVKQSQVYSSIIADTLLKRTLERTEESDASNNDPVKEPPAKKAKKSKSILDFFT RRQNTDDNRVEEMAVDVKKETEEERIAKEQPSYLKNCVLKPYQMEGLNWLITLYENGLNG ILADEMGLGKTIQSIALLSFIYEMDTKGPFLIAAPLSTVDNWMNEFAKFAPEIPILKYYS QNGQDARQKLLKKFFKNNNREGVIVTSYEMIIRDANIIMGEQWKFLIVDEGHRLKNINCR LIQELKRINTSNRLLLTGTPLQNNLSELWSLLNFILPDIFADFEIFNKWFDFKDLDLQSN SAKLNKLINDELEKNLISNLHTILKPFLLRRLKSVVLKDVLPPKREYIVNCPLSPIQTKF YRMALSGKLKVTVFKELVKAFFTLNQEYIGTVSNKSIRDFIDYKLSEEPDEDKVTAVIKQ MDDIYMEHLNTFTKNQRLQNMMMQLRQVVDSTLLFFFPYMEPEDITLDYLLASSGKLQML QKLAIPLIKKGHKILIFSQFVGMLDLLEDWSELNSFNSLRIDGGVDNESRKEYIDEFNKK GDDHQIFLLSTRAAGLGINLVAADTVIIFDSDWNPQVDLQAMDRCHRIGQTKPVIVYRFC CDNTIEHVILTRAVNKRKLERMVIQMGKFSNLKKLALNERSFLQQSTGMNPNKTSNKELV QELSQLLMSKESSIGFETSKKPKQDDILTEAELKELSDRSLKFYSPDREVEFPHVRLFET TSGF*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0J02662g