Protein

UniProt accession
E3SP17 [UniProt]
Protein name
WLM domain-containing protein
PhaLP type
VAL

evidence: ML prediction

probability: 71 % (predicted by ML model)

Protein sequence
MAIKKNSKINFYKFVQVKVPNVSAKTKMELKPIISNTVAINNLGATVNSIAVIVKDFKKIQLEKLVLSRKSLKDFEANYTKTKKQKPFSGFSPAALVKKKSWLEGLFKMLSGLIKAAIVIPALKWLSDPANRDKVARFIEVLSKLATFIFKVSKFGVVNTIEGLYTLMSDASSPWEKIGGLVQGLTGLGTLLLGLRWLSNPTRIITDFGNVLIFLHNNLIRGRRGLIGRAGALGLIAATAYGGYKLYEYLKEDGTGGRPDPNDDGSQAAANISIGDSGNVTVDKDGNLSGAVDFTLGETVNIQLNMDTDEDDTKSKGNWFTNLFKSTGGVLPSFAKGGWISGPQSGYGVSLDGGRSTSFIGHGTEYVARKSDGGAFIVPFDTPGTKTQPNLTDKRISEARSLGFDLDGFSNGGTLPKGMLLSQKAAFDHVYNLAKLVGGAKYPEIVAAQAMHESNYLDPRTNSVYNATNRTNAFGQTGDRGFGTIIRKGFKVGWSKYDNLSRAVGDNIKLWHDVANNKENYNAFGNILDGIAAVAPAYSPNADPENIKKGYTTDAYSKGMIRALKVGGFDISGMKDKTPKSSSNSGSGRRPTGNIFSNFMGGVKSFFGFGDKPESNNKKTKTQNKVNAVKPASHPDTGSGFTVGGTRDQSGRPLVFSQPAAQMFAAAMRDSGIDLASFIASTGRSKSKNTEIGGDPNSHHLYGEALDINGEGYQWLKANGRRFGWQYGYNHNPDSAHFKYVGAKAGTTPILSEPGKDYAGGNSLHGHIGEGAREGGRRDGGKGITDADLTAKKIGMGNLFGNQGGGGQNKSMFPGNDRSGQFQQTRQQKRLENQTKERNNARRQISERSQEMIKEVMAAVAQQNGVNSQAIQAANTALAAVAGQTGGGQPQMIPSGSGGVGSIASTLQSTLNPLRGLLR
Physico-chemical properties
protein length: 919 AA
molecular weight: 98284.00000 Da
isoelectric point: 9.77149
aromaticity: 0.08270
hydropathy: -0.43319
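The record does not say how these values were derived. As a rough sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), both can be recomputed from the protein sequence. The function names below are illustrative and are not part of the database.

```python
# Minimal sketch: recompute two sequence-derived properties.
# Assumption: aromaticity = fraction of F/W/Y residues,
# hydropathy = mean Kyte-Doolittle value (GRAVY).

# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Passing the full 919-residue sequence above to `aromaticity` and `gravy` should give values comparable to the listed ones if the database uses the same definitions. A non-standard residue code raises `KeyError` in `gravy`.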

Domains

Domains [InterPro]
Protein sequence: E3SP17 (residues 1 to 919)
Legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other

Taxonomy

Phage
Name: Prochlorococcus phage P-SSM7 [NCBI]
Taxonomy ID: 445688
Lineage: Kyanoviridae > Palaemonvirus > Palaemonvirus pssm7

Host
Name: Prochlorococcus marinus str. NATL1A [NCBI]
Taxonomy ID: 167555
Lineage: Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus >

Coding sequence (CDS)
Genbank protein accession
ADO99091.1 [NCBI]
Genbank nucleotide accession
GU071103 [NCBI]
CDS location
range 9011 -> 11770
strand +
CDS
ATGGCGATAAAAAAGAATTCAAAAATTAATTTCTATAAATTTGTACAGGTAAAGGTACCTAATGTTAGTGCAAAAACGAAAATGGAGTTGAAACCCATTATTTCGAATACTGTCGCTATCAATAATTTGGGTGCAACTGTAAATTCTATTGCAGTAATCGTTAAAGATTTTAAAAAGATTCAACTTGAAAAGTTAGTACTTTCAAGGAAATCTCTAAAGGATTTTGAAGCAAACTATACAAAAACAAAAAAGCAGAAACCTTTCTCTGGATTTAGTCCTGCTGCTTTAGTAAAGAAAAAAAGTTGGTTAGAAGGTCTATTCAAGATGTTGAGTGGACTAATTAAAGCAGCGATTGTTATTCCTGCTTTAAAATGGTTGTCTGATCCTGCGAATAGAGACAAAGTAGCAAGGTTTATCGAAGTCCTTTCTAAGTTGGCAACGTTCATCTTTAAAGTCTCAAAATTTGGTGTTGTTAATACTATTGAGGGTTTATATACTCTCATGTCAGATGCCTCTTCGCCTTGGGAAAAAATAGGAGGACTTGTACAAGGATTAACTGGACTTGGAACTTTATTATTAGGTCTTCGTTGGTTGAGTAATCCAACTAGAATCATTACAGATTTTGGTAATGTACTCATCTTCCTTCATAATAATTTAATTAGAGGTAGAAGAGGATTAATAGGTAGAGCTGGAGCACTTGGATTAATTGCAGCAACTGCTTATGGAGGATATAAACTTTATGAATATCTAAAGGAAGATGGAACAGGTGGAAGACCAGATCCAAATGATGATGGATCTCAAGCAGCAGCAAACATATCCATAGGTGATAGCGGAAATGTAACCGTTGATAAGGATGGCAATCTCTCAGGTGCAGTAGACTTTACCTTAGGTGAGACTGTTAATATCCAACTAAATATGGATACTGATGAAGATGATACTAAATCTAAAGGTAATTGGTTTACTAACCTATTCAAATCTACTGGCGGTGTATTACCTTCATTTGCAAAAGGAGGATGGATTAGTGGACCACAATCAGGATATGGAGTTTCATTGGATGGAGGGAGATCCACTTCGTTCATCGGACATGGAACTGAGTACGTCGCTAGAAAGAGCGATGGGGGAGCTTTCATCGTTCCTTTTGATACTCCTGGAACAAAAACACAACCAAACCTAACAGATAAGAGGATAAGTGAGGCTAGGAGTCTGGGATTTGACTTAGATGGTTTTAGTAATGGAGGAACTCTACCAAAAGGTATGTTGCTTTCACAAAAAGCTGCATTCGATCATGTGTATAATCTAGCAAAACTAGTAGGAGGAGCAAAATACCCCGAAATTGTTGCTGCTCAGGCAATGCACGAATCAAACTACCTAGATCCTAGGACTAATAGTGTTTATAATGCCACAAATAGAACTAATGCTTTCGGTCAAACTGGTGACAGAGGATTTGGTACTATTATTAGAAAGGGTTTTAAAGTAGGTTGGTCAAAATATGATAATCTATCCCGTGCAGTAGGCGATAATATTAAACTTTGGCATGATGTGGCTAATAATAAGGAAAACTATAATGCTTTTGGTAATATTTTAGATGGTATTGCTGCAGTTGCTCCAGCATATTCACCTAATGCAGATCCTGAAAATATTAAAAAAGGATATACTACTGATGCTTATAGTAAAGGAATGATTAGAGCATTAAAAGTTGGTGGATTTGATATTAGTGGTATGAAAGATAAAACTCCTAAGTCCTCATCTAATAGTGGTTCGGGAAGGAGACCAACTGGTAACATCTTCTCAAACTTTATGGGTGGTGTTAAGAGTTTCTTTGGATTTGGTGATAAACCAGAGAGTAATAATAAGAAGACAAAAACACAAAATAAAGTAAATGCTGTCAAACCTGCATCACATCCTGATACAGGTTCTGGATTTACTGTTGGAGGAACGAGGGATCAAAGTGGTAGACCTTTAGTATTCTCACAACCAGCAGCACAAATGTTTGCTGCAGC
AATGAGAGATTCTGGAATCGATCTAGCATCATTTATTGCAAGTACTGGTAGAAGTAAATCTAAAAATACTGAAATTGGTGGAGATCCTAATTCACATCATCTATATGGTGAAGCACTTGATATTAATGGTGAAGGATATCAATGGTTGAAAGCAAATGGTAGACGTTTTGGTTGGCAATATGGTTACAACCATAATCCTGACAGTGCTCACTTTAAGTATGTTGGTGCTAAAGCAGGTACTACACCAATATTATCAGAACCTGGTAAGGATTATGCTGGTGGTAATAGTCTTCATGGGCATATAGGTGAAGGTGCTCGTGAGGGTGGTCGTAGAGATGGTGGTAAAGGGATAACAGATGCGGATTTAACAGCAAAGAAAATTGGAATGGGAAATCTATTTGGAAACCAAGGGGGTGGTGGACAGAATAAAAGTATGTTCCCAGGCAATGACAGGTCAGGTCAATTCCAACAAACAAGGCAGCAGAAAAGATTGGAGAACCAAACAAAAGAAAGAAATAATGCACGTCGTCAGATAAGTGAGAGAAGTCAAGAAATGATTAAGGAAGTTATGGCAGCAGTTGCTCAACAAAATGGGGTAAATAGTCAAGCGATCCAAGCAGCAAATACAGCACTAGCAGCAGTGGCAGGACAGACTGGTGGTGGTCAACCACAAATGATTCCAAGTGGTTCAGGTGGAGTAGGATCTATTGCATCTACTTTACAATCTACCCTTAATCCTTTGAGAGGTTTATTAAGATGA
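
The CDS coordinates can be checked against the protein length. This is a small sketch, assuming the range is 1-based and inclusive (the GenBank convention) and that the final codon is a stop codon, as the trailing TGA in the sequence above suggests. The helper names are illustrative.

```python
# Minimal sketch: check CDS coordinates against the protein length.
# Assumption: the range is 1-based and inclusive, and the CDS ends
# with a single stop codon that is not translated.

def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive 1-based range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the CDS, excluding the stop codon."""
    n_nt = cds_length(start, end)
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_nt // 3 - 1


# Range 9011 -> 11770 spans 2760 nt, i.e. 920 codons including the stop.
print(expected_protein_length(9011, 11770))  # 919
```

The result matches the 919 AA protein length listed under the physico-chemical properties.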

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.