Protein

Protein accession: 2wOIs [EnVhog]
Representative: 3e99j
Source: EnVhog (cluster: phalp2_22950)
Protein name: 2wOIs
Lysin probability: 93%
PhaLP type: endolysin (probability: 97%, predicted by ML model)
Protein sequence
MYLQTDKRWSGLDMDSVPFKIGRWGCTVTSVTNGLLALKMGLLNPKQVSDKLSFTNQGLLLWGSTKNVGLDMVSYSRYFNKDRAQKALKSPIEFCIVELDHWHWALVWSLKAGVIIYDPLFGIRPMWPKYKNITKTVILRKI
Physico-chemical properties
protein length: 142 AA
molecular weight: 16352.1 Da
isoelectric point: 9.79
hydropathy: -0.07
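The length and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes that "hydropathy" means the GRAVY score (the mean Kyte-Doolittle hydropathy per residue). The page does not name the scale it uses, so this is an illustration rather than the database's own method. The function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy index per residue. Assumption: the page does
# not state which scale its "hydropathy" value is based on.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 2wOIs, copied from the record above.
SEQUENCE = (
    "MYLQTDKRWSGLDMDSVPFKIGRWGCTVTS"
    "VTNGLLALKMGLLNPKQVSDKLSFTNQGLL"
    "LWGSTKNVGLDMVSYSRYFNKDRAQKALKS"
    "PIEFCIVELDHWHWALVWSLKAGVIIYDPL"
    "FGIRPMWPKYKNITKTVILRKI"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQUENCE)} AA")
    print(f"GRAVY (Kyte-Doolittle, assumed scale): {gravy(SEQUENCE):.2f}")
```

A residue outside the 20 standard amino acids raises a `KeyError`, which is deliberate in a sketch like this: it flags ambiguous codes instead of silently skipping them.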
Representative Protein Details
Accession: 3e99j
Protein name: 3e99j
Sequence length: 154 AA
Molecular weight: 17993.64700 Da
Isoelectric point: 9.75995
Sequence
MKRWSQRDSRWNNTYFNGAIKSPKTTVGTIGCAVTSVCMVHSKFYPRNPITPLEAAKTWKFTRNGLLIWGETDFPGMKFVVRKTDYNKDMIKEYAKDKNKGVIVEVNRNHWCAVWGWSIFGPVLFDPWDGKIYWKVPKKYRISGFALFSNLEEI
Other Proteins in cluster: phalp2_22950
Total (incl. this protein): 15; Avg length: 179.7 AA; Avg pI: 8.54

Protein ID  Length (AA)  pI
3e99j       154          9.75995
175yu       200          5.01982
1Lg3P       162          9.12152
1dsId       200          5.03903
2L0nO       188          8.76488
2lhfG       184          9.42014
3eip2       199          8.55233
3lwit       164          9.65564
4fPWm       198          8.73561
4gVU1       158          9.55733
4iBEE       242          6.34513
4iM7s       153          9.89656
8efuR       193          8.80214
8taVp       159          9.59169
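The summary line can be cross-checked against the table. The table lists the 14 other members, and the total of 15 includes this protein (2wOIs, 142 AA, pI 9.79, as given under the physico-chemical properties). The sketch below assumes the averages are plain arithmetic means over all 15 members. The variable and function names are illustrative.

```python
# (protein ID, length in AA, pI) transcribed from the cluster table,
# plus this protein (2wOIs), which the table itself does not list.
MEMBERS = [
    ("2wOIs", 142, 9.79),
    ("3e99j", 154, 9.75995),
    ("175yu", 200, 5.01982),
    ("1Lg3P", 162, 9.12152),
    ("1dsId", 200, 5.03903),
    ("2L0nO", 188, 8.76488),
    ("2lhfG", 184, 9.42014),
    ("3eip2", 199, 8.55233),
    ("3lwit", 164, 9.65564),
    ("4fPWm", 198, 8.73561),
    ("4gVU1", 158, 9.55733),
    ("4iBEE", 242, 6.34513),
    ("4iM7s", 153, 9.89656),
    ("8efuR", 193, 8.80214),
    ("8taVp", 159, 9.59169),
]


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


avg_length = mean([length for _, length, _ in MEMBERS])
avg_pi = mean([pi for _, _, pi in MEMBERS])

# Prints: 15 179.7 8.54
print(len(MEMBERS), round(avg_length, 1), round(avg_pi, 2))
```

Under that assumption the recomputed values agree with the reported averages of 179.7 AA and 8.54.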
Similar Clusters (pHMM search)
#   Cluster                # Members  Identity (%)  Alignment Length  E-value
1   phalp2_13076 (8dNG7)   11         33.5%         158               1.143E-27
2   phalp2_33004 (4hV5l)   1          27.3%         146               5.513E-27
3   phalp2_28438 (1KsOK)   2          22.7%         158               2.978E-21
4   phalp2_14350 (4eunE)   39         27.7%         162               4.076E-21
5   phalp2_39926 (1nFGg)   52         37.2%         110               5.015E-20
6   phalp2_26281 (XEqg)    8          32.6%         104               4.038E-18
7   phalp2_26426 (1Ly8S)   4          28.0%         146               1.264E-16
8   phalp2_10611 (2e250)   1          27.8%         122               1.876E-14
9   phalp2_28654 (3nomK)   21         29.0%         162               1.474E-12
10  phalp2_21823 (4iHEp)   1          22.9%         161               3.749E-12

Domains

Unannotated
Representative sequence (used for alignment): 3e99j (154 AA)
Member sequence: 2wOIs (142 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis (1 to 154 AA).
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
  Name: Unknown from Metagenome
  Taxonomy ID: UNKNOWN_ENVHOG
  Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 2wOIs
Method: AlphaFoldv2
Resolution: 96.39
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
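The confidence bands above map a pLDDT score to a label. A minimal sketch of that mapping follows. The function name `plddt_band` is hypothetical. Because the legend uses strict inequalities, assigning scores of exactly 90, 70, or 50 to the lower band is an assumption. Reading the "Resolution" value reported for an AlphaFold model as a mean pLDDT is also an assumption, not something the page states.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend only gives strict inequalities.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # 96.39 is the value listed under "Resolution" for this model.
    print(plddt_band(96.39))
```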

The structures below correspond to the cluster representative (3e99j) rather than this protein.
PDB ID: 3e99j
Method: AlphaFoldv2
Resolution: 97.11
Chain position: -