Protein

Protein accession
4FSV7 [EnVhog]
Representative
4w90f
Source
EnVhog (cluster: phalp2_39335)
Protein name
4FSV7
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNAKKFYDQIRAIFFPNGLTQAQVDNIEHVLRALREFGVSDSRWIAYVLATVFHETSGTMLPVHEKGNREYFLRYDIKGNPKKAAELGNNQAGDGYKYRGRGFVQLTGKRNYQLFGDRYNIDLVTNPDLVIEPVISANVLVDGMVSGLFTGKSLDDCFPLHGDADWINARRIVNGTDKAQLIANYAQQFHNALVIANESDKPAPIPIQNPELSIDYTPINQWVTDQERPAPLPPSSWSPPVQHRTRPVPNPAPLPAPAPLVQTRDPWWLRALTSKIFWVNLITLVTGGGVFTLVDLTPEQRATVAELMVYISGGVTILLRIFFSNLPPVKK
Physico‐chemical
properties
protein length:331 AA
molecular weight:37042,9 Da
isoelectric point:8,52
hydropathy:-0,21
Representative Protein Details
Accession
4w90f
Protein name
4w90f
Sequence length
295 AA
Molecular weight
32866,72930 Da
Isoelectric point
5,39967
Sequence
MNRKIFYDAMRQTLFGQITQPQVDGMENILNEWEARELDDLRWLAYILATTYHETGHTMQPIEEWGKGRKHSYGQPDAENGQCYYGRGYVQLTHKRNYSTFADRMGVDLVQNPELALDPSNAVKILIDGMVNGLYTGVGLPRYFGKSTNWEEARRIVNGEDHKHDIAEYAKAFYAALQTASSFQPDEQVRESIVTTTDGEEIIMLEHEQPQPAPPQSIAEPMPVSNGSGWLTHTGMAVSGLIGLAGMLGYVPGMTPEYGAGMIQTALGISGTRKAAPSLIKYALQIFFSLRSKNP
Other Proteins in cluster: phalp2_39335
Total (incl. this protein): 4 Avg length: 326,5 Avg pI: 7,36

Protein ID Length (AA) pI
4w90f 295 5,39967
4GoCm 309 6,63421
4muG6 371 8,89536
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6562
14Ua3
65 48,4% 223 5.694E-73
2 phalp2_17517
5yaNZ
733 51,2% 203 3.310E-68
3 phalp2_23913
1MvJ3
1257 51,7% 197 4.450E-65
4 phalp2_14563
4WxUn
402 42,7% 215 9.321E-55
5 phalp2_17986
1eoEg
230 42,0% 221 3.250E-54
6 phalp2_34010
83A2R
53 41,0% 229 2.888E-53
7 phalp2_15378
1Klbp
1 36,3% 253 1.006E-52
8 phalp2_31757
46nle
40 34,0% 297 2.019E-50
9 phalp2_7409
4FTG1
6 38,6% 199 8.494E-49
10 phalp2_6317
71FE5
3 32,7% 223 7.835E-41

Domains

Domains
GH19
Unannotated
Representative sequence (used for alignment): 4w90f (295 AA)
Member sequence: 4FSV7 (331 AA)
1 295 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00182

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4FSV7
Method AlphaFoldv2
Resolution 74.61
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4w90f) rather than this protein.
PDB ID
4w90f
Method AlphaFoldv2
Resolution 79.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50