Protein

Protein accession
2Cfw7 [EnVhog]
Representative
4Fmu3
Source
EnVhog (cluster: phalp2_2298)
Protein name
2Cfw7
Lysin probability
96%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIGIERYSADQWASLANVVADIGHAFKRRIAALAADDGDDEPAIMRICGHRDLSPDADGDGTVEPQEWTKTCPGFDVAAWLGAGMKPDPAHVWSEA
Physico-chemical properties
protein length: 96 AA
molecular weight: 10306,3 Da
isoelectric point: 4,44
hydropathy: -0,30
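The length, molecular weight and hydropathy values above can be roughly cross-checked from the sequence itself. The sketch below is not the database's own method: it assumes the hydropathy figure is a Kyte-Doolittle GRAVY score (mean hydropathy per residue) and that the molecular weight uses average residue masses plus one water. The names `AVG_MASS`, `KD`, `molecular_weight` and `gravy` are illustrative. Small deviations from the listed molecular weight can arise from the mass table used.

```python
# Average residue masses in Da (amino acid minus one water); assumed mass table.
AVG_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "Q": 128.1307, "E": 129.1155, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini

# Sequence of 2Cfw7 as listed in this record.
SEQ = ("MIGIERYSADQWASLANVVADIGHAFKRRIAALAADDGDDEPAIMRICGHRDLSPDADGDG"
       "TVEPQEWTKTCPGFDVAAWLGAGMKPDPAHVWSEA")


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule."""
    return sum(AVG_MASS[aa] for aa in seq) + WATER


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    return sum(KD[aa] for aa in seq) / len(seq)


print("length:", len(SEQ))
print("molecular weight (Da):", round(molecular_weight(SEQ), 1))
print("GRAVY:", round(gravy(SEQ), 2))
```

The isoelectric point is left out on purpose: it depends on the pKa set chosen, and the record does not state which one was used.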
Representative Protein Details
Accession
4Fmu3
Protein name
4Fmu3
Sequence length
59 AA
Molecular weight
6769,64790 Da
Isoelectric point
7,76940
Sequence
MVGGGHGICNFTRFQWRALDVFIDILLSRYPNAAVCGHRDLNPEKACPSFDARAWWNFQ
Other Proteins in cluster: phalp2_2298
Total (incl. this protein): 19 Avg length: 76,7 Avg pI: 6,02

Protein ID Length (AA) pI
4Fmu3 59 7,76940
1Dppp 107 4,89528
1td3S 52 5,73064
2MNWI 76 4,05003
2eRIM 71 4,99066
3DeQw 58 5,68438
3FrIW 65 4,96360
4xSJG 79 5,55689
5UeKo 70 4,71737
6HjLg 63 4,64303
7URg4 73 8,85269
7j5v5 79 6,52099
7lMWc 75 8,74747
7wdmK 112 6,98724
7wdqO 80 8,89027
8ECix 68 5,76361
8zOch 65 4,64286
eQQc 110 6,45131
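The summary line (19 proteins, average length 76,7, average pI 6,02) can be reproduced from the table once this protein (2Cfw7, 96 AA, pI 4,44) is added to the 18 listed members. The sketch below assumes plain arithmetic means and converts the decimal commas before parsing. `ROWS`, `parse_decimal` and `members` are illustrative names.

```python
# Rows copied from the cluster table: protein ID, length (AA), pI.
ROWS = """\
4Fmu3 59 7,76940
1Dppp 107 4,89528
1td3S 52 5,73064
2MNWI 76 4,05003
2eRIM 71 4,99066
3DeQw 58 5,68438
3FrIW 65 4,96360
4xSJG 79 5,55689
5UeKo 70 4,71737
6HjLg 63 4,64303
7URg4 73 8,85269
7j5v5 79 6,52099
7lMWc 75 8,74747
7wdmK 112 6,98724
7wdqO 80 8,89027
8ECix 68 5,76361
8zOch 65 4,64286
eQQc 110 6,45131
"""


def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '7,76940'."""
    return float(text.replace(",", "."))


members = []
for line in ROWS.splitlines():
    protein_id, length, pi = line.split()
    members.append((protein_id, int(length), parse_decimal(pi)))

# The table omits the protein this record describes, so add it explicitly.
members.append(("2Cfw7", 96, parse_decimal("4,44")))

count = len(members)
avg_length = sum(m[1] for m in members) / count
avg_pi = sum(m[2] for m in members) / count

print(count, round(avg_length, 1), round(avg_pi, 2))
```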
Similar Clusters (pHMM search)
# Cluster Protein ID Members Identity (%) Alignment length E-value
1 phalp2_5114 1MHVU 6 48,9 49 1.677E-14
2 phalp2_19536 3DXjR 12 38,3 60 7.138E-12
3 phalp2_9258 6ZeMl 429 34,7 69 2.556E-11
4 phalp2_6944 2Tcfx 1 32,7 58 3.280E-10
5 phalp2_22672 3xgBR 5 38,6 44 3.065E-09
6 phalp2_6408 bUKw 4 36,9 46 1.514E-08
7 phalp2_25760 4OCuh 74 41,8 43 1.030E-07
8 phalp2_15353 1Ae5Q 1 39,1 46 1.419E-07
9 phalp2_499 8FzyL 2 40,0 45 1.332E-06
10 phalp2_28979 6bWlc 2 42,5 40 1.835E-06
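As a small illustration of how the pHMM hits might be screened, the sketch below stores the ten hits as tuples and selects those under an E-value cutoff. The cutoff of 1e-9 is an arbitrary assumption chosen for the example, not a threshold stated by the database. The field names (`cluster`, `protein_id`, and so on) are illustrative.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    protein_id: str   # protein ID listed alongside the hit cluster
    members: int
    identity: float   # percent identity
    aln_length: int
    evalue: float


# Values transcribed from the similar-clusters table (decimal commas converted).
HITS = [
    Hit("phalp2_5114", "1MHVU", 6, 48.9, 49, 1.677e-14),
    Hit("phalp2_19536", "3DXjR", 12, 38.3, 60, 7.138e-12),
    Hit("phalp2_9258", "6ZeMl", 429, 34.7, 69, 2.556e-11),
    Hit("phalp2_6944", "2Tcfx", 1, 32.7, 58, 3.280e-10),
    Hit("phalp2_22672", "3xgBR", 5, 38.6, 44, 3.065e-09),
    Hit("phalp2_6408", "bUKw", 4, 36.9, 46, 1.514e-08),
    Hit("phalp2_25760", "4OCuh", 74, 41.8, 43, 1.030e-07),
    Hit("phalp2_15353", "1Ae5Q", 1, 39.1, 46, 1.419e-07),
    Hit("phalp2_499", "8FzyL", 2, 40.0, 45, 1.332e-06),
    Hit("phalp2_28979", "6bWlc", 2, 42.5, 40, 1.835e-06),
]

EVALUE_CUTOFF = 1e-9  # hypothetical threshold, for illustration only

# Hits whose E-value falls below the cutoff.
strong = [h for h in HITS if h.evalue < EVALUE_CUTOFF]
# Hit with the highest percent identity.
best_identity = max(HITS, key=lambda h: h.identity)

for h in strong:
    print(h.cluster, h.identity, h.evalue)
print("highest identity:", best_identity.cluster)
```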

Domains

Unannotated
Representative sequence (used for alignment): 4Fmu3 (59 AA)
Member sequence: 2Cfw7 (96 AA)
Domain positions follow the representative sequence above (axis: positions 1 to 59 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Fmu3) rather than this protein.
PDB ID
4Fmu3
Method AlphaFoldv2
Resolution 91.37
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
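The confidence bands above map a pLDDT score to a label, which can be sketched as a small function. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption here. Treating the 91.37 listed under "Resolution" as a pLDDT score is also an assumption, since the record does not say so. `classify_plddt` is an illustrative name.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; the
    legend leaves them unspecified, so this is an assumption.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# 91.37 is the value shown for the representative model; reading it as a
# pLDDT score is an assumption.
print(classify_plddt(91.37))
```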