Protein

Protein accession
4HSRn [EnVhog]
Representative
4eR2J
Source
EnVhog (cluster: phalp2_34384)
Protein name
4HSRn
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIAPAKTAVTTPFGWVKDYPLNKGSYPGYGNAPTSRHGFHTGVDFSHSPDNKIYMPETGVVQLFNWNGTDYNGNHIIVQVGNRRHFLGHIKNNGFLVKNGTTVKIGTPIAIMGDTGYADGVHLHWGLRVDGKIVNGLNYVKETPIMLTEKQIIDLFRLAFNKEPTANQIKNYLTDGWLPLAMNVAGTLRNSLVAESAKVTSAAKRIKELEAALATPATELKPGKYVVK
Physico‐chemical
properties
protein length:228 AA
molecular weight:24987,4 Da
isoelectric point:9,54
hydropathy:-0,25
Representative Protein Details
Accession
4eR2J
Protein name
4eR2J
Sequence length
222 AA
Molecular weight
24394,21230 Da
Isoelectric point
5,68301
Sequence
MQRPSPSPITTPFGQVPGYPLNNGFHKGVDFAFIPDNKVYMPEDGIVHVVPWDNHSAEGNTIYISVGNRKHALCHLSKFLVSDGQYCNAGKVIGVMGETGAAEGVHLHWAVTVGGQLVDPLTLVKGGQNVVTPDEVNRIAIAMVDRPATQEDISTFTGKSVTEVIDAYDRTDERKKIRGKVTDYDHLAETAQQQLAELQAKLDVANAHNEYEELPFKVFKQK
Other Proteins in cluster: phalp2_34384
Total (incl. this protein): 6 Avg length: 236,2 Avg pI: 7,21

Protein ID Length (AA) pI
4eR2J 222 5,68301
18ZB5 254 6,30324
1LnXc 255 6,65666
4Iic9 228 9,62244
4KBUB 230 5,43963
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13188
2tJnG
1 28,9% 183 8.779E-24
2 phalp2_993
2vbyj
4 30,0% 160 6.557E-14
3 phalp2_16527
72xAK
1 31,6% 142 6.163E-12
4 phalp2_2348
4TuWR
59 29,4% 163 1.127E-11
5 phalp2_10612
2eP5N
1 26,7% 224 5.079E-11
6 phalp2_6324
75jON
4 28,1% 167 6.861E-11
7 phalp2_4014
134xF
12 25,3% 193 6.598E-08
8 phalp2_35019
4U95D
2 27,4% 171 2.281E-06

Domains

Domains
PET_M23
Unannotated
Representative sequence (used for alignment): 4eR2J (222 AA)
Member sequence: 4HSRn (228 AA)
1 222 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4HSRn
Method AlphaFoldv2
Resolution 82.20
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4eR2J) rather than this protein.
PDB ID
4eR2J
Method AlphaFoldv2
Resolution 84.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50