Protein

Protein accession
8m4Sy [EnVhog]
Representative
4G2G4
Source
EnVhog (cluster: phalp2_27389)
Protein name
8m4Sy
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MLNKTTFNNRLKTFGLFQSATLLQVNGCSDIIDAFYNAKGTDLRKLAYILGTVYHETAMKMQPVKEFGGEKYLLSKVYYPYYGRDLVQTTWKANYEKVKKFTGVDVVTNPELIGIMPLSARVAVVFMFKGLYTGKKLEDYFNDKIEDWVNARRIINGIDKAAEIAAIAQKFYECLK
Physico‐chemical
properties
protein length:176 AA
molecular weight:20145,2 Da
isoelectric point:9,25
hydropathy:-0,16
Representative Protein Details
Accession
4G2G4
Protein name
4G2G4
Sequence length
240 AA
Molecular weight
27424,08370 Da
Isoelectric point
6,44363
Sequence
MGMRLSAAAIRERQRGLAALTLYTGVLDGIAGPKTQTAIADFCELERLDPGVDIAEHLYNKVISVPVEQYQRPEDLADTLKLVCTSVYRNDPRVWAYMMATVQHETKAAYFPVEEAYFVSSTNARVRYLKSQEYWPYFGRGLVQLTWLFNYEKYKKILGVDLVGDPDLALEPSFSLFILVHGMLTGTYTGKPLELYINQNHCDYIQARRVVNGVRKGETLPDRSELIASYAKQWEQFYAK
Other Proteins in cluster: phalp2_27389
Total (incl. this protein): 14 Avg length: 183,9 Avg pI: 8,13

Protein ID Length (AA) pI
4G2G4 240 6,44363
1Mau4 176 9,44380
2ZPAB 176 9,33027
46nMp 133 9,21094
4FUmT 242 5,82636
4ND2n 176 9,23911
4nTVj 133 7,87103
58tdt 138 6,57595
7ZzmP 207 8,58276
8aFNM 228 7,67397
8jhCw 183 8,84372
8jx4P 191 8,34358
lEqB 175 7,11950
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23913
1MvJ3
1257 42,6% 164 1.291E-53
2 phalp2_17517
5yaNZ
733 40,9% 161 6.840E-51
3 phalp2_28332
1asZp
50 35,2% 196 3.870E-43
4 phalp2_14563
4WxUn
402 37,7% 175 1.207E-41
5 phalp2_34010
83A2R
53 32,9% 191 3.343E-39
6 phalp2_5647
4nOVi
1 33,0% 248 6.242E-39
7 phalp2_22521
1j3qD
2 27,7% 180 1.460E-17
8 phalp2_14147
2iCT4
4 25,6% 253 1.438E-12
9 phalp2_18983
8cPqj
14 27,8% 237 1.371E-09

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 4G2G4 (240 AA)
Member sequence: 8m4Sy (176 AA)
1 240 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4G2G4) rather than this protein.
PDB ID
4G2G4
Method AlphaFoldv2
Resolution 87.93
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50