Protein

Protein accession: gI7O [EnVhog]
Representative: 4A0cu
Source: EnVhog (cluster: phalp2_30328)
Protein name: gI7O
Lysin probability: 89%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MFDRSRSVWEQSGFTVAEHTISPPVAPAMIDSVVVHYPGHDGVGDDTARDLANSQRYYATNRGYSLGYNAVVDRAGVLWQVRGLEYRNAANKGANNTSVSVQLRVNLDEPGSAAAVARVRRFVADMERWCGHPLTVLGHRDVGATACPGDAIYNQIQRGEFDPSANKDLTMKLIDPPQRVYDTRKQTGRFNDGETRKISVGQPNAKAVFVNVTAVNPNNTGFITMWGAGPQPDVSNVNYKLGDIVCNTSWVPVAGDGTIQIYSYAACDLLVDVQAVAS
Physico-chemical properties
Protein length: 278 AA
Molecular weight: 30191,4 Da
Isoelectric point: 6,20
Hydropathy: -0,31
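The hydropathy value above is not tied to a named scale on this page. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), shows how such a value could be recomputed from a sequence. The function name `gravy` is hypothetical and this is not confirmed to be the method the database uses.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
# Assumption: the page's "hydropathy" is a GRAVY score based on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    # Drop whitespace and normalise case so pasted sequences work as-is.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"unsupported residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Calling `gravy(...)` on the protein sequence listed above and rounding to two decimals would give a value to compare against the reported hydropathy; `len(...)` on the same cleaned string gives the protein length.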
Representative Protein Details
Accession: 4A0cu
Protein name: 4A0cu
Sequence length: 293 AA
Molecular weight: 31717,38170 Da
Isoelectric point: 8,48225
Sequence
MIPGLIPRTEWEQPGWRVSEHTSSGPQNYSLAEYVVIHYTAAPTTPSSREGVIGFIQRTQRDYAQNRGYSIGYNWVVDRSGRIWETRGDQYRCGANGNTENNTRGPAILCLVDGAEPAGSTMVRSVQAIVEHCDQRAGRTLRVVGHRDIRATSCPGDGLYDQVKNGVFRPVIATPPVINLTSECIMRLVDPPQRLLDTRTTRQPKDGETLFVQVPGNPKAAFVNVTIVNPAKGGYATAFNPATGKPATSNVNFPPPGQAVCNTSWVPCGENGMIAVFVSAATHVVVDLQAVAS
Other Proteins in cluster: phalp2_30328
Total (incl. this protein): 57; Avg length: 278,3; Avg pI: 6,01

Protein ID Length (AA) pI
4A0cu 293 8,48225
16p2o 278 5,52722
17mC5 291 4,73659
1B5CD 282 5,76242
1KidC 320 5,02556
1oXJB 310 5,41331
1wJL3 223 7,69898
22HjV 296 5,80908
26Nl5 281 5,82915
26faD 281 5,38819
26mOX 292 6,79223
272hA 281 5,61054
29ww7 292 6,49354
2JNfz 292 6,49547
2KI6A 231 6,37230
2KIRg 295 8,44595
2MuaR 281 6,09345
2OYWB 276 5,82278
2PxCr 276 5,82278
2R59R 292 6,29108
2R5Js 292 6,29528
2R9XY 278 5,97124
2Rlh9 284 7,61548
3SBvh 294 4,42363
3VrRo 294 6,36917
3gC8d 281 5,34528
3mzWF 276 5,96482
42sm1 281 5,30992
4Accg 279 5,97261
4DKco 275 5,94999
4X5Zt 211 5,72763
4oeto 280 6,51934
4sBtu 292 6,18467
4sQrf 281 5,19903
4sgvk 247 5,62208
4txwd 278 5,43554
5yMUF 281 5,71876
6QYPT 246 9,19778
81y3l 277 5,46020
85M9w 275 5,54410
8DwdC 291 4,50116
8E6Tf 291 4,39515
8sQ0Q 294 5,55109
8tqin 295 8,67591
8wOVn 294 4,42352
Ajvn 292 6,29426
E6Yj 277 6,19712
Re6F 276 5,97102
gDrJ 284 6,58698
gKPt 276 5,80004
sUSq 281 5,62384
u85m 220 6,70674
uZuM 249 5,71643
ut7F 278 5,52733
v5Mk 268 5,45850
y2Va 253 5,30481
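The cluster table writes pI values with a decimal comma, so they cannot be passed directly to a numeric parser. The following sketch, with hypothetical helper names (`parse_number`, `parse_cluster_rows`, `summarize`), shows one way to read rows of the form `ID length pI` and recompute the summary figures. Note that the summary line counts this protein (gI7O), which does not appear among the listed rows, so it would need to be added before comparing totals.

```python
from statistics import mean


def parse_number(text: str) -> float:
    """Parse a number that may use a decimal comma, e.g. '8,48225'."""
    return float(text.strip().replace(",", "."))


def parse_cluster_rows(table_text: str):
    """Return a list of (protein_id, length, pI) from 'ID length pI' lines."""
    rows = []
    for line in table_text.splitlines():
        parts = line.split()
        # Skip the header, blank lines, and anything not shaped like a row.
        if len(parts) != 3 or not parts[1].isdigit():
            continue
        rows.append((parts[0], int(parts[1]), parse_number(parts[2])))
    return rows


def summarize(rows):
    """Compute the member count, mean length, and mean pI for parsed rows."""
    return {
        "total": len(rows),
        "avg_length": mean(r[1] for r in rows),
        "avg_pi": mean(r[2] for r in rows),
    }
```

For example, `summarize(parse_cluster_rows(table_text))` on the pasted table text returns a dictionary whose values can be rounded and compared with the "Total", "Avg length", and "Avg pI" figures.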
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_39086 (2ckB5) 17 41,8% 196 2.160E-62
2 phalp2_26561 (8egJl) 2 26,4% 336 3.842E-34
3 phalp2_33745 (ZH7b) 6 25,0% 299 5.314E-32
4 phalp2_23343 (5AsJX) 1 31,7% 195 1.296E-25
5 phalp2_15057 (7s7pP) 172 26,5% 196 4.726E-22
6 phalp2_20947 (7lyTA) 33 28,9% 190 2.899E-21
7 phalp2_11750 (42dFf) 4 24,8% 189 3.954E-18
8 phalp2_13210 (2JwXt) 44 22,2% 184 5.822E-17
9 phalp2_35279 (6U58t) 2 24,5% 244 3.477E-16
10 phalp2_20811 (63TES) 2 26,4% 178 4.682E-16

Domains

Domain architecture (figure): domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Representative sequence (used for alignment): 4A0cu (293 AA), axis 1 to 293 AA
Member sequence: gI7O (278 AA)
Domain labels shown: Ami2, Unannotated
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

Phage: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4A0cu) rather than this protein.
PDB ID: 4A0cu
Method: AlphaFold v2
Resolution: 93.34
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
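The confidence legend can be expressed as a small lookup. This is a sketch with a hypothetical function name (`plddt_band`). The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band here is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT is expected to lie in [0, 100]")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    return "Very low"
```

A per-residue list of scores could be passed through `plddt_band` to count how many residues fall into each band.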