Protein

Protein accession
6VgQy [EnVhog]
Representative
4g2bf
Source
EnVhog (cluster: phalp2_15824)
Protein name
6VgQy
Lysin probability
85%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSEQWGVCEEPCIILDVSHWQGTDIDFPACKALGVRGVIVKRYHGSGYVKAYDEQFPSAQLAGIELLGDYAWLVPTSVPVDLQVAAWATPAKGRLPFTIDWEDPSTKLRGKPLVSILERAIEIKSDKDGRRPTIYTGAWYWRDFCQGVDSEIVAACDLHLAAYPRKTNTGLRYTEAQQEVCGGHMPGVPLPWAKRGLQPLIWQFSGDAQPLRLPPSKNGASPAVDVNTGDARRLFATIDAPPDTLPEGETGRRRSSQRVKAFSDAATLLRAPEGEHTPIVLESEKE
Physico-chemical properties
protein length: 286 AA
molecular weight: 31505,3 Da
isoelectric point: 5,65
hydropathy: -0,36
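The hydropathy figure above is presumably a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not state which scale was used, so that is an assumption. Under that assumption, a minimal sketch of the calculation, together with a hypothetical helper for reading the decimal-comma numbers on this page, could look like this:

```python
# Kyte-Doolittle hydropathy values per amino acid (one-letter codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


def parse_decimal_comma(text: str) -> float:
    """Hypothetical helper: read values such as '31505,3 Da' or '-0,36'."""
    number = text.split()[0]          # drop a trailing unit such as 'Da'
    return float(number.replace(",", "."))


# First ten residues of the protein sequence in this record, as a small demo.
prefix = "MSEQWGVCEE"
print(len(prefix), gravy(prefix))
print(parse_decimal_comma("31505,3 Da"))
```

Passing the full 286-residue sequence to `gravy` gives a value that can be compared with the reported hydropathy; a mismatch would suggest the page uses a different scale.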
Representative Protein Details
Accession
4g2bf
Protein name
4g2bf
Sequence length
256 AA
Molecular weight
28146,74510 Da
Isoelectric point
4,89238
Sequence
MKLRDLVTDPCAMLDVAHFQGASIPWEECAHLGIRAVVIKWWHGPWRNQPLVAQQQYREAKAAGLLVGRYAWWVPNASVDAQIAAWLSDPWPDEDLPLCIDMEDPALPKGLPTLAAAVHIVEVIEQATGRAPIIYSGAWWADAWLGARSPELARCPYWHAAYPRKAAKGTDYVGAVAEWLAQDAPRLPAIWAAETPIAWQLDGTDDATGTGALRLPNDVDVDVNLADLVALRRLVPTHRDTDPAPFDPTPPALRLS
Other Proteins in cluster: phalp2_15824
Total (incl. this protein): 3; Avg length: 281,3 AA; Avg pI: 5,09

Protein ID Length (AA) pI
4g2bf 256 4,89238
4eFh6 302 4,72442
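The cluster averages can be checked against the three members listed on this page (this protein plus the two rows above). The sketch below copies the lengths and pI values from the record, with decimal commas written as points; it is only an arithmetic check, not the database's own computation.

```python
from statistics import mean

# (accession, length in AA, isoelectric point) for every cluster member,
# copied from this record; decimal commas are written as points.
members = [
    ("6VgQy", 286, 5.65),
    ("4g2bf", 256, 4.89238),
    ("4eFh6", 302, 4.72442),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Avg length: {avg_length:.1f} AA")
print(f"Avg pI: {avg_pi:.2f}")
```

Rounded to the precision shown on the page, both values agree with the reported averages.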
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10644 (2AeOK) 60 30,7% 218 3.404E-18
2 phalp2_34559 (4UNXW) 287 31,6% 215 7.883E-16
3 phalp2_39912 (1jXPT) 300 31,2% 224 1.066E-15
4 phalp2_26214 (jZqY) 82 27,9% 222 1.300E-13
5 phalp2_36157 (6TiaL) 25 31,0% 216 4.298E-13
6 phalp2_26738 (367Qc) 15 26,9% 234 4.298E-13
7 phalp2_20360 (2RVWF) 69 28,9% 214 3.462E-12
8 phalp2_14402 (4w7mj) 98 28,0% 239 3.462E-12
9 phalp2_26909 (4r87F) 36 27,8% 219 4.661E-12
10 phalp2_4176 (1Xqpx) 2 24,5% 261 8.445E-12
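When working with these pHMM hits programmatically, it can help to hold them as typed records and filter on identity and E-value. The sketch below uses the ten rows from this record; the class name, the function name and both thresholds are hypothetical choices for illustration, not cut-offs used by the database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterHit:
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


# Rows copied from the table in this record (decimal commas as points).
hits = [
    ClusterHit("phalp2_10644", 60, 30.7, 218, 3.404e-18),
    ClusterHit("phalp2_34559", 287, 31.6, 215, 7.883e-16),
    ClusterHit("phalp2_39912", 300, 31.2, 224, 1.066e-15),
    ClusterHit("phalp2_26214", 82, 27.9, 222, 1.300e-13),
    ClusterHit("phalp2_36157", 25, 31.0, 216, 4.298e-13),
    ClusterHit("phalp2_26738", 15, 26.9, 234, 4.298e-13),
    ClusterHit("phalp2_20360", 69, 28.9, 214, 3.462e-12),
    ClusterHit("phalp2_14402", 98, 28.0, 239, 3.462e-12),
    ClusterHit("phalp2_26909", 36, 27.8, 219, 4.661e-12),
    ClusterHit("phalp2_4176", 2, 24.5, 261, 8.445e-12),
]


def filter_hits(rows, min_identity=30.0, max_evalue=1e-10):
    """Keep hits at or above min_identity (%) and at or below max_evalue."""
    return [
        h for h in rows
        if h.identity_pct >= min_identity and h.evalue <= max_evalue
    ]


for hit in filter_hits(hits):
    print(hit.cluster, hit.identity_pct, hit.evalue)
```

All ten hits pass the illustrative E-value threshold, so with these settings the identity cut-off is what narrows the list.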

Domains

Representative sequence (used for alignment): 4g2bf (256 AA)
Member sequence: 6VgQy (286 AA)
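Domain architecture figure (axis: residues 1 to 256 of the representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated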
Pfam accessions: PF01183

Taxonomy

Name Taxonomy ID Lineage
Phage: Unknown from Metagenome; Taxonomy ID: UNKNOWN_ENVHOG; No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
6VgQy
Method AlphaFoldv2
Resolution 74.52
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
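
The confidence bands above can be expressed as a small lookup. The sketch below assumes that the "Resolution" values reported for these AlphaFold models (for example 74.52) are mean pLDDT scores rather than resolutions in Å; the record does not say so explicitly. The legend also leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is a further assumption, and the function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Boundary values (exactly 90, 70 or 50) are assigned to the lower band;
    the legend itself does not specify where they belong.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values taken from the 'Resolution' fields of this record (assumed pLDDT).
for value in (74.52, 76.93):
    print(value, plddt_band(value))
```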

The structures below correspond to the cluster representative (4g2bf) rather than this protein.
PDB ID
4g2bf
Method AlphaFoldv2
Resolution 76.93
Chain position -