Protein

Protein accession
6Vgp5 [EnVhog]
Representative
2S3iH
Source
EnVhog (cluster: phalp2_8866)
Protein name
6Vgp5
Lysin probability
98%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
VTDIFDFAGCAQYAQPLRDACAKWGVDGASVPYFLAQLSVESMKFSRVTENLNYRSDTLLRVCNGRNGIHTIDDATAVVMRGHDAIAEALYGGSWGASHLGNTEPGDGARFCGHGLIQTTGRYNHHVVSQRVHGDDRYLENPALLTLAGEAAEAAASFWMGKKLNGVTDVEAITHAINGGQEGLMARQAMTQHLLTYNP
Physico‐chemical
properties
protein length:199 AA
molecular weight:21464,8 Da
isoelectric point:5,80
hydropathy:-0,22
Representative Protein Details
Accession
2S3iH
Protein name
2S3iH
Sequence length
202 AA
Molecular weight
22109,41850 Da
Isoelectric point
4,86970
Sequence
MSIFSDFDCGEFTGALENYADHWGISDPKDQARFLGQLSVESQQFTRVVENLNYRPARLLEIFRGRNGLDTLDQATAICAGGPERIGEAMYGLPWGSTHLGNTEPGDGGNFIGRGLIMITGRQNYHDASYGCFGDDRLLQNPDLLTQTDNAANVACWFWYNRKLSIITDIAAITQKVNGGLTDLAGRIAQTNRALSLISSAP
Other Proteins in cluster: phalp2_8866
Total (incl. this protein): 4 Avg length: 204,8 Avg pI: 7,55

Protein ID Length (AA) pI
2S3iH 202 4,86970
3W3iD 190 9,69329
48CzP 228 9,85285
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16306
5aCST
92 40,8% 191 5.361E-39
2 phalp2_37433
2Fv3r
11643 38,8% 188 1.235E-37
3 phalp2_2766
23eb
36 40,4% 188 4.329E-37
4 phalp2_5234
8cPFI
3482 38,2% 188 2.548E-35
5 phalp2_10067
7wmbf
22 37,3% 193 3.059E-31
6 phalp2_39259
3ZvQn
35 38,1% 173 4.182E-31
7 phalp2_1820
3er8S
16 33,4% 203 1.782E-29
8 phalp2_22262
7pbAz
301 37,5% 176 3.602E-27
9 phalp2_22433
K5Pz
97 33,7% 222 4.921E-27
10 phalp2_1654
8rD3A
4011 34,4% 186 5.286E-25

Domains

Domains
Unannotated
Representative sequence (used for alignment): 2S3iH (202 AA)
Member sequence: 6Vgp5 (199 AA)
1 202 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
6Vgp5
Method AlphaFoldv2
Resolution 95.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2S3iH) rather than this protein.
PDB ID
2S3iH
Method AlphaFoldv2
Resolution 97.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50