Protein

Protein accession
496lc [EnVhog]
Representative
2GDF5
Source
EnVhog (cluster: phalp2_6920)
Protein name
496lc
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAERIYKLGTNSGKWKNVTSAELFASDKWTNLDPEFARRIFAMLNLLIDFGGSAGLGATYRSYESQRSTFLSRYHGVSGPSKKSVAWKGTDNRGVSYSFWEKNPGETSIAPPGMSYHDLVTSQGKCLAVDLMGYQSDRNLMQAYAPWFGLYMTWWDLSDPHHFQPIEVPHAKKYYNPKIHKLSYWSFKF
Physico‐chemical
properties
protein length:189 AA
molecular weight:21660,3 Da
isoelectric point:9,30
hydropathy:-0,53
Representative Protein Details
Accession
2GDF5
Protein name
2GDF5
Sequence length
180 AA
Molecular weight
20035,75900 Da
Isoelectric point
9,59078
Sequence
VTLYPFGYRGKKLTLEQIARQPVVAGMDPEFRRRVFAMMEHAAQVSKSLGIGGARRSSATQLAGFLDRHQVVTLGGCCRYNGKRYALKKGRAHMAPPGLSYHEDTTPDGKCLAADLIGDLRWMNANCGKFGLRHFAKVNNEPWHVQPVEIPNGRSRYNPKKHHPLPVFDLPGDDESDLVA
Other Proteins in cluster: phalp2_6920
Total (incl. this protein): 10 Avg length: 200,6 Avg pI: 9,40

Protein ID Length (AA) pI
2GDF5 180 9,59078
15XOi 242 9,94813
5qTSo 170 9,15175
6MqKU 200 8,83206
6VF0 171 9,09676
8juVb 253 9,39551
8oy26 257 9,31460
lxuw 171 9,51239
qzHp 173 9,83280
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12435
lKKf
19 57,7% 175 3.808E-77
2 phalp2_911
8pNJ8
7 41,7% 163 1.938E-47
3 phalp2_37737
4i40I
27 41,1% 180 5.156E-41
4 phalp2_12519
11EF1
7 38,2% 178 5.268E-32
5 phalp2_29288
qzib
9 31,5% 165 4.286E-19
6 phalp2_27543
5nBeO
48 27,6% 188 3.999E-16
7 phalp2_17679
6V089
7 29,8% 201 1.980E-11
8 phalp2_15608
2ldH7
2 25,3% 158 2.304E-10
9 phalp2_18305
5coeD
66 31,5% 133 5.770E-10
10 phalp2_30207
3YPMB
2 29,4% 153 1.556E-06

Domains

Domains
Unannotated
Representative sequence (used for alignment): 2GDF5 (180 AA)
Member sequence: 496lc (189 AA)
1 180 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2GDF5) rather than this protein.
PDB ID
2GDF5
Method AlphaFoldv2
Resolution 95.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50