Protein

Protein accession
GcsZ [EnVhog]
Representative
GcsZ (this protein)
Source
EnVhog (cluster: phalp2_38677)
Protein name
GcsZ
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MQLSEHLSLAEVTRSETAKRRGISNMPTDAHIANFKLLAEKVFEPIRNHFGKPIHISSGYRSKDLNTAIGGALSSQHCSGEAIDIDMDGHAGGVTNKMVFDYIKDNLEFDQLIWEFGTDALQIRYRNENNHINSNRNIFIEYNCSIYLHLFFQLSNGKLRSIFGISWSSNGRWLLWRNCWSKTRRFSNL
Physico‐chemical
properties
protein length:189 AA
molecular weight:21756,3 Da
isoelectric point:8,81
hydropathy:-0,43
Other Proteins in cluster: phalp2_38677
Total (incl. this protein): 89 Avg length: 154,6 Avg pI: 7,49

Protein ID Length (AA) pI
1CxeN 155 7,06460
1Jv8n 153 8,64735
1KgCP 154 6,58573
1XahA 154 7,81990
1esth 153 8,66785
1etYf 154 6,96519
1wYC1 176 9,36779
1zhvJ 155 7,89050
25NXV 155 6,20599
272Ex 155 7,89056
30mdP 149 6,97525
3HmB9 163 9,28437
3XNZ0 155 6,37611
3tqY 149 6,51900
47GJa 153 7,86993
48DXy 154 6,96337
4GEDE 154 6,29329
4IYl5 154 7,01304
4MfZS 149 8,83090
4aqSB 152 7,91319
4bAuE 153 7,85291
4basB 154 6,96126
4ltjl 154 7,01549
4m3dj 154 7,01697
4mdTY 155 8,66960
4n9KM 154 7,00816
4peRn 154 7,01492
51uSg 154 6,96030
52Y9Z 154 7,79476
53aHH 154 7,01583
542d9 155 7,06289
553gY 155 7,01441
56Czh 154 6,64950
56FaJ 154 6,29488
56Hsv 154 6,96030
56Pky 152 9,07613
57UPh 154 8,83160
57i7Q 154 7,83389
58koM 154 6,95876
58tPv 154 7,01492
591Yy 154 7,74780
5BPWu 155 7,87406
5BRyp 154 7,01498
5akJf 155 7,06238
5avKg 154 7,01498
5b7j8 154 7,01640
5bVBW 154 7,01583
5cwke 176 9,30081
5d21w 154 6,96371
5de7Z 156 8,55155
5f4Fd 155 9,38358
5gaKE 155 7,88244
5hdyG 154 6,96604
5icHL 154 7,01640
5j72h 154 9,19540
5mN1C 154 8,68952
5n68B 155 7,84517
5vlVT 155 6,51275
5wXsy 154 7,01640
5wc9S 155 7,79805
5wvs4 155 7,06289
5y1ja 153 7,86993
5y5z9 155 7,06289
5ySRN 154 7,76201
6HhNb 155 7,01441
6Mq6o 155 6,20587
6UlAS 154 7,02129
6yHM7 155 7,87966
7X2Vo 155 7,90842
7XFsz 149 6,29920
83FyK 154 6,51809
8mTpE 151 6,20110
8mj7S 155 7,89024
8ostD 154 6,96371
8py9G 152 9,09709
8rrCP 155 6,58823
FKJF 155 7,06289
HhW7 153 8,96750
SQet 153 6,51400
T8xS 154 6,51332
jOpD 155 7,01714
A0A6J5M9L1 154 7,79953
A0A6J5N345 135 8,85701
A0A6J5NEN9 151 7,87032
A0A6J5NI22 155 7,86632
A0A6J5PUN8 151 6,97587
A0A6J5PYI3 155 8,92644
A0A6J7WPB5 155 6,21326
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1343
17yCd
10947 51,0% 137 2.238E-51
2 phalp2_29699
1f2kx
4387 57,3% 129 3.804E-50
3 phalp2_21393
3LPeU
1997 45,0% 140 6.644E-43
4 phalp2_30101
38zFk
347 47,6% 126 5.451E-38
5 phalp2_21836
4mpYp
274 40,8% 159 1.915E-37
6 phalp2_23143
4GhkT
33 42,6% 129 6.725E-37
7 phalp2_38153
6WaM5
59 47,0% 134 1.135E-35
8 phalp2_31042
1Db9K
217 42,7% 131 2.618E-34
9 phalp2_5035
1nwkL
67 38,9% 172 2.618E-34
10 phalp2_29625
84TVx
40 40,9% 144 3.584E-34

Domains

Domains
PET_M15
Disordered region
Protein sequence: GcsZ
1 189
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
GcsZ
Method AlphaFoldv2
Resolution 74.35
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50