Protein

Protein accession
C8Su [EnVhog]
Representative
4GkAF
Source
EnVhog (cluster: phalp2_21895)
Protein name
C8Su
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKYLVGAAPVTTSELPGMHEFAKQVCKYSDKSLWNNGTYVMRDVRNKPGVISNHAKSLALDASYRFSKINGRKDGRAISLPYIYKLLEHADQLGIQLVIDYAFQRSWKCDRRTWIKGTFEISDWYHIELNPHFAADPELVKTGWAMVFGASPITAPPLM
Physico‐chemical
properties
protein length:159 AA
molecular weight:18080,6 Da
isoelectric point:9,21
hydropathy:-0,28
Representative Protein Details
Accession
4GkAF
Protein name
4GkAF
Sequence length
121 AA
Molecular weight
13593,53780 Da
Isoelectric point
9,35928
Sequence
VRDVRGKPGIVSNHAKGVAVDLSYRWQSEKKKGLQNGRKVSLAYMIKLLEHADTLGIQLVIDYALNRSWRCSRGTWIAGTFESGDWYHIEVDPVMCNSPELAKQAWDKVFGVIPAVIKNPV
Other Proteins in cluster: phalp2_21895
Total (incl. this protein): 131 Avg length: 141,8 Avg pI: 9,17

Protein ID Length (AA) pI
4GkAF 121 9,35928
15nsr 178 9,80166
18G4s 163 9,40376
19O9b 163 9,45418
19Xtz 144 9,32704
1DvCy 126 10,24991
1JC87 118 9,20385
1JIOl 118 9,02972
1JJnP 118 9,51175
1jxtP 162 9,64758
1mQzN 169 9,79966
1nBRZ 162 9,59575
1qeYV 178 9,85382
1sM3K 131 9,57841
1tLZQ 123 9,55159
1vESM 118 9,29894
1yA2A 152 9,48228
1yDg5 158 9,82319
1ycmX 118 9,03036
1yi5x 144 9,50852
1yiEG 84 7,90848
1zm5p 144 9,54521
20wdz 162 9,48809
27UrS 162 9,57428
27Wm4 162 9,48151
27ZjN 159 9,46230
2RrOB 176 9,72443
2Ybzz 172 9,77600
2YeGC 162 9,64758
2ZkpC 118 9,72249
2qih3 162 9,42716
30md1 162 9,40292
30odM 140 9,41969
31FQp 162 9,59620
31nZ5 145 9,57737
34a0F 162 9,48809
36NxZ 162 9,40292
3zCta 117 10,68178
40nhM 155 9,24955
45V1K 118 9,63978
45XfX 180 9,59143
46VXM 128 8,95061
49lRH 175 9,71089
4AroF 164 9,51129
4CdBn 174 10,42565
4DICT 114 9,79360
4G8sr 176 9,77716
4GEn1 103 7,79598
4Gh66 162 9,45340
4GjYL 91 7,80340
4NDl6 141 8,95016
4NHKj 132 9,99507
4NKsx 158 9,99113
4NNIY 145 9,56880
4Np0z 94 5,42479
4NrrE 100 7,92544
4Nsmy 103 8,61983
4OLWK 103 6,81513
4QTaO 144 9,48776
4UZBG 77 5,14543
4UZw9 144 9,35683
4acDq 148 9,67956
4glzn 159 9,05634
4jtSP 80 5,52841
4jtoe 159 9,34316
4nEps 107 9,00741
4oava 146 9,51310
4ob6E 116 6,73419
4puyQ 103 7,80353
4rHik 146 9,51806
509vK 122 9,23041
53jUN 118 9,19817
57Ezg 163 9,45372
59XD3 180 9,56300
5C53k 132 10,06798
5HjOk 112 7,79631
5aunQ 175 9,74216
5clhA 103 7,79598
5cnfi 123 9,91106
5duTL 175 9,93073
5j8Fx 119 7,73052
5mAlV 125 9,24736
5nrHO 162 9,59665
5uGQy 179 9,17232
5uMXP 156 9,71398
5vGQP 130 9,26432
5vyqK 103 7,80340
5wIWT 179 9,42684
6LTNs 103 7,81139
6LqS2 145 9,51845
7VxWn 168 10,18177
7W6Aj 89 7,90197
81rwd 162 9,48809
81sKl 162 9,57647
83DUm 107 8,71137
83ELL 159 9,36624
83FE3 121 9,42001
83Fhl 118 9,51175
83Fza 121 9,51839
84EFe 162 9,62650
87JvA 162 9,64449
87JwM 162 9,48809
88rTQ 158 9,78812
8EPOw 117 6,57214
8mMTD 178 9,80127
8nTUv 162 9,67795
8oj0L 96 9,26876
8rUl9 178 9,80127
8rsan 175 9,45811
8s9VC 163 9,48080
8sFcm 163 9,60026
8sG1f 163 9,45302
8sa3g 162 9,61490
8scc7 163 9,45340
8sdij 163 9,48080
8srIP 89 9,20559
9bP7 178 9,71037
C7VV 118 9,48796
C8Hu 163 9,32414
C9eE 145 9,19450
CfEX 145 9,19450
CfVW 176 9,98675
LhMs 118 9,24846
SZdV 162 9,40292
SbRn 103 7,80340
aGBb 176 10,00254
jQNr 162 9,56313
jXGW 144 9,51310
sLoU 77 5,52881
xcQ1 104 8,66083
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1708
2iIeV
1834 52,8% 121 5.732E-60
2 phalp2_36358
qD4l
947 30,5% 118 1.908E-20
3 phalp2_4433
2R1uw
426 27,0% 133 2.616E-20
4 phalp2_27264
46ZD7
137 29,9% 137 2.710E-17
5 phalp2_36082
6xZKL
58 25,4% 118 1.518E-08

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4GkAF (121 AA)
Member sequence: C8Su (159 AA)
1 121 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4GkAF) rather than this protein.
PDB ID
4GkAF
Method AlphaFoldv2
Resolution 93.62
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50