Protein

Protein accession
4rD69 [EnVhog]
Representative
30Vri
Source
EnVhog (cluster: phalp2_9955)
Protein name
4rD69
Lysin probability
98%
PhaLP type
endolysin
Probability: 89% (predicted by ML model)
Protein sequence
MNAVDILRNVENYFDRNHNFFMLWGGLFAALFFGLFVPYNMYSRAMVQLETQQNANVILAAQILDMNTRMEFLELSFDRKQKVMREVECLARNIYFEAGGEPRAGKIAVAEVTMNRVKSNQFPKTVCAVVHQKHKNICQFSWVCEGKRSVRNNNAWRESQRIAESILISKKRYGIIGNAKYFHATYVNPKWADESRMIAQIGNHIFYH
Physico‐chemical properties
protein length: 208 AA
molecular weight: 24255,7 Da
isoelectric point: 9,51
hydropathy: -0,27
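The page does not say how the hydropathy value is computed. As a minimal sketch, assuming it is a mean per-residue score on the Kyte-Doolittle scale (often called GRAVY), the calculation could look like the following. The function name `gravy` and the choice of scale are assumptions for illustration, not something this record confirms.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the record does not name the scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Negative values indicate a sequence that is hydrophilic on average. To compare against the value listed above, pass the protein sequence string to `gravy` and round to two decimals; agreement would depend on the scale assumption holding.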
Representative Protein Details
Accession
30Vri
Protein name
30Vri
Sequence length
226 AA
Molecular weight
26393,88230 Da
Isoelectric point
9,73365
Sequence
VHSNVYDRKGGRNMRTMKRLHQIIVTVALTVTVATRLGHAETQYLTISALDWQVSKTMKKVADDQQLAKTLNRNDLWPQPKNYHDTDKKTKPDNREVDCLAHNIYYEAGHEPTEGKIAVGLVTLNRVSDHQFPKTICAVVRQKTAGTCQFSWNCLRLRAPNYHDKLWQDSMRIAHELLRGDDKHDVYRIKYSNVMYFHNKKIKTDWKHRMTPVNSPGHNIFYRGRL
Other Proteins in cluster: phalp2_9955
Total (incl. this protein): 104; Avg length: 209,9; Avg pI: 9,22

Protein ID Length (AA) pI
30Vri 226 9,73365
15YUt 214 8,99593
15ebo 218 9,13364
15sZ3 208 8,93985
1O05H 208 9,14679
1QEtO 259 9,21223
1QKGb 209 9,32698
1QTD9 208 9,25162
1R6tF 227 8,49456
1Rbgi 208 9,33278
1WW7W 208 9,46662
1aqlA 208 9,13351
1grYZ 169 9,40924
1ikRW 208 9,25645
1jzgL 208 9,40943
1oIng 214 9,03333
1wRX7 212 8,99497
1wTZV 209 9,30035
1zCON 208 9,22847
1zEwV 208 9,25058
22Pmh 208 9,14705
274jM 212 9,35083
27fip 208 8,97337
2822G 214 9,15633
2W7TP 209 8,83296
3XBun 209 9,32698
3Yu1z 211 9,51362
3zCm1 209 9,18928
444Y8 209 9,07027
44yiK 209 9,08471
45SQi 208 9,60439
464UT 208 9,17232
468Dw 176 9,15756
468YL 240 9,39216
46RYI 208 8,92618
46voc 208 9,37843
477nv 208 9,36431
47bYM 167 9,14647
47lMa 202 7,10831
49TY6 209 9,15646
49hYv 208 9,16039
49kQY 212 9,12068
49wWX 208 9,20281
4A9EL 212 9,10869
4AaRF 213 8,86313
4BSVB 208 9,15640
4ITI7 208 9,51671
4Yy6N 209 9,45978
4aZQy 208 9,48802
4lqyf 208 9,50994
4necs 208 9,21680
4o7MN 204 9,35876
4qe3Z 209 8,93630
4rKta 208 9,69200
4rsFr 213 9,05492
4zUOp 212 9,17187
51ugw 208 9,10953
54K7H 208 8,96125
55Hft 209 9,66473
56oEL 217 9,25929
57tFB 209 9,24330
57vlQ 212 9,13364
5BGiX 212 9,13364
5Cixt 213 9,02920
5D1fm 213 9,42684
5Ds0P 209 9,14705
5a5JB 208 9,08445
5ac0l 208 9,34871
5ctOP 208 9,51310
5dpOW 234 9,30061
5eX87 209 9,60406
5ew8g 208 9,48802
5fND9 208 9,44605
5jFNC 209 9,16381
5mXi4 212 9,35302
5md0c 208 9,46726
5migH 209 9,12571
5mlzR 209 9,12545
5mmIz 200 9,08490
5oieB 217 9,28707
5yPHn 208 9,36211
6A0Co 234 9,58795
6AgAE 214 9,02217
6Aw3B 202 9,35883
6GLmj 208 9,04577
6VSr6 214 9,13409
6xpec 217 9,17129
6yRbs 208 9,23163
6ySbq 204 9,37494
7UhIP 210 9,69200
7X6aj 208 9,44605
7XVWI 209 9,23672
87fOH 209 9,23698
BmCr 209 9,15195
FnUx 208 9,48751
Jafv 214 9,08458
Jv0r 214 9,14685
L0yA 209 9,11082
L4oV 208 9,25116
M3Hg 209 9,07027
Pdck 223 9,38249
Qw3q 172 9,24517
So4i 235 9,33530
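The member rows above use a decimal comma for pI, which most parsers will not read as a number directly. A minimal sketch of how such rows could be parsed and summarised follows; the helper names `parse_member_row` and `summarize` are hypothetical, and only the first three rows of the table are used as sample input.

```python
from statistics import mean


def parse_member_row(row: str) -> tuple[str, int, float]:
    """Split a 'ProteinID Length pI' row and convert the decimal comma in pI."""
    protein_id, length, pi = row.split()
    return protein_id, int(length), float(pi.replace(",", "."))


def summarize(rows: list[str]) -> dict:
    """Return member count, average length and average pI for the given rows."""
    parsed = [parse_member_row(r) for r in rows]
    return {
        "total": len(parsed),
        "avg_length": mean(length for _, length, _ in parsed),
        "avg_pi": mean(pi for _, _, pi in parsed),
    }


# Sample input: the first three rows of the cluster table.
sample_rows = ["30Vri 226 9,73365", "15YUt 214 8,99593", "15ebo 218 9,13364"]
summary = summarize(sample_rows)
```

Running `summarize` over the listed rows would not necessarily reproduce the reported averages exactly, since the stated total includes this protein, which is not listed as a row.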
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_15462 (87Yrl) 67 42,9% 149 4.353E-46
2 phalp2_582 (Vnfh) 29 35,6% 146 3.006E-37
3 phalp2_8528 (1rrcd) 232 31,2% 144 1.137E-34
4 phalp2_6086 (543gw) 128 32,8% 192 3.963E-34
5 phalp2_13600 (5lmUm) 9 33,5% 167 5.827E-32
6 phalp2_27849 (8yD71) 6 24,0% 191 1.087E-31
7 phalp2_38710 (V6TC) 7 33,7% 145 7.510E-29
8 phalp2_45 (7E8g0) 9 30,9% 152 2.409E-25
9 phalp2_24620 (5m183) 3 28,7% 153 2.409E-25
10 phalp2_33558 (8Ba1I) 1 24,7% 182 1.546E-24

Domains

Annotated regions: Disordered region, Hydro_2
Representative sequence (used for alignment): 30Vri (226 AA)
Member sequence: 4rD69 (208 AA)
Axis: positions 1 to 226 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

Phage: Unknown from Metagenome; Taxonomy ID: UNKNOWN_ENVHOG; Lineage: no lineage information
Host: no host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (30Vri) rather than this protein.
PDB ID: 30Vri
Method: AlphaFold v2
Resolution: 80.74
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
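
The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend leaves the exact boundary values (90, 70, 50) unassigned, placing each of them in the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    # Assumption: a score of exactly 90, 70 or 50 falls into the lower band,
    # since the legend only gives strict inequalities.
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"
```

For example, `plddt_band(95)` returns "Very high" and `plddt_band(60)` returns "Low".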