Protein

Protein accession
1L4QJ [EnVhog]
Representative
40Wqz
Source
EnVhog (cluster: phalp2_25240)
Protein name
1L4QJ
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
VNVRYSPYIAAKVVLQAAKWVGLTEVRNNAEWDDLSTSHKDTLAAEFKGELLRVGWQSGWPYCAAFCETVWRRAYQGRAEVGDVSTMLTPGCLVSYANATKLGWTSTTPYVGSIGIMRLGESSKGHAFIVTGLKGLRMATIEANTSPEAGSPMEDREGDGVYAKHRTLVFKPTPGLHLIGFINPCVYV
Physico‐chemical
properties
protein length:188 AA
molecular weight:20531,2 Da
isoelectric point:8,36
hydropathy:-0,10
Representative Protein Details
Accession
40Wqz
Protein name
40Wqz
Sequence length
182 AA
Molecular weight
19478,09720 Da
Isoelectric point
8,78428
Sequence
MNSQKIIDVASQYVGLYEIVPNAKWTSKTAANPSALSDKLLSFLDASGWEPGWPYCMAFVEAVYSEAFASDPKALKKIKQLLSPSVMSTYKACKPFITKTPTPGAIFVMQKGNGGFGHAGIVVAQIAKDKFSTIEGNTSPAPASAEADRNGDGIYAKSRKLAFENNSGLHLIGFIDFSSIMA
Other Proteins in cluster: phalp2_25240
Total (incl. this protein): 75 Avg length: 197,1 Avg pI: 8,59

Protein ID Length (AA) pI
40Wqz 182 8,78428
11hP7 193 8,42855
11jDc 195 9,26773
11jkx 188 8,86216
13iCa 177 9,15743
13iuZ 181 9,29945
18bpR 193 8,59146
1IVSW 188 6,89533
1LSm5 193 7,84305
1O7Km 187 8,74212
1godS 178 9,12835
1iiA2 190 8,95648
1invN 188 7,69261
1kS8f 193 8,95551
1pDKs 193 9,25149
2ZYDx 193 6,96126
2aJzc 254 6,59624
2azeE 191 9,49995
2euEm 232 6,83127
2lIm 185 8,22580
47zmc 193 6,41652
49PqA 193 9,29442
4BU5U 193 8,59023
4RxyV 200 9,27831
4YAgn 193 9,14905
4YFDf 195 9,26773
4gSLU 188 6,31142
4gawL 253 8,41643
4gdVx 186 8,60016
4obKL 202 8,98813
4rR6E 193 9,25181
4vYQP 178 8,37156
53F8R 193 9,43587
53J1v 195 9,26773
53LUL 247 8,57147
541dl 179 9,25097
553Xo 195 8,84198
55Dwo 193 9,29500
55bMH 193 8,76656
55tVv 193 8,73774
5BQM8 192 8,88163
5CnEj 180 9,14479
5DLvj 193 8,95551
5c1d6 247 8,57296
5eUFq 247 8,59159
5fSYd 193 8,73774
5h5QS 193 9,13403
5iO9P 193 8,95551
5kR1z 247 7,68710
5lXRq 193 9,54914
5lXWS 193 9,13403
5mxy0 140 9,17077
5wfUt 193 9,25097
5xw7j 193 8,73774
6A0NO 193 8,95551
6Dcdr 177 9,18560
6PYTK 193 9,50427
6SvFm 237 8,59752
6Tv3d 203 5,08069
6U01D 198 8,62066
6U4iA 187 7,66595
6UntE 195 9,36289
6xA1p 193 9,17741
6yB9a 252 6,38043
6yLSz 193 8,73774
6ys7u 193 8,73651
6yyVC 243 7,67380
6zoox 193 9,25149
71PYN 189 9,41479
71dXO 180 6,74255
7HPU8 170 9,42736
83fth 193 9,25194
8qFyY 196 8,41669
lKyM 193 9,09895
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2074
2WV8c
1 39,1% 179 1.625E-54
2 phalp2_35073
54O41
6 30,4% 184 3.364E-36
3 phalp2_30395
4LZwv
478 31,7% 186 5.648E-29
4 phalp2_5100
1J6h1
99 28,5% 175 1.548E-23
5 phalp2_12082
5lAC2
5 26,5% 188 7.966E-21
6 phalp2_15890
4BLoi
4 26,8% 175 7.058E-20
7 phalp2_27447
7EjDS
7 30,5% 190 1.316E-19
8 phalp2_5073
1zCAp
2 27,0% 181 2.602E-17
9 phalp2_28163
fkdY
23 28,3% 155 9.008E-17
10 phalp2_743
1NQdC
55 31,6% 120 1.675E-16

Domains

Domains
Representative sequence (used for alignment): 40Wqz (182 AA)
Member sequence: 1L4QJ (188 AA)
1 182 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40Wqz) rather than this protein.
PDB ID
40Wqz
Method AlphaFoldv2
Resolution 93.25
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50