Protein

Protein accession
4A1r9 [EnVhog]
Representative
4A1r9 (this protein)
Source
EnVhog (cluster: phalp2_16178)
Protein name
4A1r9
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MYQITFKPSPNKWTGRNGYQPIAIVNHRMVGYLPGTDATFANPDRDVSAHFGIGYRTAGGPVRISQYVDLSDSAWGNGNYDPSGGWTLIKKTENGTVINPNYYTVSIEHEDGGPNDGVVTQAVKDASCWLQTILLSGSIDLMRYVGIQIRENTTAVALGNIIPVTETIIDHNRISGRLKPYCWRPYKLDTSGFPGWQPILIQQLRGSTMAIQDTLNLLEQQITELENQRNAAIAEALVANNKLSQAKDKAIEIKTLSTQISTKADEIVAL
Physico‐chemical
properties
protein length:270 AA
molecular weight:29736,2 Da
isoelectric point:6,10
hydropathy:-0,31
Other Proteins in cluster: phalp2_16178
Total (incl. this protein): 27 Avg length: 234,6 Avg pI: 7,89

Protein ID Length (AA) pI
14QzF 275 6,37264
2GFsc 275 9,39557
2TMxS 214 9,12577
2Ugiq 236 9,21809
2Uli6 217 8,68732
2Z4kd 249 8,98465
2bd1t 268 5,56763
3QEVF 237 7,09268
3xPK5 234 9,36592
3xRQn 214 9,23531
3xTxc 220 9,11481
4SElg 214 9,47990
4StZa 212 8,59378
4Sty0 214 9,33181
4zXKT 242 5,22103
6T47h 269 6,39697
6WhaC 238 6,30199
71Dd 222 8,70834
71zbu 255 5,56178
74qt 222 6,04650
7IOt5 218 6,16347
7Utjy 221 9,17980
87RN6 216 9,08245
8Zkn 230 8,91258
g3Zg 228 6,86401
qaTI 225 8,93604
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28639
8jnfC
92 36,2% 215 1.671E-62
2 phalp2_13794
6WRQm
7 31,5% 203 1.266E-51
3 phalp2_30952
173qx
118 37,7% 183 3.943E-50
4 phalp2_24509
4NPVU
68 31,2% 272 1.429E-44
5 phalp2_18213
4LQlA
5 34,6% 176 6.877E-32
6 phalp2_38112
6KhfV
6 31,4% 216 3.225E-31
7 phalp2_1931
4lGtr
11 29,5% 237 1.541E-28
8 phalp2_28537
40K7Z
71 30,8% 191 9.773E-28
9 phalp2_12587
1lnEh
1 33,6% 184 1.808E-27
10 phalp2_32182
6TIHm
13 31,3% 169 3.784E-22

Domains

Domains
Unannotated
Unannotated
Protein sequence: 4A1r9
1 270
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4A1r9
Method AlphaFoldv2
Resolution 94.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50