Protein

Protein accession
4eFEj [EnVhog]
Representative
4eFEj (this protein)
Source
EnVhog (cluster: phalp2_10838)
Protein name
4eFEj
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTTPIINAPTCAQDRAAAYVLMQPKHGSYTAREIDDIARAYWTHARACGVDPCVAWAQMLHESGALSSWWSQPPRHNPAGIGVNGATSDAQPVGAWAFDGALWRAGCSFPGGWAAHAIPAHLGRLVAYAVKPHDRTAAQAAYVLMAEAARVLPFQLHGRCETVESLGGVWAKDAGYGGRLVTLLGIMRGT
Physico‐chemical
properties
protein length:190 AA
molecular weight:20215,8 Da
isoelectric point:8,33
hydropathy:-0,01
Other Proteins in cluster: phalp2_10838
Total (incl. this protein): 37 Avg length: 194,1 Avg pI: 9,02

Protein ID Length (AA) pI
14jZy 176 9,00058
16MR6 199 8,91335
1JGBS 182 8,96899
1R10R 227 10,50005
1Xr3F 208 8,82574
1cX4Y 199 10,06476
1eCCE 176 9,24981
1omY9 190 8,53582
1onCp 202 8,96886
2HPq7 184 8,83083
2TgXf 211 7,70711
2YHDQ 149 5,37057
2YHjs 196 9,75956
3NPH5 189 9,24117
3hfQR 189 9,26934
42i5i 189 9,21622
49Grj 205 8,94127
4YKe7 215 9,73590
4aEhu 201 8,33326
4eHiq 206 9,80269
4eJg0 204 9,05080
55ZKr 201 9,17148
5kzsP 174 9,22112
5uHfT 195 9,02997
5vjBU 194 9,25587
5ztrH 183 9,09657
6LAky 184 8,84798
6XnPE 201 9,37668
6xpoX 177 8,75315
80JXd 203 9,48538
8aCZj 192 9,37275
8o81l 191 10,24262
8oJnl 222 7,83421
G12g 184 9,22641
IRnm 202 8,97440
btQR 191 9,06105
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13498
4QRrS
2 46,2% 134 5.052E-43
2 phalp2_23917
1NSCq
2 26,4% 193 6.296E-36
3 phalp2_14514
4Tllk
1 27,1% 162 2.433E-17
4 phalp2_27962
Oa8x
2 20,5% 195 4.051E-12
5 phalp2_15109
f6dA
5 23,0% 191 3.961E-10
6 phalp2_8443
134xp
24 25,1% 187 9.870E-10
7 phalp2_2689
74qd1
13 21,3% 187 1.813E-09
8 phalp2_22098
629mC
1 19,5% 194 5.072E-08
9 phalp2_34718
35Hpg
2 24,4% 188 3.094E-07
10 phalp2_9098
5F0fs
35 29,4% 146 4.179E-07

Domains

Domains
Unannotated
Protein sequence: 4eFEj
1 190
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4eFEj
Method AlphaFoldv2
Resolution 96.81
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50