Protein

Protein accession
4kU4z [EnVhog]
Representative
4kTKJ
Source
EnVhog (cluster: phalp2_10863)
Protein name
4kU4z
Lysin probability
95%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSLHRSIAPIDSRRPWWHGTGPYFLILLTTLAIFLAINQTLKNDYKEKLDIQREEIARLQLEAENLQYQKEILEEALLRRIQEVNGLLDKINQMRPDLTVTDEERMLLEKLVTAEARGEDYEGMLAVANVVINRVASESFPSTINGVITQPGQFCPVRTGSIYSMEPDESARKAVADALKGYQVVDGALFFYNPKVVSHGHWIRTRTTITDIGNHRFAL
Physico‐chemical
properties
protein length:219 AA
molecular weight:24939,2 Da
isoelectric point:5,78
hydropathy:-0,25
Representative Protein Details
Accession
4kTKJ
Protein name
4kTKJ
Sequence length
218 AA
Molecular weight
25061,39290 Da
Isoelectric point
5,78879
Sequence
MRVEKPQFFNRRPWWHGTGPYFLLILTILALSLAINTTLKNEYKALLQEQREDVARLQLEVENLQHQKEILEETLLRRIQEVNDLIYKFDSMRPDVTVTDEELELLERLVTAEAQGESYEGQLAVANVVIDRTLSPAFPDTIKAVILQPGQFCPVAKGIINTITPSDTARQAVADALKGHRIIEENALYFYNPRIVSRGHWIRTRQTVTDIGNHRFAL
Other Proteins in cluster: phalp2_10863
Total (incl. this protein): 2 Avg length: 218,5 Avg pI: 5,79

Protein ID Length (AA) pI
4kTKJ 218 5,78879
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7898
8IIiG
19 45,9% 135 1.297E-27
2 phalp2_28980
6cvs6
6 38,0% 176 2.908E-26
3 phalp2_3197
7nFFO
87 34,3% 163 7.090E-15
4 phalp2_6393
4aJT
3 30,7% 195 9.222E-13
5 phalp2_15434
3JPpe
5 30,9% 171 1.039E-11
6 phalp2_15462
87Yrl
67 31,7% 151 3.475E-11
7 phalp2_35967
5hWAV
5 32,5% 135 4.698E-11
8 phalp2_36491
1kbDK
1 32,7% 186 6.350E-11
9 phalp2_15448
441CI
8 25,1% 203 3.037E-05

Domains

Domains
Unannotated
Hydro_2
Representative sequence (used for alignment): 4kTKJ (218 AA)
Member sequence: 4kU4z (219 AA)
1 218 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4kU4z
Method AlphaFoldv2
Resolution 89.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4kTKJ) rather than this protein.
PDB ID
4kTKJ
Method AlphaFoldv2
Resolution 90.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50