Protein

Protein accession
4ECpk [EnVhog]
Representative
16c2g
Source
EnVhog (cluster: phalp2_15270)
Protein name
4ECpk
Lysin probability
76%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKGCDFESGPTPAELLEAGIFFVCRYACSLPNGKALTRAEANGYTEAGMGIVTVWEDQAQAALGGAAQGMADGVGAAAFARALGQPPGSAIYVAFDFDVSQAQISVCLDYLRAFRSVVAESGYVGGPYGGILIVNEAADEDLVQCYYWQTEAWSNGQVSRFADLLQLANPTYIAGKQVDIDEVVNNVTRFGAWNYDGLWPKIVPKPPTPKPPFPPIPEEIKDMAPTVTFDSEGNAYVAGVSTVPNNEGHLLVFKNPKGTGSWLATDVTDQIHHEAPNAALYTIES
Physico‐chemical
properties
protein length:285 AA
molecular weight:30417,8 Da
isoelectric point:4,40
hydropathy:-0,06
Representative Protein Details
Accession
16c2g
Protein name
16c2g
Sequence length
289 AA
Molecular weight
29825,69480 Da
Isoelectric point
4,38453
Sequence
VLGVDAAGHPDPSSCVQAGYSLVGQYLGGANATSPAYVQQVTEANAGLFSIWEVGARAAANGAGQGVIDAQAALTRARALGQPKGSVIYFTADFQPASDEMANVVGYFRATSTAVRQDGYLGGAYGGTETLNAVQGIVDVGWQSNAWTNGVQLPWVAMRQRLQQTTVAGTTCDIDDVINPPVGAWNLNGLWPSDPPTPPNVYPGDHVQSQTVEVVISGGHGWFASPVNAANIVSVAALTENPDVVGRYDHVPVSWAMATSPGPNSPNGAITVTGPADGTWGLIVWSVTP
Other Proteins in cluster: phalp2_15270
Total (incl. this protein): 2 Avg length: 287,0 Avg pI: 4,39

Protein ID Length (AA) pI
16c2g 289 4,38453
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8820
2pMjJ
148 32,3% 226 2.887E-35
2 phalp2_14466
4IpVT
1 40,3% 196 2.509E-34
3 phalp2_23126
4EcjY
7 29,3% 215 1.030E-30
4 phalp2_19051
1dYaD
7 27,0% 277 2.589E-28
5 phalp2_15053
7qmsB
8 29,3% 215 4.074E-27
6 phalp2_4951
6F0j3
21 30,4% 194 1.880E-26
7 phalp2_12781
8asOL
12 28,3% 261 4.701E-26
8 phalp2_12671
1ZrhB
6 31,0% 200 6.379E-26
9 phalp2_20415
3dkyH
32 28,9% 297 6.379E-26
10 phalp2_2850
giZX
26 30,4% 194 2.934E-25

Domains

Domains
Rv2525c
Disordered region
Representative sequence (used for alignment): 16c2g (289 AA)
Member sequence: 4ECpk (285 AA)
1 289 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08924

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4ECpk
Method AlphaFoldv2
Resolution 83.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (16c2g) rather than this protein.
PDB ID
16c2g
Method AlphaFoldv2
Resolution 75.73
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50