Protein

Protein accession
4iBMP [EnVhog]
Representative
4i3pC
Source
EnVhog (cluster: phalp2_18141)
Protein name
4iBMP
Lysin probability
99%
PhaLP type
endolysin
Probability: 73% (predicted by ML model)
Protein sequence
MADIQTFNTSHIYSINDDTLRSIVQLLWLWKAKDGIKIVQRAINGLGGNITIDGWFGAESVQAINSLNPNGLESRLSKEINYLKENRDPYYITIAKQEIGVKETKGKKHNKRVVQYHSTTYGKYKNDEVPWCGSFINWVMLQSGVTKTVAYPERAKAWIDFGFGLHQPTHGAIAIKNRSGGGHVCFVVGKSENGKYLYCLGGNQGDAVTVKKYKKSIFTNFRLPFGQEYIALDTYTGHHASSTSEA
Physico‐chemical
properties
protein length:246 AA
molecular weight:27530,9 Da
isoelectric point:9,31
hydropathy:-0,42
Representative Protein Details
Accession
4i3pC
Protein name
4i3pC
Sequence length
182 AA
Molecular weight
20151,27280 Da
Isoelectric point
10,11478
Sequence
MKKEVKIKSAGTRVILSLLRESESLDIVQIALRELGKHLVVDGIIGPITRRAIESVEQKALYRTIEAIRKGQRLIPQSVTKEPITQSWVDVAVAELGVKEIRGGRSNPRVEQYHDAVGITWAKDDVPWCGSFVGFVLLKSGYKIPAKAYRALSWKSWAKSSHRPILGSVAVKSRKGGVVREG
Other Proteins in cluster: phalp2_18141
Total (incl. this protein): 8 Avg length: 227,5 Avg pI: 9,73

Protein ID Length (AA) pI
4i3pC 182 10,11478
2VeTd 239 9,81816
38zSn 241 10,14966
3cbSV 239 9,81520
3ccon 240 9,51864
3gp6W 240 9,68549
7Fg8a 193 9,40647
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6409
c01x
1 26,2% 198 1.163E-18
2 phalp2_28323
16UXy
9 27,9% 161 1.908E-17
3 phalp2_11155
5BBbB
16 29,3% 174 9.008E-17
4 phalp2_21029
9AcO
22 28,7% 174 1.999E-15
5 phalp2_12776
7ZYHD
4 26,1% 153 3.714E-15
6 phalp2_9646
1LfXa
7 25,1% 171 3.821E-13
7 phalp2_39271
47VoK
404 28,7% 153 7.107E-11
8 phalp2_20540
4muWr
288 23,6% 169 3.284E-10

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 4i3pC (182 AA)
Member sequence: 4iBMP (246 AA)
1 182 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4i3pC) rather than this protein.
PDB ID
4i3pC
Method AlphaFoldv2
Resolution 78.40
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50