Protein

Protein accession
4EiIi [EnVhog]
Representative
4EY3t
Source
EnVhog (cluster: phalp2_10923)
Protein name
4EiIi
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MINTRPLRAGGRERSEAVTFSGTYTLLADISEYEPNIVDAVYLAWSHAVVIRAAYGDQHDDQAWYGGARRAQLHKLGAKFVGIYQYLVAGQDGAAQADAMHRLVGALQPGEVLIADFEEATAAEDQVGTHQMLTAWYNRMVALGYPAKYLWTYTGLYFGQAHGVLPVQWLADYNGVEPSSPHILWQFTGNFNMPGIGYGDCSIFRGTVDELAALAYQAPAPPVPSTAPAVSDGRIVSVSYNDATVAWTGHNAARFNCKINGPGPIDGQTNTVSIPQASYSGLESGHTYTVTVVPIGADLHTEGAPGQITFVTK
Physico‐chemical
properties
protein length:313 AA
molecular weight:33655,2 Da
isoelectric point:5,25
hydropathy:-0,10
Representative Protein Details
Accession
4EY3t
Protein name
4EY3t
Sequence length
309 AA
Molecular weight
33785,26870 Da
Isoelectric point
5,28798
Sequence
VVAFRAGGTTPADLPRVVSEAGELTYLADVSEFQPNVADKTYLSWSKAICVRAAFGNAHDDGAWYNGARRKALHDGGVRFLGIYQFLVAGEDGTSQANALHDLVGPLEEGEVLIADFEQGTKPMLSAWYNRMLALGYPGKYLWTYTGLWFGEQQGALPVQWLADYTSVEPSGQHVLWQFTSNFSVPGVGTADCSVFHGTIDQLAALAYSTKPAPPPANQPTPAPGGTYQAVRPTGYQVTCSWHPVAGQTSYHFQIQWYKNGFGWVMYDDAHQGAVTTTLAVAPGTRYRWRVAAGVANYVWSDWEEFTTP
Other Proteins in cluster: phalp2_10923
Total (incl. this protein): 12 Avg length: 314,0 Avg pI: 5,63

Protein ID Length (AA) pI
4EY3t 309 5,28798
2Su8s 353 9,00606
3AxdL 311 4,75796
4ECdU 326 4,88272
4Eivv 325 6,03604
4K4eZ 311 5,78691
4j4HK 271 5,11991
4kuRA 322 5,75008
5ok7p 279 4,47149
6TJF0 325 6,29608
872AG 323 4,91995
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19875
5GUq8
18 57,2% 222 9.207E-93
2 phalp2_1978
4EA8c
6 35,1% 222 1.282E-36
3 phalp2_9690
23WOG
27 33,6% 205 7.029E-35
4 phalp2_23128
4EkZf
1 34,3% 227 1.301E-34
5 phalp2_16973
870CF
1 38,0% 192 6.052E-34
6 phalp2_39507
5nLHO
1 32,3% 235 2.130E-24
7 phalp2_19688
4Ezja
7 30,4% 220 8.913E-22
8 phalp2_39912
1jXPT
300 28,9% 214 6.359E-15
9 phalp2_18049
3Aq5m
1 24,5% 212 6.663E-14
10 phalp2_39009
8i5mw
7 25,2% 218 1.289E-10

Domains

Domains
Disordered region
Unannotated
Unannotated
Representative sequence (used for alignment): 4EY3t (309 AA)
Member sequence: 4EiIi (313 AA)
1 309 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4EiIi
Method AlphaFoldv2
Resolution 92.10
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4EY3t) rather than this protein.
PDB ID
4EY3t
Method AlphaFoldv2
Resolution 88.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50