Protein

Protein accession
4E3BF [EnVhog]
Representative
5kiq4
Source
EnVhog (cluster: phalp2_25844)
Protein name
4E3BF
Lysin probability
94%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTTAWHVNEGLAPLVGQLESAHPGMVIGTIGDKAHREEVSDHNPNAAGRVNAADLMLGKAFTEAAALALLPFLIADSRTHYVIHNRRIWTSETGAWEPYYGDDPHTNHVHESVKDSAHNNTAPWKITPRKVTMHKLDTAWPEIGQGDSDDQLDGYNVIYRLQKITGAAADGDWGPETTAQIAAWCKISKSKATKLTEAIYRQVMGLTNPG
Physico-chemical properties
protein length: 210 AA
molecular weight: 22985.5 Da
isoelectric point: 5.97
hydropathy: -0.47
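The length and hydropathy values can be checked directly against the sequence listed above. The sketch below assumes that "hydropathy" means the GRAVY score, i.e. the mean Kyte-Doolittle hydropathy over all residues; the record does not state which scale it uses, so that is an assumption. The names `KD`, `SEQ` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4E3BF as listed in this record.
SEQ = (
    "MTTAWHVNEGLAPLVGQLESAHPGMVIGTIGDKAHREEVSDHNPNAAGRVNAADLMLGKA"
    "FTEAAALALLPFLIADSRTHYVIHNRRIWTSETGAWEPYYGDDPHTNHVHESVKDSAHNN"
    "TAPWKITPRKVTMHKLDTAWPEIGQGDSDDQLDGYNVIYRLQKITGAAADGDWGPETTAQ"
    "IAAWCKISKSKATKLTEAIYRQVMGLTNPG"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


print(f"length: {len(SEQ)} AA")
print(f"GRAVY:  {gravy(SEQ):.2f}")
```

Under this assumption the script gives a length of 210 AA and a GRAVY of about -0.47, in line with the values listed in the record.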
Representative Protein Details
Accession
5kiq4
Protein name
5kiq4
Sequence length
253 AA
Molecular weight
27650.48250 Da
Isoelectric point
4.98719
Sequence
MAVPWVVDRGLNKLLEQINAAAPGRSKVSDGSIGDPAHQATESDHNPEHPPPSGNPDFQVDARDFTQDPAHNADMGVVSEAIRQSHDRRVSYVIFNRRIFSGLDGPQPWVWRPYSGTDPHTNHMHVSVRDSTHDQTQDWSIGIHAPTSPEPPMTLPDGFSTTFNGMVGEVDAMIYDKPKVVWGPLTGQENLLHTRLVTMEAKLDLVTNMLKDLSLASGLDPAALQTAFEAALAAKMSEIADAVVDEEHQRLGA
Other Proteins in cluster: phalp2_25844
Total (incl. this protein): 4; Avg length: 235.5 AA; Avg pI: 5.22

Protein ID    Length (AA)    pI
5kiq4         253            4.98719
5H0FJ         248            4.78047
6DkkF         231            5.15623
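The cluster averages include this protein (4E3BF), which is not repeated in the member table. A minimal sketch of that arithmetic follows, taking the length and pI of 4E3BF from its physico-chemical properties in this record. The dictionary name `members` is illustrative, and a plain unweighted mean is assumed.

```python
from statistics import mean

# (length in AA, isoelectric point) for every cluster member, including 4E3BF.
members = {
    "4E3BF": (210, 5.97),
    "5kiq4": (253, 4.98719),
    "5H0FJ": (248, 4.78047),
    "6DkkF": (231, 5.15623),
}

avg_length = mean(length for length, _ in members.values())
avg_pi = mean(pi for _, pi in members.values())

print(f"Total: {len(members)}")
print(f"Avg length: {avg_length:.1f} AA")
print(f"Avg pI: {avg_pi:.2f}")
```

With these four entries the unweighted means come out at 235.5 AA and 5.22, matching the summary line.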
Similar Clusters (pHMM search)
#   Cluster                  # Members   Identity (%)   Alignment Length   E-value
1   phalp2_13863 (7tSrJ)     5           43.2           155                6.328E-33
2   phalp2_39293 (4f6EK)     17          40.0           155                5.064E-29
3   phalp2_24512 (4OaDs)     63          36.6           169                3.889E-21
4   phalp2_4027 (15vGm)      7           36.6           169                1.517E-19
5   phalp2_27322 (4qzt4)     17          30.2           175                1.083E-11

Domains

[Domain architecture figure. Tracks shown: Unannotated; Disordered region. Axis: residues 1 to 253 (representative).]
Representative sequence (used for alignment): 5kiq4 (253 AA)
Member sequence: 4E3BF (210 AA)
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4E3BF
Method AlphaFoldv2
Resolution 84.01
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
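
The confidence bands above can be expressed as a small lookup. This is only a sketch: the record uses strict inequalities, so assigning the exact boundary values 90, 70 and 50 to the lower band is an assumption, and `plddt_band` is a hypothetical helper name. Applying it to the "Resolution" figures in this record further assumes that those figures are mean pLDDT scores, which the record does not state.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in this record.

    Boundary values (exactly 90, 70, 50) fall into the lower band; that choice
    is an assumption, since the record only gives strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Assumption: the "Resolution" fields of the two models hold mean pLDDT values.
for model, score in {"4E3BF": 84.01, "5kiq4": 76.21}.items():
    print(model, plddt_band(score))
```

If that reading of the "Resolution" field is right, both models fall in the "High" band.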

The structures below correspond to the cluster representative (5kiq4) rather than this protein.
PDB ID
5kiq4
Method AlphaFoldv2
Resolution 76.21
Chain position -