Protein

Protein accession
1EfeJ [EnVhog]
Representative
6IdaF
Source
EnVhog (cluster: phalp2_24757)
Protein name
1EfeJ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSVSPATQHKAEIIARLILHDKAEYDKVEAQTGAPWHWVGIIHYREADLNFKTHLANGDPLGKPTIHTPKGLLARTWDEGAVQALKKDGDLNRHDWGSFARYAYQMEAYNGWGYRGRIPSPYLWAGAADQPKGKFVSDGHFNPSKADDQLGGLVVLRSLMALTPITFTDMIPIKSPDGVIKTAVNILASVFSKGSD
Physico‐chemical
properties
protein length:196 AA
molecular weight:21594,3 Da
isoelectric point:7,95
hydropathy:-0,35
Representative Protein Details
Accession
6IdaF
Protein name
6IdaF
Sequence length
229 AA
Molecular weight
25948,00070 Da
Isoelectric point
6,78182
Sequence
MAIPEAHRARADAVAKKIIDDLSRYQKVWLQTGVPWYWVAPIHYREADLSWHGHLANGDPLTHRTVHVPAGRIPHKPPPYTWEEAAADALECRHLHHIKTWDVAQFAYQAEGYNGWGYRHHHINSPYLWSGCNHYSRGKFVRDGVFDRHRADEQLGVMVILRRLMVLDPTVVFPAASSYEGPTLPPPTVALPRPSAEPADASNQAEGVAPQESLLHKLLRRAEDLIHDA
Other Proteins in cluster: phalp2_24757
Total (incl. this protein): 3 Avg length: 224,7 Avg pI: 6,96

Protein ID Length (AA) pI
6IdaF 229 6,78182
6Y3Of 249 6,13664
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14048
7ZUcR
275 44,6% 197 1.409E-55
2 phalp2_37723
4fdtE
11 46,8% 175 8.356E-54
3 phalp2_38823
1uIEy
17 44,2% 217 2.377E-51
4 phalp2_3770
5Ib0w
2 44,5% 166 1.142E-50
5 phalp2_10683
2X3S5
169 41,0% 190 2.907E-47
6 phalp2_39991
1NEK5
15 48,7% 154 5.442E-47
7 phalp2_14953
lENm
6 45,5% 167 1.711E-45
8 phalp2_29480
1J3dZ
2 38,8% 180 5.893E-42
9 phalp2_2649
6UfrA
143 37,1% 194 1.311E-37
10 phalp2_37253
3Nsfn
6 41,8% 153 1.532E-33

Domains

Domains
Unannotated
Representative sequence (used for alignment): 6IdaF (229 AA)
Member sequence: 1EfeJ (196 AA)
1 229 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6IdaF) rather than this protein.
PDB ID
6IdaF
Method AlphaFoldv2
Resolution 83.68
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50