Protein

Protein accession
5EEm2 [EnVhog]
Representative
1IRN9
Source
EnVhog (cluster: phalp2_11666)
Protein name
5EEm2
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
VKSKVLKQFLLSKTAKVALQVALIAPTPILVGLTASGYVTFKPRVVKTEVIRPVEVVKYRDVDIGGRQVTAHNLDCATQRHDELVCLTCNIYEESRNQPLAGQILVAKATMNRAQATGDSVCKTVWKRKQFSWTNLPLHKKPVKELEPWRKALEVAHLVMQEYGKAKPSDVNVIVMAGGQASDDIKWYHTQEVSPKWNKDLQKVTVIGDHIFYKKS
Physico-chemical properties
protein length: 216 AA
molecular weight: 24287.1 Da
isoelectric point: 9.61
hydropathy: -0.26
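As a rough cross-check of the listed properties, the sketch below recomputes length, average molecular weight, and hydropathy directly from the sequence. It assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle index) and that molecular weight is the sum of average residue masses plus one water; the page does not state which method it uses, so small differences from the listed values are possible. The function names `gravy` and `molecular_weight` are illustrative, not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da from residue masses plus one water."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


# Sequence of 5EEm2 as listed on this page, split into 60-residue chunks.
SEQ = (
    "VKSKVLKQFLLSKTAKVALQVALIAPTPILVGLTASGYVTFKPRVVKTEVIRPVEVVKYR"
    "DVDIGGRQVTAHNLDCATQRHDELVCLTCNIYEESRNQPLAGQILVAKATMNRAQATGDS"
    "VCKTVWKRKQFSWTNLPLHKKPVKELEPWRKALEVAHLVMQEYGKAKPSDVNVIVMAGGQ"
    "ASDDIKWYHTQEVSPKWNKDLQKVTVIGDHIFYKKS"
)

if __name__ == "__main__":
    print("length (AA):", len(SEQ))
    print("molecular weight (Da):", round(molecular_weight(SEQ), 1))
    print("GRAVY:", round(gravy(SEQ), 2))
```

The isoelectric point is left out on purpose: it depends on the chosen pKa set, which the page does not specify.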
Representative Protein Details
Accession
1IRN9
Protein name
1IRN9
Sequence length
195 AA
Molecular weight
22369.36960 Da
Isoelectric point
9.61045
Sequence
LNKSHFSVANWRYGVVSAALCLLFYFYLTTLQFFTTPEVVKRGAIVSGVVEKEELTKEGRISACKKDKECRLLAEIGYYESRNQKTKAAAVGPMFVALNRKEANGWANTLRGVVYQKWQFSYTHDGSLERGFKEKSAYERMLYLAHKVYSGDVKDPTNGALWYHTHQVSPGWSKKLKHVVTLGDHKFYKRVKNES
Other Proteins in cluster: phalp2_11666
Total (incl. this protein): 3   Avg length: 209.3 AA   Avg pI: 9.52

Protein ID   Length (AA)   pI
1IRN9        195           9.61045
4Lkxc        217           9.34580
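The cluster averages can be reproduced from the three member records (this protein plus the two listed above). The sketch below is only an illustration; `parse_decimal` is a hypothetical helper that accepts either a decimal comma or a decimal point, since exported values may use either form.

```python
def parse_decimal(text: str) -> float:
    """Parse a number written as '9,61' or '9.61'."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as text) for all members of phalp2_11666.
members = [
    ("5EEm2", 216, "9,61"),
    ("1IRN9", 195, "9,61045"),
    ("4Lkxc", 217, "9,34580"),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(f"members: {len(members)}")
print(f"avg length: {avg_length:.1f} AA")  # 209.3
print(f"avg pI: {avg_pi:.2f}")             # 9.52
```

Both results agree with the summary line for the cluster.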
Similar Clusters (pHMM search)
#    Cluster        Protein ID   # Members   Identity (%)   Alignment Length   E-value
1    phalp2_2416    54zSE        377         40.7           130                1.142E-22
2    phalp2_8528    1rrcd        232         41.1           124                1.213E-20
3    phalp2_45      7E8g0        9           37.4           139                5.733E-20
4    phalp2_12064   5eok8        21          36.1           130                1.518E-17
5    phalp2_582     Vnfh         29          32.1           199                3.841E-17
6    phalp2_38972   84qx3        543         34.8           149                2.892E-15
7    phalp2_5410    2GVbS        418         30.8           133                5.356E-15
8    phalp2_23814   1gPzM        3           31.0           158                3.393E-14
9    phalp2_19867   5DmT3        416         35.7           123                4.614E-14
10   phalp2_37440   2KS36        1494        32.1           137                1.160E-13
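To work with the similar-cluster hits programmatically, one option is to load them into small records and filter or rank them. The sketch below is a minimal example: the `Hit` class, its field names, and the 35% identity cutoff are illustrative assumptions, and the `protein_id` field simply holds the protein identifier shown with each cluster.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    rank: int
    cluster: str
    protein_id: str
    members: int
    identity_pct: float
    alignment_length: int
    evalue: float


# Values copied from the pHMM search results on this page.
HITS = [
    Hit(1, "phalp2_2416", "54zSE", 377, 40.7, 130, 1.142e-22),
    Hit(2, "phalp2_8528", "1rrcd", 232, 41.1, 124, 1.213e-20),
    Hit(3, "phalp2_45", "7E8g0", 9, 37.4, 139, 5.733e-20),
    Hit(4, "phalp2_12064", "5eok8", 21, 36.1, 130, 1.518e-17),
    Hit(5, "phalp2_582", "Vnfh", 29, 32.1, 199, 3.841e-17),
    Hit(6, "phalp2_38972", "84qx3", 543, 34.8, 149, 2.892e-15),
    Hit(7, "phalp2_5410", "2GVbS", 418, 30.8, 133, 5.356e-15),
    Hit(8, "phalp2_23814", "1gPzM", 3, 31.0, 158, 3.393e-14),
    Hit(9, "phalp2_19867", "5DmT3", 416, 35.7, 123, 4.614e-14),
    Hit(10, "phalp2_37440", "2KS36", 1494, 32.1, 137, 1.160e-13),
]


def filter_by_identity(hits: list[Hit], min_identity: float) -> list[Hit]:
    """Keep hits whose identity is at least min_identity percent."""
    return [h for h in hits if h.identity_pct >= min_identity]


best = min(HITS, key=lambda h: h.evalue)        # most significant hit
largest = max(HITS, key=lambda h: h.members)    # cluster with most members
similar = filter_by_identity(HITS, 35.0)        # hypothetical cutoff

print("best hit:", best.cluster, best.evalue)
print("largest cluster:", largest.cluster, largest.members)
print("identity >= 35%:", [h.cluster for h in similar])
```

Note that the hits are ordered by E-value, not by identity, so the second hit has a slightly higher identity than the first.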

Domains

Representative sequence (used for alignment): 1IRN9 (195 AA)
Member sequence: 5EEm2 (216 AA)
Domain map axis: residues 1 to 195 (representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF07486

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5EEm2
Method AlphaFoldv2
Resolution 70.30
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
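The confidence legend can be applied to a score with a small lookup. The sketch below rests on two assumptions: that the value reported as "Resolution" for this AlphaFold model (70.30) is a mean pLDDT, which the page does not state, and that scores exactly on a boundary (90, 70, 50) fall into the higher band, since the legend uses strict inequalities on both sides. The function name `classify_plddt` is illustrative.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence bands.

    Boundary values are assigned to the higher band (an assumption;
    the legend leaves them unspecified).
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be within 0..100, got {score}")
    if score >= 90.0:
        return "Very high"
    if score >= 70.0:
        return "High"
    if score >= 50.0:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # 70.30 is the value listed for 5EEm2, read here as a mean pLDDT (assumed).
    print("5EEm2:", classify_plddt(70.30))
```

A mean score hides per-residue variation, so a model in the "High" band overall can still contain low-confidence regions.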

The structures below correspond to the cluster representative (1IRN9) rather than this protein.
PDB ID
1IRN9
Method AlphaFoldv2
Resolution 85.61
Chain position -