Protein

Protein accession
4Lkxc [EnVhog]
Representative
1IRN9
Source
EnVhog (cluster: phalp2_11666)
Protein name
4Lkxc
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MIKAKLVNALLTSKTSRTAARVALIAPTPILVGLAACGFISFKPTVEHTEVIEPVEVVQYRDVNIGGRQVTAHNLDCGTVRKDDLICLTCNIYEESRDQPLAGQILVAKTTMNRAKATGDSVCKTVWRHKQFSWTNFPLGKKPVKELDSWRRALEVAHLVMTEYGKAKSTDVNLIVMAGGQANNDIKWYHTQEVAPKWRKELDKVTVIGDHVFYKKG
Physico‐chemical
properties
protein length:217 AA
molecular weight:24202,8 Da
isoelectric point:9,35
hydropathy:-0,20
Representative Protein Details
Accession
1IRN9
Protein name
1IRN9
Sequence length
195 AA
Molecular weight
22369,36960 Da
Isoelectric point
9,61045
Sequence
LNKSHFSVANWRYGVVSAALCLLFYFYLTTLQFFTTPEVVKRGAIVSGVVEKEELTKEGRISACKKDKECRLLAEIGYYESRNQKTKAAAVGPMFVALNRKEANGWANTLRGVVYQKWQFSYTHDGSLERGFKEKSAYERMLYLAHKVYSGDVKDPTNGALWYHTHQVSPGWSKKLKHVVTLGDHKFYKRVKNES
Other Proteins in cluster: phalp2_11666
Total (incl. this protein): 3 Avg length: 209,3 Avg pI: 9,52

Protein ID Length (AA) pI
1IRN9 195 9,61045
5EEm2 216 9,61483
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2416
54zSE
377 40,7% 130 1.142E-22
2 phalp2_8528
1rrcd
232 41,1% 124 1.213E-20
3 phalp2_45
7E8g0
9 37,4% 139 5.733E-20
4 phalp2_12064
5eok8
21 36,1% 130 1.518E-17
5 phalp2_582
Vnfh
29 32,1% 199 3.841E-17
6 phalp2_38972
84qx3
543 34,8% 149 2.892E-15
7 phalp2_5410
2GVbS
418 30,8% 133 5.356E-15
8 phalp2_23814
1gPzM
3 31,0% 158 3.393E-14
9 phalp2_19867
5DmT3
416 35,7% 123 4.614E-14
10 phalp2_37440
2KS36
1494 32,1% 137 1.160E-13

Domains

Domains
Representative sequence (used for alignment): 1IRN9 (195 AA)
Member sequence: 4Lkxc (217 AA)
1 195 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1IRN9) rather than this protein.
PDB ID
1IRN9
Method AlphaFoldv2
Resolution 85.61
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50