Protein

Protein accession
16pzl [EnVhog]
Representative
4AMrZ
Source
EnVhog (cluster: phalp2_7379)
Protein name
16pzl
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MDTRGWSDIAYSFLVHQNGDIYEGRGWGIAGGHTAGYNSVSHAFCLIANTENTTPTDAAIQSLAFLVDEATRRYGPQRVEGHRAVASTACPGSRLFNMLGLVKAGSSFSPSQPAATRTLRVGSTGKDVEVLQTMLNVAVKTDLVVDGNFGPATEAAVKKYQTILKVSADGVWGPASQAAHNALFAFLEGLRNKDTEVARTSTLPSGTRPTLKRGDSGPNVTNLQHSLVYLGNKLTVDGSFGPETENVVKGFQAFFKVDGGADGIVGPSTWDKIDFLTAKLQMEKAVAYAKEQEAKAAEAAAAAAKKAAEQEAIATEKERQEALAAEAKRKADAAAAELEAARAAARTAAMEDLLERDKEASVQAELDRVKAERDGLKESLSEANEELGFLKKLINNFVKALSALNRPASQ
Physico‐chemical
properties
protein length:410 AA
molecular weight:43572,4 Da
isoelectric point:5,99
hydropathy:-0,30
Representative Protein Details
Accession
4AMrZ
Protein name
4AMrZ
Sequence length
408 AA
Molecular weight
44362,12990 Da
Isoelectric point
4,87181
Sequence
MDGRGWSDIAYSFLVDQKGNIYEGRGWGIAGGHTRGYNSVSHAFCLIANTQNVTPSDAAIKSLGWLVDEAERRYNNQLVNGHRDVAATACPGERLYALLDEVISITGTPSMPTLKRGSKGDDVTTLQSLLNLAVTKKIVVDGDYGPATEFAVREYQTILKVKVDGVWGPEAHAAHNALFAFLESLDNAKTPTAKEAEPASGKRALLKMGDAGVEVENLQKQLVKLGSEISVSSVFDLPTQKAVKGFQSFFKVEGGADGVVGPNTWDLIDFLTAKKDQDDARQAAEDAKRAAEQARLEAEQEQKDAEAKRTEQERKDALAAAIKAKATADALEAEAARLAAEAKELEDLLERDRIEVIQEELDRVNTEKEKVEEDLGDAEEEIGTLRKIIATLVNLVNALVNSEKNIKD
Other Proteins in cluster: phalp2_7379
Total (incl. this protein): 2 Avg length: 409,0 Avg pI: 5,43

Protein ID Length (AA) pI
4AMrZ 408 4,87181
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36521
1qwup
277 31,9% 272 2.264E-31
2 phalp2_22939
3bu4t
1 27,4% 270 3.794E-12

Domains

Domains
Representative sequence (used for alignment): 4AMrZ (408 AA)
Member sequence: 16pzl (410 AA)
1 408 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4AMrZ) rather than this protein.
PDB ID
4AMrZ
Method AlphaFoldv2
Resolution 84.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50