Protein

Protein accession
4NSyt [EnVhog]
Representative
5Eq5Q
Source
EnVhog (cluster: phalp2_18379)
Protein name
4NSyt
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKAFPLLRLGRRDPAVVLVRQKLGKRALRSGDLYDAGLEKAVKAFQRENGLEVDGMVGRNTYLALGLLQERQTPTMLMLHCTAQDVAEERVKVEPQAQNIVAYHLGLGWGRCGYHAIIERDGSVVQTVETNLADGFDPTDWAYGAGEYNAYAIHICYVGGVRKGKAKDTRTLHQRIALKNQVMKYLSNFPELVVLGHNQVHRKACPCFSVPKWGEAIGIEQNVSKADKFGIAKKMSI
Physico‐chemical
properties
protein length:237 AA
molecular weight:26350,2 Da
isoelectric point:9,33
hydropathy:-0,27
Representative Protein Details
Accession
5Eq5Q
Protein name
5Eq5Q
Sequence length
241 AA
Molecular weight
27042,74000 Da
Isoelectric point
8,53524
Sequence
MTFGILKFGSVGDQVRALQNELRKRGYALSVDGWFGEETRTVVLAFQREKGLAQDGIVGSQTLQALGLIALAPKTMPKYLMLHCSATPERSAGVNADQIVRYHMETLKWGRPGYSKIVEYDGRIVNTWDVNLTDGFQPFEITYGAAEWNPISVHICYIGGMDAAYKNPKNTMTTEQEASFAKIIKEVIRQCPDIKVVGHNQVHNKACPSFWVPDFCAKIGISLKNIETRDPFNQRAWLQTL
Other Proteins in cluster: phalp2_18379
Total (incl. this protein): 10 Avg length: 243,0 Avg pI: 7,98

Protein ID Length (AA) pI
5Eq5Q 241 8,53524
1gmAZ 244 6,20326
3Prmf 244 6,10720
4RyGb 241 6,90130
5EslQ 238 8,98742
5bCa9 260 8,25674
6BUF7 241 7,65891
gpcV 219 9,26993
her6 265 8,55142
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27876
8EUCV
18 40,0% 225 4.267E-61
2 phalp2_23766
13ajC
287 36,6% 210 5.828E-55
3 phalp2_7758
71AR5
15 39,2% 153 1.070E-48
4 phalp2_3314
2ZVzI
33 27,9% 254 4.495E-44
5 phalp2_21699
3i7KR
33 32,9% 191 6.974E-37
6 phalp2_3887
6RreM
7260 28,2% 152 4.981E-27
7 phalp2_32334
8DzzK
2 26,7% 146 7.000E-25

Domains

Domains
Representative sequence (used for alignment): 5Eq5Q (241 AA)
Member sequence: 4NSyt (237 AA)
1 241 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5Eq5Q) rather than this protein.
PDB ID
5Eq5Q
Method AlphaFoldv2
Resolution 95.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50