Protein

Protein accession: 23vyW [EnVhog]
Representative: 21saM
Source: EnVhog (cluster: phalp2_36602)
Protein name: 23vyW
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence:
MFIPRLTAPKKDNIFYYSNKNIFFKCGYGMPNCTAYSHGRFMEITGMDSGCRGNAGGWIEEAKKKGKFKISDTPELGAIIVWKNPNTKDNGHVGSVEEIKPNSDIVCSNSAYKGTNFYTQTYSGGKSYKWKSNSTGKVYEFQGFILPPMKLLEKERWVEITAGIQFRKDAHSTKRADRIGTIKKGEKFYCNGNYEIVAKKKWVQGTYNGTEGWICALHIKDIANE
Physico-chemical properties
protein length: 225 AA
molecular weight: 25452.8 Da
isoelectric point: 9.38
hydropathy: -0.65
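
The hydropathy value above is a per-residue average. The record does not say which scale was used; the sketch below assumes the Kyte-Doolittle scale (the usual choice for a GRAVY score), and the function name `gravy` is hypothetical. To check the listed value, pass in the full protein sequence from this record.

```python
# Kyte-Doolittle hydropathy values per amino acid (assumed scale; the record
# does not state which scale produced its hydropathy figure).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    unknown = set(residues) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"unsupported residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[r] for r in residues) / len(residues)


# Illustration on the first ten residues of the protein sequence above;
# use the full sequence to compare against the listed hydropathy.
fragment = "MFIPRLTAPK"
print(len(fragment), round(gravy(fragment), 2))
```

A negative score indicates an overall hydrophilic sequence, which matches the sign of the value listed here.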
Representative Protein Details
Accession: 21saM
Protein name: 21saM
Sequence length: 238 AA
Molecular weight: 26598.88780 Da
Isoelectric point: 9.25284
Sequence:
MNEFKPRLTAPEADNKNYFSKENPYVRDGYPMPNCTPYGMGRFHEAHGIWLPCRHNAEDWVKEAEAKGFEISNTPVLGSIAVWKVGKIGDGNDGAGHVCSVEILKTNLDFTGSNSGWFETKYADITQDPKYKKLFFYLQTFKASNGYAWTGSTGKKYELVGFILPKKKDIEPRGIYKTTGSLHLRSDAGKGNKSLLVMPKGSEFVSNGSFKLVDGVKWLNGQYKGINGWASEKYLKKG
Other Proteins in cluster: phalp2_36602
Total (incl. this protein): 3; Avg length: 255.7 AA; Avg pI: 9.27

Protein ID   Length (AA)   pI
21saM        238           9.25284
7DQ49        304           9.18534
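
The cluster averages can be reproduced from the three members, assuming they are plain arithmetic means that include this protein (225 AA, pI 9.38) alongside the two listed above. The function name `cluster_averages` is hypothetical.

```python
# (protein ID, length in AA, pI) for the three cluster members,
# copied from this record.
members = [
    ("23vyW", 225, 9.38),
    ("21saM", 238, 9.25284),
    ("7DQ49", 304, 9.18534),
]


def cluster_averages(rows):
    """Return (mean length, mean pI) over the given member rows."""
    n = len(rows)
    avg_length = sum(length for _, length, _ in rows) / n
    avg_pi = sum(pi for _, _, pi in rows) / n
    return avg_length, avg_pi


avg_length, avg_pi = cluster_averages(members)
print(f"Avg length: {avg_length:.1f} AA, Avg pI: {avg_pi:.2f}")
# prints "Avg length: 255.7 AA, Avg pI: 9.27"
```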
Similar Clusters (pHMM search)
#   Cluster        Protein ID   # Members   Identity (%)   Alignment Length   E-value
1   phalp2_18807   1gBg7        77          34.8%          178                1.795E-27
2   phalp2_4896    5TGNK        9           30.8%          201                3.336E-27
3   phalp2_16062   3WJNR        14          34.4%          203                3.336E-27
4   phalp2_11680   1Nyjf        74          32.5%          175                1.569E-26
5   phalp2_17558   5WoZV        75          34.0%          188                1.190E-24
6   phalp2_40337   3iVXO        43          32.7%          183                5.568E-24
7   phalp2_4546    3WKbr        13          27.0%          181                1.930E-19
8   phalp2_3062    21xLs        8           29.4%          180                1.930E-19
9   phalp2_40015   21q9r        12          31.6%          180                6.566E-19
10  phalp2_15440   410GZ        43          28.4%          183                4.108E-18
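
Identity percentages and alignment lengths combine into an approximate count of identical positions per hit. The sketch below works through the first three rows; the helper names are hypothetical, and `parse_number` assumes a comma in a value is a decimal separator, never a thousands separator.

```python
def parse_number(text: str) -> float:
    """Parse values such as '34,8%', '34.8' or '1.795E-27' into a float.

    Assumption: a comma is a decimal separator, not a thousands separator.
    """
    return float(text.strip().rstrip("%").replace(",", "."))


def identical_positions(identity: str, alignment_length: str) -> int:
    """Estimate identical positions as identity (%) x alignment length."""
    return round(parse_number(identity) / 100 * int(alignment_length))


# (cluster, identity, alignment length, E-value) from the first three hits.
hits = [
    ("phalp2_18807", "34,8%", "178", "1.795E-27"),
    ("phalp2_4896", "30,8%", "201", "3.336E-27"),
    ("phalp2_16062", "34,4%", "203", "3.336E-27"),
]

for cluster, identity, aln_len, e_value in hits:
    count = identical_positions(identity, aln_len)
    print(cluster, count, parse_number(e_value))
```

The counts are rounded estimates, since the identity percentages are themselves rounded to one decimal place.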

Domains

Annotated domain: CHAP (Pfam accession: PF05257); other regions are unannotated.
Representative sequence (used for alignment): 21saM (238 AA)
Member sequence: 23vyW (225 AA)
Domain positions follow the representative sequence above (axis: 1 to 238 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID             Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG [NCBI]   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (21saM) rather than this protein.
PDB ID: 21saM
Method: AlphaFoldv2
Resolution: 89.76
Chain position: -
Model confidence (pLDDT):
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
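
The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend does not say where scores of exactly 90, 70 or 50 belong, placing them in the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend uses strict inequalities throughout.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```

If the figure of 89.76 listed for the representative model is a mean pLDDT rather than a resolution (an assumption; the record labels it "Resolution"), it would fall in the "High" band.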