Protein

Protein accession
23a00 [EnVhog]
Representative
8beKF
Source
EnVhog (cluster: phalp2_15487)
Protein name
23a00
Lysin probability
88%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MAYIIQLFDPKAWDMKFAGTPYDVLPKKVQPLKNWAKEGGADIASNLCFFNFASARYFPLYTLQTLYVDGTLCGKGEASTSHLITLPNGDRVSGWCANEKKEKMPLIESDKIYIPEKRNRSSHSLFGVTTDGKIFTVKSKKGYLQDQLAVLVLRDMLTIYRTHIIYLFEEDGGGSTGVYSALSDNLYAPQREGKNGRSVTSAFLAKLKPCAKIKRTLKYGCVGPDVIIYQIALGSITADGIFGGDTRTRTIQFQKERGLEKDGIAGPITLGELKIFGE
Physico‐chemical
properties
protein length:278 AA
molecular weight:30776,1 Da
isoelectric point:9,10
hydropathy:-0,22
Representative Protein Details
Accession
8beKF
Protein name
8beKF
Sequence length
253 AA
Molecular weight
27292,95430 Da
Isoelectric point
9,15788
Sequence
MLNLYILDPKEWHVWLDGPAYMSGKTMTVRDRALSNGADIVWNLGMFNMSNGYSVTTVHNHKGDLGYGGASDIVDINQGDYCKGYSNGIKDGFVFLDKPMGGSRTRCGVGRTTDGCIIIAQTSNKVTEKAVCASVNNGVSKRGKKVSIFVMEDGGGSTSQYSSISKLTFYPEGVRKVCTVICATRINVPKVERNLSLWKKGDDVRRLQEVLGGIECDGSFGFGTRSRLIQAQKALGLVADGSCGPLTRKALGL
Other Proteins in cluster: phalp2_15487
Total (incl. this protein): 15 Avg length: 282,3 Avg pI: 9,11

Protein ID Length (AA) pI
8beKF 253 9,15788
23Itu 274 9,59755
23hyR 271 9,23956
38MBR 258 9,45817
3Zu3H 265 9,45585
3iX6X 261 9,16968
40hgL 263 9,19243
41ljc 274 9,63598
4koiO 261 9,16968
5Uw8F 315 7,06818
71hmE 271 9,31060
7rUv8 447 8,31379
8k0f0 271 9,24833
8mknv 273 9,49937
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7999
Zqjr
4 29,1% 199 4.032E-42
2 phalp2_24674
5LYip
2 27,9% 168 6.688E-41
3 phalp2_35685
3Aso6
4 26,9% 271 5.839E-18

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 8beKF (253 AA)
Member sequence: 23a00 (278 AA)
1 253 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
23a00
Method AlphaFoldv2
Resolution 68.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (8beKF) rather than this protein.
PDB ID
8beKF
Method AlphaFoldv2
Resolution 74.31
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50