Protein

Protein accession
23gqb [EnVhog]
Representative
5nsH
Source
EnVhog (cluster: phalp2_17775)
Protein name
23gqb
Lysin probability
93%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MVYHFIYTNNATIKENAESTFNNIKKAGLNPEEIWIAADLEYDTWKKNKEICTKEKCTAYTKEYLDELKALGCKKLFIYTNTDYYKNYYDWNQLSDYPIWLADYAGAPDYPCAMQQYTSTGRISGINGYVDMNYLFDETMLDNNLKE
Physico‐chemical
properties
protein length:147 AA
molecular weight:17304,2 Da
isoelectric point:4,74
hydropathy:-0,68
Representative Protein Details
Accession
5nsH
Protein name
5nsH
Sequence length
144 AA
Molecular weight
16864,72370 Da
Isoelectric point
4,85674
Sequence
MVYHFIYTNNATIQENAESTVNNIKKAGLNPENLWIAADLEYDTWKKNKEICTKEKCTKYTKEYLNALKSLGCKKLFIYANTDYYENYYDWNQLSEYPIWLADYKGAPDYPCAIQQYSSTGKVNGINGYVDMDYLFDESMLNNI
Other Proteins in cluster: phalp2_17775
Total (incl. this protein): 4 Avg length: 161,5 Avg pI: 4,96

Protein ID Length (AA) pI
5nsH 144 4,85674
24IMr 197 5,06421
2ViyX 158 5,17055
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10789
3TIsE
2 39,4% 137 7.814E-25
2 phalp2_4531
3PYph
1 29,7% 121 1.996E-12
3 phalp2_6721
21tVf
5 30,9% 139 5.094E-12
4 phalp2_20361
2Sd5N
42 26,7% 142 1.896E-09
5 phalp2_36234
7vdI3
1 27,9% 129 6.566E-09
6 phalp2_38591
21A2U
2 28,8% 104 2.271E-08
7 phalp2_32129
6Dieh
5 25,8% 139 3.096E-08
8 phalp2_18475
6I6XC
2 24,8% 165 7.839E-08
9 phalp2_22731
85jlZ
4 26,2% 141 1.265E-06
10 phalp2_40311
3e30k
1 32,3% 102 4.336E-06

Domains

Domains
Representative sequence (used for alignment): 5nsH (144 AA)
Member sequence: 23gqb (147 AA)
1 144 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
23gqb
Method AlphaFoldv2
Resolution 95.52
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5nsH) rather than this protein.
PDB ID
5nsH
Method AlphaFoldv2
Resolution 96.97
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50