Protein

Protein accession
2SP7U [EnVhog]
Representative
2nG0L
Source
EnVhog (cluster: phalp2_19393)
Protein name
2SP7U
Lysin probability
83%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MQVSNGLLKDSGVSNWQNPDNNIMVGTKYIDSLLKMFGNNLEKTIAAYNSGEGTVQNLVKKYGDKWKDHLPSETKNYLPKVLSSYYN
Physico‐chemical properties
protein length: 87 AA
molecular weight: 9818.0 Da
isoelectric point: 8.83
hydropathy: -0.70
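The hydropathy value can be checked against the sequence listed above. The sketch below assumes the reported figure is a GRAVY score, that is, the mean Kyte-Doolittle hydropathy index over all residues. The page does not name the scale, so this is an assumption, and the names `KYTE_DOOLITTLE` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the page's "hydropathy" is the mean of these values (GRAVY).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 2SP7U as listed in this record.
SEQUENCE = (
    "MQVSNGLLKDSGVSNWQNPDNNIMVGTKYIDSLLKMFGNNLEKTIAAYNSGEGTVQNLVKKYGDKWKDHL"
    "PSETKNYLPKVLSSYYN"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQUENCE)} AA")
    print(f"GRAVY:  {gravy(SEQUENCE):.2f}")
```

Under that assumption the script reports a length of 87 AA and a GRAVY that rounds to the listed -0.70.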
Representative Protein Details
Accession
2nG0L
Protein name
2nG0L
Sequence length
106 AA
Molecular weight
12682.34410 Da
Isoelectric point
9.40840
Sequence
ARGLTQIMKGTWEECVKRMGHSDWTWDDAFDPKKNLAVGTYYTNTRIPEMLKTYKIPDNIETRIAAYNWGIGKLNKTYKKYDSKWIEHIPTETKDYIWKYNARTIR
Other Proteins in cluster: phalp2_19393
Total (incl. this protein): 2
Avg length: 96.5 AA
Avg pI: 9.12

Protein ID   Length (AA)   pI
2nG0L        106           9.40840
Similar Clusters (pHMM search)
#    Cluster                Members   Identity (%)   Alignment length   E-value
1    phalp2_31846 (4Bigk)   1         49.0           104                2.560E-27
2    phalp2_8610 (20rBi)    11        41.0           112                9.760E-17
3    phalp2_29294 (tD4R)    30        40.1           102                2.021E-10
4    phalp2_34820 (3OM0N)   2         37.1           78                 7.147E-10
5    phalp2_39842 (VBHH)    3         33.6           104                2.300E-08
6    phalp2_28433 (1IScg)   74        27.3           106                1.896E-06
7    phalp2_15550 (8kw4S)   416       30.8           120                1.896E-06
8    phalp2_3949 (7gIqH)    9         30.7           104                9.140E-06
9    phalp2_16208 (4IVZn)   2         27.2           99                 2.347E-05
10   phalp2_35357 (7w0rS)   3         28.5           70                 5.410E-04

Domains

Representative sequence (used for alignment): 2nG0L (106 AA)
Member sequence: 2SP7U (87 AA)
Domain diagram axis: residues 1 to 106 (representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01464

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 2SP7U
Method: AlphaFold v2
Resolution: 84.75
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
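The confidence bands in the legend can be expressed as a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and applying it to the two values this page lists under "Resolution" (84.75 and 95.94) assumes those values are mean pLDDT scores, which the page does not state. The legend leaves the exact boundaries (90, 70, 50) unassigned; the sketch places them in the lower band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70 or 50) are not covered by the legend;
    assigning them to the lower band is an assumption of this sketch.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Values listed under "Resolution" on this page, treated here as mean pLDDT.
    for accession, value in [("2SP7U", 84.75), ("2nG0L", 95.94)]:
        print(accession, value, plddt_band(value))
```

Read this way, the 2SP7U model falls in the "High" band and the 2nG0L model in the "Very high" band.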

The structures below correspond to the cluster representative (2nG0L) rather than this protein.
PDB ID: 2nG0L
Method: AlphaFold v2
Resolution: 95.94
Chain position: -