Protein

Protein accession
7sd9J [EnVhog]
Representative
2aR1M
Source
EnVhog (cluster: phalp2_31323)
Protein name
7sd9J
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNLVQPSTRLLAESCEQEGLLRNQCAYVLATAWHETSAYKYMREIWGPTPAQKRYEGRKDLGNTVAGDGKKFMGRGFVQITGRRNYTDWSRRLGLDLLKEPQLAERPEIAVRIIVKGMKIGTFTGKKLVRRPAEGRGLRPALCGSVSS
Physico-chemical properties
protein length: 148 AA
molecular weight: 16626.1 Da
isoelectric point: 10.06
hydropathy: -0.51
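The listed properties can be recomputed from the sequence. The following is a minimal sketch, assuming the hydropathy value is a GRAVY score on the Kyte-Doolittle scale and that the molecular weight is the sum of average residue masses plus one water. The page does not say which method it uses, so small differences from the listed values are possible, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue (assumed scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus one water).
MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01524  # one water is added back for the free termini

# Sequence of 7sd9J as listed on this page.
SEQ = (
    "MNLVQPSTRLLAESCEQEGLLRNQCAYVLATAWHETSAYKYMREIWGPTP"
    "AQKRYEGRKDLGNTVAGDGKKFMGRGFVQITGRRNYTDWSRRLGLDLLKE"
    "PQLAERPEIAVRIIVKGMKIGTFTGKKLVRRPAEGRGLRPALCGSVSS"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da of a linear, unmodified chain."""
    return sum(MASS[aa] for aa in seq) + WATER


if __name__ == "__main__":
    print("length:", len(SEQ), "AA")
    print("molecular weight:", round(molecular_weight(SEQ), 1), "Da")
    print("hydropathy (GRAVY):", round(gravy(SEQ), 2))
```

The isoelectric point is left out because it depends on the chosen pKa set, which the page does not specify.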
Representative Protein Details
Accession
2aR1M
Protein name
2aR1M
Sequence length
130 AA
Molecular weight
14415.35790 Da
Isoelectric point
8.82464
Sequence
MSLANDLKVLGPKGNPTTIKLLAACAEEIMAEYEINTPLRVSHFWAQAAHECAGFRTMHEYWKPTPAQCRYEGRKDLGKVQPGDGYLFRGRGIFQLTGRANYETMGKKHGRDLIGNPDLAAQPEGALRIA
Other Proteins in cluster: phalp2_31323
Total (incl. this protein): 3; Avg length: 134.3; Avg pI: 8.49

Protein ID   Length (AA)   pI
2aR1M        130           8.82464
89z8o        125           6.58260
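The cluster averages can be checked by hand from the three members (this protein plus the two listed). A short sketch, using only values shown on this page; the function name is illustrative:

```python
# (protein ID, length in AA, pI) for all three members of phalp2_31323,
# including this protein (7sd9J), as listed on this page.
members = [
    ("7sd9J", 148, 10.06),
    ("2aR1M", 130, 8.82464),
    ("89z8o", 125, 6.58260),
]


def cluster_summary(rows):
    """Return (member count, mean length, mean pI)."""
    n = len(rows)
    avg_len = sum(length for _, length, _ in rows) / n
    avg_pi = sum(pi for _, _, pi in rows) / n
    return n, avg_len, avg_pi


total, avg_len, avg_pi = cluster_summary(members)
print(total, round(avg_len, 1), round(avg_pi, 2))  # prints: 3 134.3 8.49
```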
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_23254 (4Uvr8)   48          44.0%          143                4.937E-33
2   phalp2_15253 (11H4w)   1           44.8%          98                 4.561E-22
3   phalp2_37433 (2Fv3r)   11643       41.1%          124                4.687E-19
4   phalp2_36134 (6NUh4)   42          35.3%          130                1.003E-13
5   phalp2_1761 (2LjWy)    7           30.3%          135                4.342E-12
6   phalp2_11630 (1plIc)   23          27.8%          165                1.090E-08
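To work with these hits programmatically, they can be held as records and filtered. The sketch below is an illustration only: the 40% identity and 1e-10 E-value cutoffs are arbitrary assumptions, not thresholds used by the database, and the field names are hypothetical.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity: float      # percent identity
    aln_length: int      # alignment length
    evalue: float


# Values copied from the similar-clusters table on this page.
HITS = [
    Hit("phalp2_23254", 48, 44.0, 143, 4.937e-33),
    Hit("phalp2_15253", 1, 44.8, 98, 4.561e-22),
    Hit("phalp2_37433", 11643, 41.1, 124, 4.687e-19),
    Hit("phalp2_36134", 42, 35.3, 130, 1.003e-13),
    Hit("phalp2_1761", 7, 30.3, 135, 4.342e-12),
    Hit("phalp2_11630", 23, 27.8, 165, 1.090e-08),
]


def filter_hits(hits, min_identity=40.0, max_evalue=1e-10):
    """Keep hits passing both cutoffs, most significant first."""
    kept = [h for h in hits
            if h.identity >= min_identity and h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


for hit in filter_hits(HITS):
    print(hit.cluster, hit.identity, hit.evalue)
```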

Domains

Unannotated
Representative sequence (used for alignment): 2aR1M (130 AA)
Member sequence: 7sd9J (148 AA)
Axis: 1 to 130 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 7sd9J
Method AlphaFoldv2
Resolution 65.68
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
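The confidence bands can be expressed as a small lookup. This is a sketch under one assumption: the legend uses strict inequalities, so scores of exactly 90, 70, or 50 are not covered, and here they are assigned to the lower band. The function name is illustrative. The example scores are the values shown under Resolution on this page, which may be mean pLDDT scores, although the page does not state this.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70, 50) fall into the lower band,
    which is an assumption; the legend leaves them unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (65.68, 92.81):
    print(score, confidence_band(score))
```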

The structures below correspond to the cluster representative (2aR1M) rather than this protein.
PDB ID 2aR1M
Method AlphaFoldv2
Resolution 92.81
Chain position -