Protein

Protein accession
7Hsfh [EnVhog]
Representative
4JALa
Source
EnVhog (cluster: phalp2_37841)
Protein name
7Hsfh
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MERSEIITELKQYFKISELVGSKTNSTYGERAFRFFGTNILHALLIVRENIGKPITVNGRGMEQRGLRTNIQPIVKSKSDRGRLYISAHIQGRAFDFDVKGMTAQEVRDWIVANENLFPFKIRLEDGVTWVHMDDIDEDKNPKVYLFNP
Physico‐chemical
properties
protein length:149 AA
molecular weight:17248,5 Da
isoelectric point:8,89
hydropathy:-0,44
Representative Protein Details
Accession
4JALa
Protein name
4JALa
Sequence length
114 AA
Molecular weight
13156,94170 Da
Isoelectric point
6,36434
Sequence
MIPEYALRSLDALREKLNHPIYINNAGMGFDYSGVRPVGCKIGARWSGHKGYRKEVCWDLKTFDHHYLSALLEIIESDHKEYHIGKIEEPEKTMPRGWIHTTMVESPGSDLIIF
Other Proteins in cluster: phalp2_37841
Total (incl. this protein): 10 Avg length: 145,1 Avg pI: 8,34

Protein ID Length (AA) pI
4JALa 114 6,36434
19gvM 144 9,08090
28JeD 149 9,15311
2V8kY 152 10,02956
2sPvi 142 8,56432
3dC1K 146 6,95450
5jWgF 154 6,40714
6tn9m 153 9,04132
7ymB5 148 8,94172
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1968
4BR1x
969 28,9% 107 1.061E-10
2 phalp2_34764
3iKHD
290 34,0% 97 2.237E-08
3 phalp2_7525
4Yxu6
7 26,2% 118 3.064E-08
4 phalp2_25932
6bZlr
11 30,6% 111 9.691E-07
5 phalp2_31809
4kTVa
1 33,9% 103 5.676E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4JALa (114 AA)
Member sequence: 7Hsfh (149 AA)
1 114 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4JALa) rather than this protein.
PDB ID
4JALa
Method AlphaFoldv2
Resolution 92.25
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50