Protein

Protein accession
5sIa3 [EnVhog]
Representative
2a2cC
Source
EnVhog (cluster: phalp2_36636)
Protein name
5sIa3
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
LAVDVHTAGEPKRLKWLIANVRKFGFSWEVVPEEPWHIRYTEGDNPPQAVVEFMAKSNIQKPESAATPAANNAVSGAPTAKDDGGDLDLGDSGPRVVKLQEELAQRGFYNCSFDGQFGPKTEQAVVAY
Physico-chemical properties
protein length: 128 AA
molecular weight: 13943.4 Da
isoelectric point: 4.98
hydropathy: -0.48
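The length and hydropathy values can be reproduced from the sequence alone. The sketch below assumes the hydropathy figure is a GRAVY score (mean per-residue hydropathy) on the Kyte-Doolittle scale; the record does not state which scale was used, and the names `KD`, `SEQ` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
# Assumption: the record's "hydropathy" is a GRAVY score on this scale.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 5sIa3, copied from the record.
SEQ = (
    "LAVDVHTAGEPKRLKWLIANVRKFGFSWEVVPEEPWHIRYTEGDNPPQAVVEFMAKSNIQ"
    "KPESAATPAANNAVSGAPTAKDDGGDLDLGDSGPRVVKLQEELAQRGFYNCSFDGQFGPK"
    "TEQAVVAY"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


length = len(SEQ)
hydropathy = gravy(SEQ)
print(length, round(hydropathy, 2))  # 128 -0.48
```

Under that assumption the computed values agree with the reported length and hydropathy. Molecular weight and isoelectric point depend on the mass table and pKa set chosen, so they are left out of this sketch.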
Representative Protein Details
Accession
2a2cC
Protein name
2a2cC
Sequence length
152 AA
Molecular weight
15980.89550 Da
Isoelectric point
8.09383
Sequence
LAVDVHTAGEPKRLKWLIANVRKFGFSWEVVPEEPWHLRYTEGDNPPAAVAEFMAKNNIQKPSGLAAPAASTAAAGAPAVKDDGGDLDPGDSGPRVTKLQEELAERGFYKGKADGDFGPKTKAAVIAYKQAKGFGAGPKAGKRVLDDLGIGL
Other Proteins in cluster: phalp2_36636
Total (incl. this protein): 24; Avg length: 163.5; Avg pI: 7.67

Protein ID Length (AA) pI
2a2cC 152 8.09383
1PZIw 153 5.75525
1Q02q 153 5.75531
1Qe6p 132 7.79773
1QiXk 128 8.66347
1R2Fo 176 6.11516
1ZDvL 170 6.11243
1qaRD 270 9.09657
246XZ 153 5.75531
38gAn 144 9.33091
523BF 166 9.24839
53C6e 166 9.52283
57fZ7 166 9.52283
59VZh 148 6.18138
5Ev3s 136 5.50255
5sHBN 152 6.94495
7WRGV 193 8.60861
AUco 152 6.95120
GGIb 143 9.12378
MH0R 254 9.20765
anxs 148 6.91420
iS3Z 181 9.72185
jKZ7 161 9.09786
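The cluster totals can be cross-checked against the member list. The sketch below averages the 23 listed members plus this protein (5sIa3, 128 AA, pI 4.98, taken from the physico-chemical properties above); the variable names are illustrative and not part of the database.

```python
# (protein ID, length in AA, isoelectric point) for every cluster member.
# The first 23 rows come from the member table; 5sIa3 is this record's protein.
MEMBERS = [
    ("2a2cC", 152, 8.09383), ("1PZIw", 153, 5.75525), ("1Q02q", 153, 5.75531),
    ("1Qe6p", 132, 7.79773), ("1QiXk", 128, 8.66347), ("1R2Fo", 176, 6.11516),
    ("1ZDvL", 170, 6.11243), ("1qaRD", 270, 9.09657), ("246XZ", 153, 5.75531),
    ("38gAn", 144, 9.33091), ("523BF", 166, 9.24839), ("53C6e", 166, 9.52283),
    ("57fZ7", 166, 9.52283), ("59VZh", 148, 6.18138), ("5Ev3s", 136, 5.50255),
    ("5sHBN", 152, 6.94495), ("7WRGV", 193, 8.60861), ("AUco", 152, 6.95120),
    ("GGIb", 143, 9.12378), ("MH0R", 254, 9.20765), ("anxs", 148, 6.91420),
    ("iS3Z", 181, 9.72185), ("jKZ7", 161, 9.09786),
    ("5sIa3", 128, 4.98),
]

total = len(MEMBERS)
avg_length = sum(length for _, length, _ in MEMBERS) / total
avg_pi = sum(pi for _, _, pi in MEMBERS) / total

print(total, round(avg_length, 1), round(avg_pi, 2))  # 24 163.5 7.67
```

The reported averages only match when this protein is counted, which is consistent with the "incl. this protein" note on the totals line.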
Similar Clusters (pHMM search)
# Cluster (protein) Members Identity (%) Alignment length E-value
1 phalp2_38906 (22Pcz) 5 72.1% 104 1.312E-39
2 phalp2_30242 (4eMiX) 353 30.9% 139 8.243E-09
3 phalp2_290 (6Igwr) 10 24.4% 147 1.325E-07

Domains

Annotated domain: PG_1 (other regions: Unannotated)
Pfam accessions: PF01471
Representative sequence (used for alignment): 2a2cC (152 AA)
Member sequence: 5sIa3 (128 AA)
Domain positions follow the representative sequence.

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2a2cC) rather than this protein.
PDB ID
2a2cC
Method AlphaFoldv2
Resolution 70.55
Chain position -
Model confidence (pLDDT)
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
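The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and the handling of scores exactly equal to 90, 70 or 50 is an assumption, since the legend uses strict inequalities and leaves those values unassigned.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) falls into the
    lower band; the legend itself does not say where boundary values go.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Example scores chosen for illustration only.
for score in (95.2, 82.0, 61.5, 34.8):
    print(score, plddt_band(score))
```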