Protein

Protein accession
55q96 [EnVhog]
Representative
4ak5q
Source
EnVhog (cluster: phalp2_30225)
Protein name
55q96
Lysin probability
95%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSFITLQTLLMATPANAEMLEEMDQVQRVTAKDVRQEPIVKEEFAPTWIVYPVDAGFETMSSDFGYRAKACNACSTNHQGIDFLQPKGSTVRAVFNGTVIEVADNGGGLGSIVLISHPDLGGIQTVYAHLIRGSQTVKVGDSVIPGQKIGKVGLTGTTTAYHLHFGVYVDGYAIDPEKWLKANKVKRFDG
Physico‐chemical
properties
protein length:190 AA
molecular weight:20591,2 Da
isoelectric point:5,93
hydropathy:-0,07
Representative Protein Details
Accession
4ak5q
Protein name
4ak5q
Sequence length
162 AA
Molecular weight
17573,77800 Da
Isoelectric point
6,35547
Sequence
MTANDVSQTPVVQETFEPQWIVFPVDAGFEVVSSDFGWRAKACNLCSTNHQGIDFIQKKGSTIRAVFFGTVVDVGDDGGGLGSFVLIRHPDLGGIETVYAHMITGSQKVRVGDSVIPGQKIGKVGMTGVTTAYHLHFGIYVDGRAIDPEIWLKAHHVKRFNG
Other Proteins in cluster: phalp2_30225
Total (incl. this protein): 3 Avg length: 185,7 Avg pI: 6,74

Protein ID Length (AA) pI
4ak5q 162 6,35547
4e7HF 205 7,94961
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5764
7GfkS
23 41,1% 119 4.929E-29
2 phalp2_39871
17EO3
18 41,0% 134 4.950E-23
3 phalp2_20604
4H0aV
4 37,5% 133 1.021E-20
4 phalp2_27075
2pChT
5 40,0% 130 2.337E-19
5 phalp2_5594
4aDQQ
125 34,1% 126 2.087E-18
6 phalp2_1367
1dUID
1 37,8% 103 3.901E-18
7 phalp2_19030
17y4D
9 34,3% 137 7.289E-18
8 phalp2_24104
8fsA2
7 38,6% 106 3.476E-17
9 phalp2_33044
4vnRi
8 34,1% 123 3.091E-16
10 phalp2_6449
8Gh7b
59 33,3% 129 5.769E-16

Domains

Domains
Representative sequence (used for alignment): 4ak5q (162 AA)
Member sequence: 55q96 (190 AA)
1 162 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4ak5q) rather than this protein.
PDB ID
4ak5q
Method AlphaFoldv2
Resolution 89.44
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50