Protein

Protein accession
81smS [EnVhog]
Representative
4GhkT
Source
EnVhog (cluster: phalp2_23143)
Protein name
81smS
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MKVSKHFSLEELTKSQTGERLGIDNLPSDAHLASLTLLCKKVLEPIRAHYGRAVTINSGYRGPALNKAVGGAATSQHCEGEAADIEIAGVANGDLANWIETNLDYDQLILECYKRGVPDSGWVHVSYKTADNRKQELTASVVGGKMTYTPGINP
Physico‐chemical
properties
protein length:154 AA
molecular weight:16610,6 Da
isoelectric point:6,19
hydropathy:-0,33
Representative Protein Details
Accession
4GhkT
Protein name
4GhkT
Sequence length
180 AA
Molecular weight
20843,55670 Da
Isoelectric point
8,28008
Sequence
MPPEIYTRDYQLNSVFTLGKLCITEHRDFIDINFEESKKYLDNLKRVCNELLVPISVLLGEVPYITSGFRCDALNKSVGGTITSQHSYGEAVDTVYVKHNLKEVFNKIAFESSIQYSQIIFEFGTWIHIAVIDEVLYPGKKLQKLIASRQNSKVVYTPVFNCQEFSRNRETYRENSKGRR
Other Proteins in cluster: phalp2_23143
Total (incl. this protein): 33 Avg length: 153,4 Avg pI: 7,10

Protein ID Length (AA) pI
4GhkT 180 8,28008
12M2i 154 6,05105
1BPOO 177 8,57921
1jesB 154 5,67380
360fZ 173 5,45679
3CJK7 157 5,37472
3F2az 157 6,51684
3H3QA 158 6,58465
3H4w1 157 5,56399
3XMlj 154 8,79260
3ZGo0 145 9,34419
3ZKMh 145 8,60906
3xiCp 157 5,57979
40cSB 145 8,90613
42zWT 157 5,81863
4OcGe 140 8,45176
4TJvg 147 8,60042
4b0DD 154 8,79260
5sbAb 158 5,75525
5skoU 157 5,95499
5yB5n 154 8,43364
6PPBs 108 5,87200
7BqTZ 151 8,45176
7L8hT 127 7,85407
7xX8l 150 9,49531
841gQ 157 5,95499
8hZxH 159 6,29460
8hplr 157 5,57979
8jsu1 157 5,95499
8nbhI 146 9,65570
8u8pM 158 5,96743
8uQHq 158 5,96465
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1343
17yCd
10947 30,1% 156 1.114E-51
2 phalp2_30101
38zFk
347 35,5% 152 8.310E-49
3 phalp2_38153
6WaM5
59 31,4% 159 1.591E-45
4 phalp2_2615
6LPFo
3 30,5% 167 2.987E-45
5 phalp2_21393
3LPeU
1997 30,4% 161 1.069E-41
6 phalp2_8264
3Vy7
8 34,9% 146 7.063E-41
7 phalp2_38677
GcsZ
89 27,6% 141 1.198E-39
8 phalp2_38661
86jWa
3 27,2% 158 5.775E-39
9 phalp2_37718
4e1ey
787 36,2% 127 2.032E-38
10 phalp2_29699
1f2kx
4387 26,7% 161 2.782E-38

Domains

Domains
Representative sequence (used for alignment): 4GhkT (180 AA)
Member sequence: 81smS (154 AA)
1 180 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4GhkT) rather than this protein.
PDB ID
4GhkT
Method AlphaFoldv2
Resolution 90.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50