Protein

Protein accession
4kGfC [EnVhog]
Representative
84cgR
Source
EnVhog (cluster: phalp2_13065)
Protein name
4kGfC
Lysin probability
94%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MRKIIIILMLCLFPLCFGFTTVEVERVCISAYVVKNDDMKDIVIQHLKQSEGFRSEPYYCPAGQLTIGYGHKIKADEHFDVISIEQADSLLMYDFNIYYELVKSELVGCNNNIVIAMTHFAMGTGFQYFKKSKICKMIHNGEDVRNELLKYCKYKNVNGEYIVSKRLLRNREFEVMIINTK
Physico‐chemical
properties
protein length:181 AA
molecular weight:21094,5 Da
isoelectric point:8,10
hydropathy:-0,02
Representative Protein Details
Accession
84cgR
Protein name
84cgR
Sequence length
183 AA
Molecular weight
20864,06240 Da
Isoelectric point
9,54933
Sequence
MKTSTKNAINLLIILCLGIVIFAQHRAANRPRTSQISPVISLDCSDYDIAIQHIKEAEGFRSTPDTKEGQSLVGYGFSRYVCEQKPMTEKEADIILRKQYDKAIYQAYRETRLEGRKLLAVACLIYNLKPASWQKSTIKQVVTCGDRERITAAWMSLCNAGGKPMNGLKKRRLWELQFYISQK
Other Proteins in cluster: phalp2_13065
Total (incl. this protein): 4 Avg length: 184,0 Avg pI: 8,31

Protein ID Length (AA) pI
84cgR 183 9,54933
4Rvcv 181 9,46971
4kX1S 191 6,12391
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_35413
eK0d
13 31,9% 144 1.536E-20
2 phalp2_11015
4OJHa
83 32,8% 140 2.097E-20
3 phalp2_19495
3emsO
1 31,6% 158 1.200E-18
4 phalp2_16810
1lboi
1 31,2% 147 1.057E-17
5 phalp2_4451
31DIk
4919 29,5% 142 6.171E-14
6 phalp2_13264
3cxGa
4 29,4% 173 6.171E-14
7 phalp2_39178
3afgT
3 29,0% 141 8.402E-14
8 phalp2_2632
6RhYr
14867 30,4% 125 1.557E-13
9 phalp2_34911
4lF8s
21 26,3% 144 1.346E-12
10 phalp2_32792
2snIK
45 27,7% 126 3.386E-12

Domains

Domains
Disordered region
GH24
Representative sequence (used for alignment): 84cgR (183 AA)
Member sequence: 4kGfC (181 AA)
1 183 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (84cgR) rather than this protein.
PDB ID
84cgR
Method AlphaFoldv2
Resolution 89.86
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50