Protein

Protein accession: 4GVnO [EnVhog]
Representative: 4Huce
Source: EnVhog (cluster: phalp2_34979)
Protein name: 4GVnO
Lysin probability: 83%
PhaLP type: endolysin (probability: 97%, predicted by ML model)
Protein sequence
MTIKYKAGVNLAGVRNEIMDILPLVESCFRDLNCMSITLTCTTGNHKLNDPHTNGFAIDIRMHDYSIELQNRLFNHVNKTLSGLYYTVLEFPGKQKAHLHIQVYKGTWNSILLHEDMKRRI
Physico-chemical properties
protein length: 121 AA
molecular weight: 13989.1 Da
isoelectric point: 8.83
hydropathy: -0.26
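The length, molecular weight, and hydropathy values above can be recomputed from the sequence. The following is a minimal sketch, assuming that the molecular weight is the sum of average residue masses plus one water molecule and that "hydropathy" is the GRAVY score (mean Kyte-Doolittle value per residue). The page does not state which method it uses, so both choices are assumptions, and the function names are illustrative only.

```python
# Average residue masses in Da (standard values); one water is added for the termini.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528

# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4GVnO as listed in this record.
SEQ_4GVNO = (
    "MTIKYKAGVNLAGVRNEIMDILPLVESCFRDLNCMSITLTCTTGNHKLNDPHTNGFAIDI"
    "RMHDYSIELQNRLFNHVNKTLSGLYYTVLEFPGKQKAHLHIQVYKGTWNSILLHEDMKRRI"
)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQ_4GVNO)} AA")
    print(f"molecular weight: {molecular_weight(SEQ_4GVNO):.1f} Da")
    print(f"hydropathy (GRAVY): {gravy(SEQ_4GVNO):.2f}")
```

Under these assumptions the results should land close to the listed values; small differences in the molecular weight can come from the mass table used.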
Representative Protein Details
Accession: 4Huce
Protein name: 4Huce
Sequence length: 98 AA
Molecular weight: 11291.54150 Da
Isoelectric point: 5.19698
Sequence
MWCLFDLIEAAFKLIGKDATITCGTNGHAGTDPHYHGYALDFRCNDLTEEEEKVVIGYIEEHVDRAFYYVFHENDGQPEEHIHIQVRKAIWPGLQKKG
Other Proteins in cluster: phalp2_34979
Total (incl. this protein): 7
Avg length: 121.3
Avg pI: 6.57

Protein ID   Length (AA)   pI
4Huce        98            5.19698
16IXt        118           6.15631
1laHJ        121           6.49513
2oOCt        136           7.83035
4BKlb        117           6.17956
4HYv6        138           5.28463
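The cluster summary (member count, average length, average pI) can be checked against the member table. This is a minimal sketch, assuming the averages are plain arithmetic means over all seven members, that is, the six proteins listed plus this protein (4GVnO, 121 AA, pI 8.83). The helper name `cluster_averages` is hypothetical.

```python
# (protein ID, length in AA, isoelectric point) for every member of phalp2_34979.
# The last entry is this record's own protein, which the table above omits.
CLUSTER_MEMBERS = [
    ("4Huce", 98, 5.19698),
    ("16IXt", 118, 6.15631),
    ("1laHJ", 121, 6.49513),
    ("2oOCt", 136, 7.83035),
    ("4BKlb", 117, 6.17956),
    ("4HYv6", 138, 5.28463),
    ("4GVnO", 121, 8.83),
]


def cluster_averages(members):
    """Return (member count, mean length, mean pI) for a list of member tuples."""
    count = len(members)
    mean_length = sum(length for _, length, _ in members) / count
    mean_pi = sum(pi for _, _, pi in members) / count
    return count, mean_length, mean_pi


if __name__ == "__main__":
    count, mean_length, mean_pi = cluster_averages(CLUSTER_MEMBERS)
    print(f"Total: {count}  Avg length: {mean_length:.1f}  Avg pI: {mean_pi:.2f}")
```

With these inputs the means round to the summary values given for the cluster, which suggests the summary does include this protein.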
Similar Clusters (pHMM search)
#   Cluster        Protein ID   # Members   Identity (%)   Alignment Length   E-value
1   phalp2_22895   2Sm0p        48          29.2%          106                2.918E-18
2   phalp2_27035   6RNBK        3           27.1%          92                 2.205E-11
3   phalp2_29395   1dlYO        26          29.1%          79                 1.855E-09
4   phalp2_3896    6T31a        21          29.3%          75                 6.579E-09
5   phalp2_37712   4caSx        4           28.5%          84                 2.676E-06
6   phalp2_11554   11mVg        1           26.3%          95                 2.441E-05

Domains

Unannotated
Representative sequence (used for alignment): 4Huce (98 AA)
Member sequence: 4GVnO (121 AA)
[Figure: domain map drawn on the representative axis, positions 1 to 98 AA. Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend categories: EAD, CBD, Linker, Disordered, Unannotated.]

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Huce) rather than this protein.
PDB ID: 4Huce
Method: AlphaFold v2
Resolution: 90.49
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
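
The confidence bands above map a pLDDT score to a label. A minimal sketch of that mapping follows; the function name `classify_plddt` is hypothetical, and the handling of scores that fall exactly on 90, 70, or 50 is an assumption, since the legend uses strict inequalities and leaves those values unassigned.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) falls into the lower band.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for example in (95.0, 80.0, 60.0, 30.0):
        print(example, classify_plddt(example))
```

If the 90.49 shown in the Resolution field is in fact a mean pLDDT for the AlphaFold model (the page does not say so), this mapping would place it in the "Very high" band.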