Protein

Protein accession: 6GygZ [EnVhog]
Representative: 7Kh8p
Source: EnVhog (cluster: phalp2_58)
Protein name: 6GygZ
Lysin probability: 99%
PhaLP type: endolysin (probability: 84%, predicted by ML model)
Protein sequence
MSIFSKIADTQIPDFNVVYDNYKKEVASNKVIVNNILTSYGASIDKWGSVFEIPKSTLVALIAIESGGKQVGKNSAGAIGLMQVKEITVRECVSRFKTFTGQSMPTLAYNELKSKAPYLLNLTVNNQNLSSANTRLLEQKLSSDANFNIMIGTLCFRVALDATKVNGTSFLNKAIIAYNTGVYGRIRSKYDNKKVSTLQLFKDTGFQKETRNYLAKSLGKYGFIQVYIDDSLV
Physico-chemical properties
protein length: 233 AA
molecular weight: 25792.4 Da
isoelectric point: 9.54
hydropathy: -0.11
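The length and hydropathy values above can be recomputed from the sequence. The following is a minimal sketch, assuming the reported hydropathy is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the page does not state which scale it uses, and the names `KYTE_DOOLITTLE` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A non-standard residue code raises KeyError rather than being skipped.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration only: the first 20 residues of 6GygZ, not the full sequence.
prefix = "MSIFSKIADTQIPDFNVVYD"
print(len(prefix), round(gravy(prefix), 2))
```

Passing the full 233-residue sequence to `gravy` should give a value comparable to the hydropathy listed above, provided the assumption about the scale holds.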
Representative Protein Details
Accession: 7Kh8p
Protein name: 7Kh8p
Sequence length: 229 AA
Molecular weight: 25923.22320 Da
Isoelectric point: 9.82094
Sequence
MFTIKLPLIAKKPELPFLRDLKNVTPKMISTYGKWFKEASLGTNVPLSVLYAMAMVESTGNHYTKSGTVNVTGAERSVGIMQISPASLYETFKFEIKRNRLTPQSQAIIKKYLPNFKYTLGKFIPMSPNKSTLDMFFEALKNPEFNIWASSIVLRRLLEDTAQIDGTMRLDKAIVKYNVGEYSRPTKTMAYKLGDTTTLIKSANLPIITKYYIVKAVGIDGAMMYFRNV
Other Proteins in cluster: phalp2_58
Total (incl. this protein): 8; Avg length: 236.8 AA; Avg pI: 9.47

Protein ID Length (AA) pI
7Kh8p 229 9.82094
18ifP 237 9.41266
4f3FS 231 9.25361
4gGaP 253 9.58879
6zDai 237 9.33568
7K9IM 237 9.45501
Uhpj 237 9.38191
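The summary line can be checked against the table: the seven listed members plus this protein (233 AA, pI 9.54) reproduce the stated averages. A minimal sketch of that arithmetic, with values copied from this page and illustrative variable names:

```python
# (length in AA, pI) for the seven members listed in the table.
members = {
    "7Kh8p": (229, 9.82094),
    "18ifP": (237, 9.41266),
    "4f3FS": (231, 9.25361),
    "4gGaP": (253, 9.58879),
    "6zDai": (237, 9.33568),
    "7K9IM": (237, 9.45501),
    "Uhpj": (237, 9.38191),
}
# This protein (6GygZ) is not listed in the table but is counted in the total.
members["6GygZ"] = (233, 9.54)

avg_length = sum(length for length, _ in members.values()) / len(members)
avg_pi = sum(pi for _, pi in members.values()) / len(members)

print(len(members), avg_length, round(avg_pi, 2))  # 8 236.75 9.47
```

The mean length of 236.75 AA corresponds to the rounded figure in the summary line.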
Similar Clusters (pHMM search)
# Cluster (protein) # Members Identity (%) Alignment Length E-value
1 phalp2_8500 (1ieaB) 80 32.6% 205 2.242E-31
2 phalp2_21962 (7Eddr) 1 27.7% 191 1.309E-18
3 phalp2_16087 (4a7iy) 1 28.4% 158 6.055E-18
4 phalp2_31599 (4oBb8) 11 27.3% 252 3.080E-14
5 phalp2_19772 (7LC4r) 1 28.6% 213 1.744E-11
6 phalp2_12785 (8nCAL) 2 25.9% 204 1.774E-07

Domains

Unannotated
Representative sequence (used for alignment): 7Kh8p (229 AA)
Member sequence: 6GygZ (233 AA)
Domain positions follow the representative sequence above (axis: residues 1 to 229); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage name: Unknown from Metagenome
Phage taxonomy ID: UNKNOWN_ENVHOG
Phage lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 6GygZ
Method: AlphaFoldv2
Resolution: 86.68
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
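The confidence bands above can be expressed as a small lookup. This is a sketch under two assumptions: that the value labelled "Resolution" for these AlphaFold models is a mean pLDDT score (the page does not say so explicitly), and that scores of exactly 90, 70, or 50 fall into the lower of the two adjacent bands (the legend uses strict inequalities and leaves them unassigned). `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band of the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: a score of exactly 50 is treated as "Very low".
    return "Very low"


print(plddt_band(86.68))  # High
print(plddt_band(93.40))  # Very high
```

Under those assumptions, the model for 6GygZ (86.68) falls in the "High" band and the model for the representative 7Kh8p (93.40) in the "Very high" band.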

The structures below correspond to the cluster representative (7Kh8p) rather than this protein.
PDB ID: 7Kh8p
Method: AlphaFoldv2
Resolution: 93.40
Chain position: -