Protein

Protein accession
gGch [EnVhog]
Representative
2717o
Source
EnVhog (cluster: phalp2_16914)
Protein name
gGch
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
VSYIIKKLGEMLTLLMLMAVVGSAQIAQADGDPFEYQPVQVIYIEEQSFAQKYANQGPLAELVFQYEAQMTLEALRAETIEAKANLVNAAEDAEADLYYSQQLSIDAAIAALWPHVDKTPYGFGNTPLIWDCSGLTQWYLYHRGFTDVIHSATAQVRDIRSQIVDAPIAGDLVAFQKSGASSYFHIGVYIGGGYMIHASNPQKDTNLQLVQEFADSENSKVVFVRY
Physico‐chemical
properties
protein length:226 AA
molecular weight:25057,0 Da
isoelectric point:4,61
hydropathy:-0,01
Representative Protein Details
Accession
2717o
Protein name
2717o
Sequence length
223 AA
Molecular weight
24934,73680 Da
Isoelectric point
4,62524
Sequence
MKKLGEALSLLVILSAVGSAQVAVADPEPFGYSPTQIVYVQDESFYSRYANEGPLASLAMQQEFQMEMEKLRAETMEARAFLIKEAEEAEADVYYNQQIAVDIAITDLWKHVDSTPYGFGNTPLVWDCSGLTQWYLNHRGFQDVIHSATAQVRDIRSQIVDAPIAGDLVAFQKFGATYDYFHIGVYIGGGLMIHASNPEKDTNLQTVSSFAESENSRVVYLRW
Other Proteins in cluster: phalp2_16914
Total (incl. this protein): 14 Avg length: 221,6 Avg pI: 5,59

Protein ID Length (AA) pI
2717o 223 4,62524
1q3h1 235 6,11459
2Ta8V 222 5,03550
2emWi 212 4,50792
3aVEq 221 4,78950
4bfvq 233 6,43914
4bgQW 234 6,84253
5CbJf 168 7,02714
5b9SY 233 6,11220
5tdLX 221 4,62387
7IKRJ 202 4,67173
7K45n 236 5,99870
80n93 236 6,84781
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16083
4957B
139 39,4% 137 8.197E-28
2 phalp2_896
8m9Rp
1 22,0% 218 7.901E-04

Domains

Domains
Disordered region
NLPC_P60
Representative sequence (used for alignment): 2717o (223 AA)
Member sequence: gGch (226 AA)
1 223 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00877

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2717o) rather than this protein.
PDB ID
2717o
Method AlphaFoldv2
Resolution 82.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50