Protein

Protein accession: 1gJGp [EnVhog]
Representative: 6F7cq
Source: EnVhog (cluster: phalp2_4954)
Protein name: 1gJGp
Lysin probability: 97%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MKDVQLEWFGTAAGLRMDPDKHYGLQCVDVIDHYAEYIFGVPWPQSIGGVDGAKNLPRVAPKEYWTWHNVGLPAPGDVLIWGGSPTNAWGHTAVVDFDGPVTPEGAEVIQQNSNGLANQAAHRAFMRWYQGGTGQLTGWLRPNPDKVRGGTSAPGAGGVKLLTVTAPVVIVRTSPRVEPGNVAAAYPNGIAKGARVAAVGYVQGQDPYPNDGVQDDAWVKTKSGYFIWANGLGNNLAGLARL
Physico‐chemical properties
protein length: 242 AA
molecular weight: 25781,7 Da
isoelectric point: 6,51
hydropathy: -0,24
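The length, molecular weight, and hydropathy values above can be approximated directly from the sequence. The following is a minimal sketch, not the database's own pipeline: it assumes average residue masses plus one water molecule for the weight, and the Kyte-Doolittle scale (GRAVY) for hydropathy. The function names `molecular_weight`, `gravy`, and `parse_decimal` are illustrative. The isoelectric point is left out because it depends on the chosen pKa set.

```python
# Average residue masses in Da (standard values); one water is added per chain.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528

# Kyte-Doolittle hydropathy scale (assumed here; the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 1gJGp, copied from the record.
SEQ = (
    "MKDVQLEWFGTAAGLRMDPDKHYGLQCVDVIDHYAEYIFGVPWPQSIGGVDGAKNLPRVA"
    "PKEYWTWHNVGLPAPGDVLIWGGSPTNAWGHTAVVDFDGPVTPEGAEVIQQNSNGLANQA"
    "AHRAFMRWYQGGTGQLTGWLRPNPDKVRGGTSAPGAGGVKLLTVTAPVVIVRTSPRVEPG"
    "NVAAAYPNGIAKGARVAAVGYVQGQDPYPNDGVQDDAWVKTKSGYFIWANGLGNNLAGLA"
    "RL"
)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '25781,7' to a float."""
    return float(text.replace(",", "."))


print(len(SEQ), "AA")
print(round(molecular_weight(SEQ), 1), "Da")
print(round(gravy(SEQ), 2))
```

Small differences from the listed values are expected if the database uses other mass tables or a different hydropathy scale.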
Representative Protein Details
Accession: 6F7cq
Protein name: 6F7cq
Sequence length: 228 AA
Molecular weight: 24648,80740 Da
Isoelectric point: 9,49337
Sequence
MTTFTQYLKPNPSTPCRPGWCLEYVRKAFNLPIKYPTADAAWEASRFKHRDWNFPANVAVPVWFDVKGVPAGHVALRMSDGSVYSSTNPNSNVARRHPSIADLMAVYASAGLPLTYLGWTEDVCDFTVVKPVPAPPKPVQAPSGKLLKVTAPVARVRTSPAVRSNNIAPGYPDGIAKGATVAAVGYVRGEDPFPSDGSTDDAWIKTKSGYYIWANNVGNSLAGLKKLN
Other Proteins in cluster: phalp2_4954
Total (incl. this protein): 8; Avg length: 238,6 AA; Avg pI: 9,06

Protein ID Length (AA) pI
6F7cq 228 9,49337
1jlTD 242 9,85246
6Kmn9 221 8,46833
fs58 236 9,79773
A0A9E7QC99 250 9,45005
A0AA96NG32 245 9,44979
A0AA96R2X2 245 9,44979
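The cluster summary can be cross-checked against the member rows. The sketch below assumes that "incl. this protein" means 1gJGp (242 AA, pI 6,51, taken from its own properties) is counted alongside the seven listed members. The helper name `parse_members` is illustrative.

```python
from statistics import mean

# Member rows as listed in the table (decimal commas kept as in the record).
MEMBERS = """\
6F7cq 228 9,49337
1jlTD 242 9,85246
6Kmn9 221 8,46833
fs58 236 9,79773
A0A9E7QC99 250 9,45005
A0AA96NG32 245 9,44979
A0AA96R2X2 245 9,44979
"""

# Assumption: the record's own protein is the eighth cluster member.
THIS_PROTEIN = ("1gJGp", 242, 6.51)


def parse_members(text: str) -> list[tuple[str, int, float]]:
    """Parse 'ID length pI' rows, converting decimal commas to floats."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        protein_id, length, pi = line.split()
        rows.append((protein_id, int(length), float(pi.replace(",", "."))))
    return rows


rows = parse_members(MEMBERS) + [THIS_PROTEIN]
avg_length = mean(r[1] for r in rows)
avg_pi = mean(r[2] for r in rows)

print(len(rows), round(avg_length, 1), round(avg_pi, 2))  # 8 238.6 9.06
```

With 1gJGp included, the count and both averages agree with the summary line (8 members, 238,6 AA, pI 9,06).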
Similar Clusters (pHMM search)
#  Cluster (protein)      # Members  Identity (%)  Alignment Length  E-value
1  phalp2_2661 (6Wimo)    13         35,1%         202               1.791E-27
2  phalp2_1465 (1Mi2x)    11         33,1%         151               1.093E-19
3  phalp2_22068 (5H7aG)   6          29,9%         224               1.955E-15
4  phalp2_22733 (8bk5h)   6          29,1%         158               5.138E-12
5  phalp2_29700 (1f4uS)   3          30,9%         155               5.686E-11
6  phalp2_22787 (7odQz)   5          30,5%         200               4.620E-10
7  phalp2_6871 (1HYPo)    2          31,0%         161               6.227E-10
8  phalp2_11356 (7uQxV)   2          24,6%         154               7.624E-07
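The pHMM hits above can be loaded into typed records and filtered by E-value and identity. This is a minimal sketch: the `Hit` record, the `parse_hit` and `filter_hits` helpers, and the cutoffs (E-value at most 1e-10, identity at least 30%) are arbitrary illustrations, not thresholds used by the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    protein: str
    members: int
    identity: float  # percent
    aln_len: int
    evalue: float


# Raw cell values as shown in the table (decimal commas in the identity column).
RAW = [
    ("phalp2_2661", "6Wimo", "13", "35,1%", "202", "1.791E-27"),
    ("phalp2_1465", "1Mi2x", "11", "33,1%", "151", "1.093E-19"),
    ("phalp2_22068", "5H7aG", "6", "29,9%", "224", "1.955E-15"),
    ("phalp2_22733", "8bk5h", "6", "29,1%", "158", "5.138E-12"),
    ("phalp2_29700", "1f4uS", "3", "30,9%", "155", "5.686E-11"),
    ("phalp2_22787", "7odQz", "5", "30,5%", "200", "4.620E-10"),
    ("phalp2_6871", "1HYPo", "2", "31,0%", "161", "6.227E-10"),
    ("phalp2_11356", "7uQxV", "2", "24,6%", "154", "7.624E-07"),
]


def parse_hit(cluster, protein, members, identity, aln_len, evalue) -> Hit:
    """Convert one row of strings into a typed Hit."""
    pct = float(identity.rstrip("%").replace(",", "."))
    return Hit(cluster, protein, int(members), pct, int(aln_len), float(evalue))


def filter_hits(hits, max_evalue=1e-10, min_identity=30.0):
    """Keep hits at or below the E-value cutoff and at or above the identity cutoff."""
    return [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]


HITS = [parse_hit(*row) for row in RAW]

for hit in filter_hits(HITS):
    print(hit.cluster, hit.identity, hit.evalue)
```

With these illustrative cutoffs, three clusters remain: phalp2_2661, phalp2_1465, and phalp2_29700.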

Domains

Unannotated
Representative sequence (used for alignment): 6F7cq (228 AA)
Member sequence: 1gJGp (242 AA)
Axis: residues 1 to 228 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage name: Unknown from Metagenome [NCBI]
Phage taxonomy ID: UNKNOWN_ENVHOG
Phage lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6F7cq) rather than this protein.
PDB ID: 6F7cq
Method: AlphaFoldv2
Resolution: 87.36
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
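The confidence bands above map a pLDDT score to a label. The sketch below encodes that mapping; the function name `plddt_band` is illustrative. The legend uses strict inequalities, so the handling of the exact boundary values 50, 70, and 90 is an assumption here: each is assigned to the lower band.

```python
def plddt_band(score: float) -> str:
    """Return the model-confidence label for a pLDDT score in [0, 100].

    Assumption: scores exactly on a boundary (50, 70, 90) fall into the
    lower band, since the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: the 87.36 listed under "Resolution" for this AlphaFold model
# is read here as a mean pLDDT; the record itself does not say so.
print(plddt_band(87.36))  # High
```

If that reading of 87.36 is correct, the representative model would fall in the "High" band.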