Protein

Protein accession
16GfC [EnVhog]
Representative
bmLo
Source
EnVhog (cluster: phalp2_30781)
Protein name
16GfC
Lysin probability
99%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MAGKHRKPSPARELIPTAAGATVTALTLVTTTGTMPAITAPVKTEAVTQPIRRPQPAVPHEPLMMLAPPMLAPASVEWYPSYTPPPQYYMRQRPVVQRQQPRHASSVMTRTAAPAPAQAPAPRHAAPQHTQTPVPQQAPIKQEPVLTAAVKAVHAAVSQVGSRYVWGGDSPGSFDCSGLVQWAYRQSGINLPRTAAAQAGAGRPVSMSALRPGDLLFYRYGGGIEHVTMFVGNGQIAEAATFGVPVHVRPLYTEGLVEARRLIN
Physico‐chemical
properties
protein length:264 AA
molecular weight:28174,1 Da
isoelectric point:10,34
hydropathy:-0,20
Representative Protein Details
Accession
bmLo
Protein name
bmLo
Sequence length
214 AA
Molecular weight
21751,11950 Da
Isoelectric point
10,11614
Sequence
MAGKHRAASSARSLVPVAAGATAAVVTLLVGGVAVAAPVKPAAVQAPVRVDPVRFAYPPFTPAQAALYLPPLLPAPVEASYVAPAPTVKAAGVAGGIAARAVAAALGQQGVPYRWGGASPAGFDCSGLVLWAYAKVGISLPHFAADQAAQGRSVSLDQIQPGDLIFYYRPIKHVAMVVTGGRNPQIVEAPTFGVPVHVRPLYTNGLAVIKRLVG
Other Proteins in cluster: phalp2_30781
Total (incl. this protein): 6 Avg length: 221,7 Avg pI: 9,91

Protein ID Length (AA) pI
bmLo 214 10,11614
6SCui 218 9,79650
6SNwP 196 7,05658
bb91 204 11,77607
loS8 234 10,38394
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2894
wl1m
22 37,0% 143 6.881E-29
2 phalp2_16997
8aQrJ
8 29,3% 201 2.555E-26
3 phalp2_22894
2Sg1U
2 29,8% 144 5.723E-25
4 phalp2_7359
4vhzG
1 32,0% 156 5.723E-25
5 phalp2_8990
3SsWr
1 33,1% 157 6.863E-24
6 phalp2_27467
7KKUL
3 34,1% 196 1.375E-19
7 phalp2_12851
1ltcQ
3 26,7% 228 8.737E-15

Domains

Domains
Disordered region
NLPC_P60
Representative sequence (used for alignment): bmLo (214 AA)
Member sequence: 16GfC (264 AA)
1 214 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00877

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (bmLo) rather than this protein.
PDB ID
bmLo
Method AlphaFoldv2
Resolution 73.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50