Protein

Protein accession: 2bagu [EnVhog]
Representative: 2GFKA
Source: EnVhog (cluster: phalp2_6921)
Protein name: 2bagu
Lysin probability: 97%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MDLLIIDAGHGGNDPGASKFGYVEKDLTLVIGKRVQELLSIYNPHITRTIDKTIEPNDRTTLIRDKYQYCLSIHLNAGNGNGVEAIHSTFSTKGKRIAELITNELKATGLPLRNTPVFSKTRSDGKDYYYMHRDTGNTVTVIVECLFLDNADNIKLLNVEAIAQSIAKGFKTFIEGIVKVDNPTNRSKYYKNGDSHIIETTPGNIDIKIIGDTLNVVGLNGINGTFFDTPKPELANSCWAIATNEGKAIGGNAMLVSYNKDIKRGTIVYDDDGSIEIVRVNSINEFSKPHRWSISGYSVYPYLNFEEEKMSGGINYKTAHTYIGFKGNKIYLIVKPNHLIKEILPLIKDLGLEGCIVLDGGGSSQLKHPNGNYNSSRKINSAVILKQI
Physico-chemical properties
protein length: 388 AA
molecular weight: 42859,3 Da
isoelectric point: 8,41
hydropathy: -0,29
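The hydropathy figure is presumably a grand average of hydropathy (GRAVY), but the page does not name the scale it uses. As a minimal sketch, assuming the Kyte-Doolittle scale, a value of this kind could be recomputed from a sequence as follows. The function name `gravy` is illustrative and not part of PhaLP.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumption: the page does not state which scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY).

    Raises KeyError for symbols outside the 20 standard amino acids.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    fragment = "MDLLIIDAGHGGNDPGASK"  # first residues of 2bagu, for illustration only
    print(len(fragment), round(gravy(fragment), 2))
```

Running the same function on the full sequence would show whether the reported -0,29 matches this scale. A mismatch would suggest a different scale or averaging rule.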
Representative Protein Details
Accession: 2GFKA
Protein name: 2GFKA
Sequence length: 442 AA
Molecular weight: 49323,81060 Da
Isoelectric point: 9,05099
Sequence
VRIAVSAGHNVYVNGIFDPGATRNPYIEAEITKETVSILIPMLRAQGHEVIDVTPYNERFASSKAHHELRCSRVDTFKADIFLDIHINAGGGTGVEVWVYSKKSKACPYAEKVADNISKDMNLSNRGVREKPSYWSVSLCKAPAMIIEGAFIDNKSDMEKLTPEKYARAIAKAFGEVKEPIPAEEKKDILYRVQVGAYTVKANAEKMLKELQKAGFKGFITETNTGEPKEIEKPLTKPLSEYYEKYGLKIIETDPDNIYVAVLPGKSLREFGIYGVNGTWQNNPEAHLPRSIWGLAGNGNKAIGPNSHTNSPNDHKRGTIIYYEDGALEIKRINHIKEITKPFKWCIGGGSLIPNIIDEEKFASDIYRFTHHTGIGYKGNRIYLIVTSTHCSMAEFRNRVLKLELDKAIFLDGGGSTQMNYKGNKGIHSSRKLSHGVFLKKA
Other Proteins in cluster: phalp2_6921
Total (incl. this protein): 6; Avg length: 428,3; Avg pI: 8,94

Protein ID Length (AA) pI
2GFKA 442 9,05099
32Gs 452 9,24478
4SDHK 392 8,84198
5Pxd6 444 9,07781
7Get7 452 9,03365
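The cluster averages can be checked against the listed members plus this protein (2bagu, 388 AA, pI 8,41). The sketch below assumes plain arithmetic means and parses the page's decimal-comma notation. The helper names are hypothetical.

```python
def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '9,05099'."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) as listed on the page.
# 2bagu is the record itself and counts towards the total of 6.
CLUSTER = [
    ("2GFKA", 442, "9,05099"),
    ("32Gs", 452, "9,24478"),
    ("4SDHK", 392, "8,84198"),
    ("5Pxd6", 444, "9,07781"),
    ("7Get7", 452, "9,03365"),
    ("2bagu", 388, "8,41"),
]


def cluster_summary(rows):
    """Return (member count, mean length, mean pI) for the given rows."""
    n = len(rows)
    mean_length = sum(length for _, length, _ in rows) / n
    mean_pi = sum(parse_decimal(pi) for _, _, pi in rows) / n
    return n, mean_length, mean_pi


if __name__ == "__main__":
    n, mean_length, mean_pi = cluster_summary(CLUSTER)
    print(n, round(mean_length, 1), round(mean_pi, 2))  # 6 428.3 8.94
```

The result agrees with the summary line, which confirms that the reported averages include this protein as well as the five listed members.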
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_35731 (45zz1) 4 24,6% 467 1.674E-31
2 phalp2_32481 (1c5or) 14 22,7% 465 1.417E-29
3 phalp2_36520 (1qvSh) 1 20,7% 457 9.505E-26
4 phalp2_12996 (3nSWw) 2 21,1% 434 5.793E-22
5 phalp2_21191 (14MMo) 2 20,0% 433 7.733E-22
6 phalp2_5941 (6ffn9) 61 20,7% 462 1.676E-15

Domains

Representative sequence (used for alignment): 2GFKA (442 AA)
Member sequence: 2bagu (388 AA)
Axis: residues 1 to 442 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated
Pfam accessions: PF01520, PF05036, PF09992

Taxonomy

Phage: Unknown from Metagenome (Taxonomy ID: UNKNOWN_ENVHOG; no lineage information)
Host: no host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2GFKA) rather than this protein.
PDB ID: 2GFKA
Method: AlphaFold v2
Resolution: 92.30
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
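The confidence bands can be expressed as a small lookup. This is a sketch only: the legend uses strict inequalities and leaves the values 90, 70 and 50 unassigned, so placing each boundary in the higher band is an assumption made here. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: scores of exactly 90, 70 or 50 go to the higher band;
    the legend itself does not assign these boundary values.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt >= 90:
        return "Very high"
    if plddt >= 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 42.0):
        print(score, plddt_band(score))
```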