Protein

Protein accession
4Bu7w [EnVhog]
Representative
7G0GZ
Source
EnVhog (cluster: phalp2_20670)
Protein name
4Bu7w
Lysin probability
98%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MANIYPENKRYQAVDTAVQNVGKVAATTRKAAKSFDANKYSGITSNIKTYSPSYKPTANAQEKISSLGQVSVPYGGSTRYESFHPGVDIANKIGTPIPSFSGGTVTESTSGFKQGDKGYGNTVIIQDASGNKWRYSHLNNGYVKVGQKIQPGTIIGQMGNTGQTYSTSGGTGSHLDLRIRDAYNKYIDPYSLL
Physico-chemical properties
protein length: 193 AA
molecular weight: 20736,8 Da
isoelectric point: 9,58
hydropathy: -0,61
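The length and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes that the hydropathy figure is a GRAVY score, that is, the mean Kyte-Doolittle hydropathy index per residue. The page does not state which scale it uses, so the result may differ from the reported -0,61. The names `sequence_length`, `gravy`, and `parse_decimal_comma` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy index per amino acid.
# Assumption: the page's "hydropathy" value is based on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_length(seq: str) -> int:
    """Return the number of residues, ignoring surrounding whitespace."""
    return len(seq.strip())


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY).

    Raises KeyError for residues outside the 20 standard amino acids
    and ValueError for an empty sequence.
    """
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Convert a value printed with a decimal comma, e.g. '9,58', to float."""
    return float(text.replace(",", "."))


if __name__ == "__main__":
    # First ten residues of the sequence shown above, used only as a demo.
    fragment = "MANIYPENKR"
    print(sequence_length(fragment), round(gravy(fragment), 2))
    print(parse_decimal_comma("9,58"))
```

Passing the full 193-residue sequence to `gravy` gives a value that can be compared with the reported hydropathy; a mismatch would suggest that the page uses a different scale or averaging scheme.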
Representative Protein Details
Accession
7G0GZ
Protein name
7G0GZ
Sequence length
196 AA
Molecular weight
21311,49310 Da
Isoelectric point
8,70899
Sequence
MPLTEELRGQQFGANQDENANFQNIASALSIPLRIRNQIQPQGSNIQKRSPKTYGNIEQFISDMGTITTPFMGSTRSEAEHPGIDIANKIGTAIKAFAPGVVKEVVTGKKQGDKAYGNYVVVEDPYGAKHRYSHLSQSYVRVGDRINAGDDIASMGATGNTYSTTGGTGSHLDYRIRDAAGIYLNPYSYLAKFLNS
Other Proteins in cluster: phalp2_20670
Total (incl. this protein): 5 Avg length: 197,2 Avg pI: 9,53

Protein ID Length (AA) pI
7G0GZ 196 8,70899
1locp 198 9,80475
2eeMs 206 9,79837
2umwu 193 9,77562
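The cluster totals above include this protein (4Bu7w, 193 AA, pI 9,58) alongside the four members listed in the table. As a quick consistency check, the averages can be recomputed from those five rows. This is a minimal sketch; `CLUSTER_MEMBERS` and `cluster_summary` are hypothetical names, and the pI strings keep the decimal commas printed on the page.

```python
from statistics import mean


def parse_decimal_comma(text: str) -> float:
    """Convert a value printed with a decimal comma to float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
CLUSTER_MEMBERS = [
    ("4Bu7w", 193, "9,58"),      # this protein, from its property block
    ("7G0GZ", 196, "8,70899"),   # cluster representative
    ("1locp", 198, "9,80475"),
    ("2eeMs", 206, "9,79837"),
    ("2umwu", 193, "9,77562"),
]


def cluster_summary(members):
    """Return member count, mean length, and mean pI for a cluster."""
    lengths = [length for _, length, _ in members]
    pis = [parse_decimal_comma(pi) for _, _, pi in members]
    return {
        "total": len(members),
        "avg_length": mean(lengths),
        "avg_pi": mean(pis),
    }


summary = cluster_summary(CLUSTER_MEMBERS)
print(summary["total"],
      round(summary["avg_length"], 1),
      round(summary["avg_pi"], 2))
```

Rounded to the precision shown on the page, the recomputed values agree with the reported average length of 197,2 and average pI of 9,53.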
Similar Clusters (pHMM search)
#   Cluster (accession)    # Members  Identity (%)  Alignment Length  E-value
1   phalp2_18865 (1IU43)   9          53,3%         133               1.767E-52
2   phalp2_21998 (4ZChj)   22         35,7%         154               7.070E-19
3   phalp2_20604 (4H0aV)   4          34,0%         138               1.314E-18
4   phalp2_5933 (67ENK)    29         33,3%         138               2.130E-17
5   phalp2_34915 (4pbkX)   159        31,9%         141               2.972E-15
6   phalp2_18623 (3ebV)    83         32,6%         144               1.385E-14
7   phalp2_27905 (l4Bb)    39         28,8%         142               1.617E-13
8   phalp2_21800 (4e6HN)   71         31,4%         127               1.617E-13
9   phalp2_18491 (6LzOo)   5          29,5%         132               6.372E-12
10  phalp2_39519 (5tggt)   99         32,8%         137               3.974E-11

Domains

Annotated regions: Disordered region, PET_M23
Pfam accessions: PF01551
Representative sequence (used for alignment): 7G0GZ (196 AA)
Member sequence: 4Bu7w (193 AA)
Domain positions follow the representative sequence (residues 1 to 196); the member sequence bar is scaled to the same axis.
Legend categories: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7G0GZ) rather than this protein.
PDB ID: 7G0GZ
Method: AlphaFold v2
Resolution: 77.97
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
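The confidence bands listed above can be expressed as a small lookup on a pLDDT score. This is a sketch only: the function name `plddt_band` is hypothetical, and because the page uses strict inequalities, the handling of scores exactly equal to 90, 70, or 50 (assigned here to the lower band) is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named on the page.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, since the page only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    for value in (95.2, 80.0, 61.5, 32.0):
        print(value, plddt_band(value))
```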