Protein

Protein accession: 4wbQ2 [EnVhog]
Representative: 4GF1h
Source: EnVhog (cluster: phalp2_14454)
Protein name: 4wbQ2
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence:
MSTMPTYTVKRGDYPSSIAKKFTGSEGRHAELLDANPDFRKGNGRWKTLKTGDVLQLPAGWADGADRETQPALPEPEPESAPAYCAVCEHDLCMCAKTPEPPPGSRFPATKTKETPTAVYAALQRAWLKLFNDVPSPESLSVLVAHWALETGWGDSCWNWNLGNVKSRPGDGLDHTYYPCWEVLPKAAAFALAKNAGVRADGKPGPDVVVATEQDDGTAVVWFYPDHNMCRFRAFRSLDAGAEEYLKVLFHRFKRAWASVTTGDPSGFAAALKGMDYFTAPLDGPKGYRTALVSVFNRVAAQVKRAA
Physico-chemical properties
Protein length: 307 AA
Molecular weight: 33556.6 Da
Isoelectric point: 7.58
Hydropathy: -0.40
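The page does not say how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY score (the mean of per-residue Kyte-Doolittle hydropathy values); the function names `gravy` and `summarize` are illustrative and not part of the database:

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(sequence: str) -> dict:
    """Length in residues plus GRAVY rounded to two decimals."""
    seq = sequence.strip().upper()
    return {"length": len(seq), "gravy": round(gravy(seq), 2)}


if __name__ == "__main__":
    # First 20 residues of 4wbQ2, used only as a short demonstration input.
    print(summarize("MSTMPTYTVKRGDYPSSIAK"))
```

Passing the full protein sequence to `summarize` gives the length and, if the GRAVY assumption holds, a value comparable to the hydropathy listed above.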
Representative Protein Details
Accession: 4GF1h
Protein name: 4GF1h
Sequence length: 299 AA
Molecular weight: 32587.55150 Da
Isoelectric point: 9.06891
Sequence:
MADDTHKPAPSNGKNPVLRRGDHGPAVATWQKIIKVDADGVFGEVTEKATRVFQMMNGLKPDGIVGNKTWARAVSLSEPQPDPEPERPLTTRLPAKRTKVTPEELYAALARLAPELSRDALLVLLGHWGLETGDGAGCWGYNLGNVKGRPGGSDGRSWQFFACNEILAPSAANAIVARAGDREPRRSEDDPAAFGRPEFDKVSKSAVTTSTTANGQVVLWAFPPHPVCCFRAFRTLEDGAKDYLEILKKQFGSSWPFVIAGDTAGFSHALKLKGYYTADESHYAKALKSRRDKFAKLIQ
Other Proteins in cluster: phalp2_14454
Total (incl. this protein): 2; Avg length: 303.0; Avg pI: 8.33

Protein ID  Length (AA)  pI
4GF1h       299          9.06891
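The cluster averages can be checked from the two member records on this page. A minimal sketch, assuming the averages are plain arithmetic means; `cluster_summary` is an illustrative name, and the pI for 4wbQ2 is the rounded value displayed above:

```python
# (protein ID, length in AA, isoelectric point) as displayed on this page.
members = [
    ("4wbQ2", 307, 7.58),      # this protein (pI shown rounded to 2 decimals)
    ("4GF1h", 299, 9.06891),   # cluster representative
]


def cluster_summary(rows):
    """Member count plus arithmetic means of length and pI."""
    if not rows:
        raise ValueError("cluster has no members")
    n = len(rows)
    return {
        "total": n,
        "avg_length": sum(length for _, length, _ in rows) / n,
        "avg_pi": sum(pi for _, _, pi in rows) / n,
    }


summary = cluster_summary(members)
print(summary["total"], summary["avg_length"], round(summary["avg_pi"], 2))
```

The average length reproduces the listed 303.0. The mean pI computed from the displayed values is about 8.32 rather than the listed 8.33; this is probably because the page averages an unrounded pI for 4wbQ2, though the page does not confirm it.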
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_12780 (8asgj)  7          42.0          200               1.015E-41
2  phalp2_24286 (3gmlB)  71         37.7          209               8.457E-34
3  phalp2_10542 (8aiy3)  19         32.6          196               1.257E-24
4  phalp2_23921 (1O9wW)  65         34.8          204               6.472E-23
5  phalp2_30410 (4RtBC)  33         29.9          207               3.283E-21
6  phalp2_38073 (6x3xX)  3          27.7          263               4.407E-18
7  phalp2_10695 (31VV0)  4          28.2          205               6.939E-16
8  phalp2_18002 (3ao4M)  1          32.8          195               9.943E-15
9  phalp2_33015 (4jrKx)  2          24.5          212               1.183E-08

Domains

[Figure: domain architecture. Track labels: PG_1, Unannotated]
Representative sequence (used for alignment): 4GF1h (299 AA)
Member sequence: 4wbQ2 (307 AA)
Axis: positions 1 to 299 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471

Taxonomy

       Name                            Taxonomy ID     Lineage
Phage  Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4wbQ2
Method: AlphaFoldv2
Resolution: 87.20
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
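The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. It assumes the value listed under "Resolution" for these AlphaFold models is a mean pLDDT score, which the page does not state. The legend also leaves the boundary values 90, 70 and 50 unassigned, so this sketch places each in the lower band. `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:   # assumption: exactly 90 counts as "High"
        return "High"
    if score > 50:   # assumption: exactly 70 counts as "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 counts as "Very low"


if __name__ == "__main__":
    # Values listed on this page for 4wbQ2 and its representative 4GF1h.
    for name, value in [("4wbQ2", 87.20), ("4GF1h", 87.61)]:
        print(name, value, plddt_band(value))
```

Under these assumptions both models fall in the "High" band.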

The structures below correspond to the cluster representative (4GF1h) rather than this protein.
PDB ID: 4GF1h
Method: AlphaFoldv2
Resolution: 87.61
Chain position: -