Protein

Protein accession
6UW7w [EnVhog]
Representative
4PA5c
Source
EnVhog (cluster: phalp2_13495)
Protein name
6UW7w
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGGRYLTDLADVLRRAGLTVTEVDGWQTRSRSSGGYDSGRPTHVMVHHTASGPSSDGWPDVNYMTYSSDNRPVANLYVNRAGAWWVMAAGATNTNGKGGPVDGCPADSMNTHAIGIEAGNNGTGEPWPPAQQEAYTTAVAALCDAYDIPTGRVLSHAEWAPDRKIDPAGPSRWAPSGTWPMDPFRADVAPGGPGGAHPPPHDHAP
Physico-chemical properties
protein length: 205 AA
molecular weight: 21566,4 Da
isoelectric point: 5,49
hydropathy: -0,59
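
The record does not say how the hydropathy value is computed. As a minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue), which the record does not confirm, the value could be recomputed from the protein sequence as follows. The function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the record does not state which hydropathy scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A non-standard residue letter raises KeyError rather than being skipped.
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)
```

Calling `gravy(...)` on the protein sequence listed above gives a value to compare with the reported hydropathy; a mismatch would suggest the database uses a different scale.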
Representative Protein Details
Accession
4PA5c
Protein name
4PA5c
Sequence length
166 AA
Molecular weight
17901,63540 Da
Isoelectric point
6,68906
Sequence
MGSRYLTDLANVCRSLGVTVHEENGWQTRARSAGGYNAGLPNHVMCHHTASNPSSDGQSDVNYMCYGSDNRPIANLYLSRKGHIWVMAAGATNTNGKGHDSWGGGVPNDSMNGYAIGIEAANNGVGEPWPAVQQNVYIMLVSALCAHYGIPNNNVRAHFEWAPDRK
Other Proteins in cluster: phalp2_13495
Total (incl. this protein): 8; Avg length: 179,5 AA; Avg pI: 5,34

Protein ID   Length (AA)   pI
4PA5c        166           6,68906
1IpBO        192           5,87496
4HUxc        198           4,83492
5kj8J        140           4,91853
6UUUn        279           4,67190
e61q         98            4,60006
e7cP         158           5,65255
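
The cluster summary can be cross-checked against the table. The sketch below assumes the averages are plain arithmetic means over the seven listed proteins plus this protein (205 AA, pI 5,49, taken from its physico-chemical properties); decimal commas are written as decimal points.

```python
# (protein ID, length in AA, pI) as listed in the record.
members = [
    ("6UW7w", 205, 5.49),      # this protein, not repeated in the table
    ("4PA5c", 166, 6.68906),
    ("1IpBO", 192, 5.87496),
    ("4HUxc", 198, 4.83492),
    ("5kj8J", 140, 4.91853),
    ("6UUUn", 279, 4.67190),
    ("e61q", 98, 4.60006),
    ("e7cP", 158, 5.65255),
]

# Arithmetic means over all eight cluster members.
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(len(members), avg_length, round(avg_pi, 2))  # prints: 8 179.5 5.34
```

Under that assumption the recomputed values agree with the reported total of 8, average length of 179,5 and average pI of 5,34.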
Similar Clusters (pHMM search)
#    Cluster (protein ID)     # Members   Identity (%)   Alignment Length   E-value
1    phalp2_33273 (5GVax)     183         64,8           168                6.155E-89
2    phalp2_40201 (2esCA)     52          38,9           149                4.662E-30
3    phalp2_12497 (QM59)      22          36,4           173                2.245E-29
4    phalp2_26033 (6Xke2)     3           41,3           116                1.201E-26
5    phalp2_9395 (8DfXZ)      44          30,3           188                7.901E-26
6    phalp2_7405 (4FizF)      295         28,2           177                2.393E-18
7    phalp2_2296 (4Fcqb)      4           28,4           176                2.907E-17
8    phalp2_6052 (7IofZ)      9           26,0           173                7.411E-17
9    phalp2_16202 (4HSUT)     2           30,9           139                6.568E-16
10   phalp2_10382 (1L8PE)     1           29,0           162                1.224E-15

Domains

Representative sequence (used for alignment): 4PA5c (166 AA)
Member sequence: 6UW7w (205 AA)
Axis: positions 1 to 166 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

        Name                             Taxonomy ID      Lineage
Phage   Unknown from Metagenome [NCBI]   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4PA5c) rather than this protein.
PDB ID
4PA5c
Method AlphaFoldv2
Resolution 97.41
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
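
The confidence bands above can be expressed as a small lookup. This is a sketch only: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of the exact boundary values (90, 70, 50) is an assumption. Treating the 97.41 listed under "Resolution" as a pLDDT-style score is likewise an assumption, since the record does not label it as such.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"      # assumption: exactly 90 falls in this band
    if plddt > 50:
        return "Low"       # assumption: exactly 70 falls in this band
    return "Very low"      # assumption: exactly 50 falls in this band


print(plddt_band(97.41))  # prints: Very high
```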