Protein

Protein accession
waOC [EnVhog]
Representative
1gqnf
Source
EnVhog (cluster: phalp2_18804)
Protein name
waOC
Lysin probability
91%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MILGVDYASIDKNETPNFELAYKSGIRFAIMRGTYGMWVDPVRVRDLQKSSKLMTTGSYLFPIYTDDPIAEANAFADAVGPLTHGSFPPILDIEFPGGILATNKTPKQAVDWLTQLGLQLHKRYGALMLYTSGRVWHEDLQDAISVFFSQCPLWLARYSFKTRDEAHFTDKVLPGVVPPQLGDSDDYWIHQFQGDAVHCPGFSSTVDLNLFRILRLGTRGSRVSWLMDRLGMPRSKIEKENVFDAQVEKVIKEYQESHNLTIDGIVGPVTFSHVCWENPS
Physico‐chemical
properties
protein length:280 AA
molecular weight:31620,7 Da
isoelectric point:5,75
hydropathy:-0,20
Representative Protein Details
Accession
1gqnf
Protein name
1gqnf
Sequence length
221 AA
Molecular weight
24220,67360 Da
Isoelectric point
8,74386
Sequence
MFPVITPKAASPEAQVATFIRAMKNAGGIVPGVDLPPTLDIEFPGNGIATTGMKLTDVINWLMTAVGALRKEYGTAPMIYTSKRVVWEDLQNKLPAELSDCIPWIKSAYRLKARQSVDKVVPKEPPIPPPFSWWAIQQYQGDALNFKGFSSTVDVNRFNTQSVGCRGAFVSWIQRKLAMAEGSPGLFDTATAEAVKEFQNKINTQIDGSVGIGTFSHLCWL
Other Proteins in cluster: phalp2_18804
Total (incl. this protein): 30 Avg length: 284,2 Avg pI: 6,31

Protein ID Length (AA) pI
1gqnf 221 8,74386
1Z47K 293 8,24301
1zFkz 295 8,39019
2SA3B 283 5,43622
2SnFa 283 5,43622
2kJVg 304 5,69063
2mFSE 290 7,60934
3gGHe 274 5,67147
4FnHx 279 6,09561
4K356 272 5,16208
4KBrm 296 5,50897
4NtaN 290 5,65294
4Y4uA 293 5,18920
4cq3W 300 8,38213
4dKYF 275 5,37722
5fcIX 299 7,70006
5kAHS 299 8,31289
6GBaa 256 5,80272
6GBcH 284 5,94623
6HWXB 238 5,88115
6I0ly 298 5,29105
6QhCm 304 5,96533
6W1cD 297 7,60269
6WH6k 285 6,08543
7Ga7X 296 6,10697
bmSO 291 5,48697
fJkj 280 5,59764
gjCL 248 5,33448
wbVW 323 5,84165
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39458
4VDTp
155 18,6% 274 3.963E-13
2 phalp2_5981
6El6s
21 19,7% 248 1.802E-12

Domains

Domains
Unannotated
Representative sequence (used for alignment): 1gqnf (221 AA)
Member sequence: waOC (280 AA)
1 221 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gqnf) rather than this protein.
PDB ID
1gqnf
Method AlphaFoldv2
Resolution 96.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50