Protein

Protein accession: 14s00 [EnVhog]
Representative: 1goRp
Source: EnVhog (cluster: phalp2_19059)
Protein name: 14s00
Lysin probability: 98%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MDTGIIQSAGQPVANPLPVIWLKLPTAPLTLPDWAKFGRLSAKEIYNLLAQIGYDQSQWNYYAVGSDNQLGRYQFTAQTLESYGLLLPGSVAAYGSACVNHRQCWTPAYVRSATSSYANYLYSTLSLWDFLSNQAAQDHLAYQVLYDLYVELSKTDGILDADGADVVAGMLSVAWTLGTSGATTWRQTGAGSGIPAFNSGRYAITVLSS
Physico-chemical properties
Protein length: 209 AA
Molecular weight: 22642,1 Da
Isoelectric point: 4,91
Hydropathy: 0,00
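
The page does not say how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), might look like the following. The function name `gravy` and the choice of scale are assumptions made for illustration, not something the record confirms.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumed scale: the page does not name the one it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of 14s00, used only to show the call.
    fragment = "MDTGIIQSAG"
    print(len(fragment), round(gravy(fragment), 2))
```

Passing the full protein sequence instead of the fragment gives a value that can be compared with the hydropathy listed above, bearing in mind that a different scale would give a different number.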
Representative Protein Details
Accession: 1goRp
Protein name: 1goRp
Sequence length: 220 AA
Molecular weight: 23984,52580 Da
Isoelectric point: 5,63055
Sequence
MADLGFLQAAGIPVPTPLPVSWLGRADAPFSLPSWATIDLLSADQNRNLLSQIGYDKSTWNYKLIGPQNQLGRYQFSTQVLENYGLLAPGSNKAYGTACVNYQTCWRPVTIRNTNSYANYIYNITSLNGFLNSPASQDHLAYQILYDTYRSLLSVSAITNADSTDIVAGMLYVGWDLGAGATPTYDNPTGTGAYAWRYSGQGIGVNSFNSGRYSIVMLGQ
Other Proteins in cluster: phalp2_19059
Total (incl. this protein): 78; Avg length: 222,2 AA; Avg pI: 5,52

Protein ID Length (AA) pI
1goRp 220 5,63055
15mTe 259 7,65982
17KP3 218 5,25104
18vRf 257 8,84656
1AGVG 258 7,65027
1AP5k 257 6,83662
1LLvJ 225 5,01004
1QHQt 221 4,44978
1YSoH 220 5,02794
1evsy 215 5,27866
1hATY 221 6,02945
1hCIM 223 9,09709
1hn20 211 4,60307
1hoYA 222 4,69265
1hzvH 222 5,23961
20jMu 222 6,51684
25g2w 222 5,37091
25hfH 222 5,85813
2826o 222 5,08637
2DMIL 221 5,45304
2Dy5u 235 5,08222
2Ero4 222 6,51684
2LuHY 215 5,20153
2LuUq 240 6,00194
2REX 221 6,02035
2bDsm 223 4,99964
2bFZM 219 5,53148
2bFlN 222 4,69265
2bGdv 221 4,82975
2bGim 221 5,21551
2cOZo 208 4,95007
2lpYA 219 6,24754
2lqjT 220 5,17464
2lrf8 220 6,25959
33DCV 257 6,73635
34ny 207 4,60733
3WpxM 215 5,33305
3b4La 220 6,95160
3bcVO 212 5,14594
3jzCp 222 5,07035
466Dc 231 7,00435
46rBD 221 5,85199
47Gzd 230 4,96576
47lx8 223 4,77092
47mdx 223 8,88724
4838x 209 4,66076
49j2V 215 4,98065
4V9iP 220 4,71169
4Ysvg 221 4,97667
4aMCg 215 4,83236
4aYyj 215 4,83066
4anvG 220 4,79593
4azBT 215 4,88329
4fCQ2 221 6,28573
4rmlc 208 4,51014
5mzqb 208 6,08242
5ov4O 222 4,87016
5wf4y 215 5,33311
6KXnz 257 6,32262
6QAI6 230 4,65894
6QAxG 210 4,70055
6QBaq 221 4,93603
6QBkv 224 6,01364
7LZlr 220 5,03454
7Vu1u 217 5,25183
RNEu 212 4,80678
SAW1 220 4,71169
TPil 221 4,35997
Uah0 215 4,48070
bAQM 220 5,58684
ekz7 220 5,21563
enGG 220 4,87795
eovT 220 5,73417
lPcj 207 5,52568
mdkF 220 4,84469
mlpe 216 6,37645
mxS1 222 4,84725
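
The cluster summary (member count, average length, average pI) can be recomputed from the rows of this table. The sketch below assumes rows of the form `ID length pI` with a decimal comma in the pI column. The helper names `parse_decimal`, `parse_rows` and `summarize` are hypothetical. The listed rows do not appear to include 14s00 itself, so averages taken over the list alone may differ slightly from the stated summary.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Parse a number written with either a decimal comma or a decimal point."""
    return float(text.replace(",", "."))


def parse_rows(lines):
    """Turn 'ID length pI' lines into (id, length, pI) tuples, skipping other lines."""
    rows = []
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # header or blank line
        protein_id, length, pi = parts
        try:
            rows.append((protein_id, int(length), parse_decimal(pi)))
        except ValueError:
            continue  # not a data row
    return rows


def summarize(rows):
    """Return (count, mean length, mean pI) for the parsed rows."""
    return len(rows), mean(r[1] for r in rows), mean(r[2] for r in rows)


if __name__ == "__main__":
    sample = [
        "Protein ID Length (AA) pI",
        "1goRp 220 5,63055",
        "15mTe 259 7,65982",
        "17KP3 218 5,25104",
    ]
    count, avg_len, avg_pi = summarize(parse_rows(sample))
    print(count, round(avg_len, 1), round(avg_pi, 2))
```

Only three rows are used here to keep the example short; feeding in the full table works the same way.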
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31764 (47oM5) 2 25,8% 155 6.555E-27
2 phalp2_39644 (6MwRH) 23 29,8% 184 2.093E-23
3 phalp2_9072 (5vXSO) 10 27,9% 222 1.341E-22
4 phalp2_27312 (4lfKE) 8 25,0% 228 2.194E-15
5 phalp2_114 (5ccAn) 9 27,8% 147 2.623E-08
6 phalp2_37836 (4IL0X) 8 25,1% 175 1.557E-07
7 phalp2_7798 (7szmD) 9 26,0% 150 1.232E-06
8 phalp2_29497 (1OI6t) 15 21,2% 216 7.476E-05

Domains

Unannotated
Representative sequence (used for alignment): 1goRp (220 AA)
Member sequence: 14s00 (209 AA)
Axis: residues 1 to 220 (representative)
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage: Unknown from Metagenome [NCBI]; Taxonomy ID: UNKNOWN_ENVHOG; Lineage: no lineage information
Host: no host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1goRp) rather than this protein.
PDB ID: 1goRp
Method: AlphaFoldv2
Resolution: 89.39
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
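
The confidence bands above can be expressed as a small lookup. This is a sketch only: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of the exact boundary values 90, 70 and 50 (assigned here to the lower band) is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 89.39, 60.0, 10.0):
        print(value, plddt_band(value))
```

For instance, a value such as 89.39 would fall in the High band if it were read as a pLDDT score. The page labels that figure as Resolution, so this reading is an assumption.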