Protein

Protein accession
5bQfF [EnVhog]
Representative
1MMvC
Source
EnVhog (cluster: phalp2_11679)
Protein name
5bQfF
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTTWIRPVDGGSISDSFDGHKNRAKPSVNPGVDYAVATGTPAKAVADGTVINTVPTFAGAGGRMVFLSFPSGHTADYLHLSRIDVQPGQAVKQGQVIGLVGGSGLGKENGYGAHLHFSFRVGGKPTMGAGNIDYEAFRGAPTSAAPAKPSAAPAKAPTATAAGSRAYPGRMLKQGEPANADVLYLQNKLGVNPPGPFGPKTHAAVVAFQRSKGLTADGIVGPKTWAALG
Physico‐chemical
properties
protein length:229 AA
molecular weight:23272,0 Da
isoelectric point:9,86
hydropathy:-0,17
Representative Protein Details
Accession
1MMvC
Protein name
1MMvC
Sequence length
209 AA
Molecular weight
21155,63010 Da
Isoelectric point
9,79831
Sequence
MTTYIRPVQAGISDNFAAHQARRSVNPGTDYVVGIGTPVVAVADGVVGGTVTHIGGAGGRMIFLDFPDGHNADYLHLSRIDVAPGQAVKQGQVIGLSGASGKGYERGYGPHLHFSFRHGGRHTTGAGNKDFEAVVSGGGAPAPAAAPASKPTVKKGSKGKAVAYLQSKLGITADGAFGPITDKAVRAFQTSKGLVADGIVGPKTWAAIG
Other Proteins in cluster: phalp2_11679
Total (incl. this protein): 116 Avg length: 218,0 Avg pI: 9,72

Protein ID Length (AA) pI
1MMvC 209 9,79831
1JAxH 222 9,66028
1JJCZ 225 9,66073
1K1Kn 226 9,98810
1K4s3 225 9,66028
1LP5T 214 9,86297
1Mntk 226 10,16197
1WAAv 209 9,95825
1X06j 213 10,13715
1gMOi 282 4,76938
1hIII 273 10,78216
1hUoG 270 10,22522
30jqB 228 9,77381
3dCAf 242 9,27031
41UrE 216 10,03910
44tmU 210 9,91512
46GSH 195 10,18731
46okJ 211 10,08532
48aQO 226 9,72836
48cey 211 9,40866
49v3l 205 9,83660
4HSmy 299 4,35065
4J25V 212 9,75408
4Yq32 223 9,78567
4ZPLf 225 9,93930
4ZQOk 157 9,75621
4ZZ33 214 9,95626
4aERH 220 9,73107
4aP5d 223 9,90739
4abjL 205 9,83660
4agKH 196 10,44757
4bm6J 222 9,76221
4bxjP 195 10,45292
4by3r 197 10,16874
4l74k 210 10,06514
4oHAX 225 9,72842
4pDue 222 9,81939
4pPGI 213 9,83551
4pk0f 225 9,81881
4rKrR 217 9,57280
50FsN 224 9,87090
50NG2 222 9,81997
50Ntg 225 9,72791
51435 210 9,66673
51Ndj 173 9,87212
51Nqk 205 9,83602
51usi 217 9,94104
52h4X 225 9,76111
52vYQ 224 9,85523
53mfo 217 9,72688
53nec 214 9,95626
55DDk 214 9,92273
56A9E 223 9,69606
56Ag1 226 10,07094
56FsU 168 9,77742
56nSR 205 9,83602
56ybS 225 9,89643
57Muw 186 9,77233
57MzN 205 9,79347
585WH 213 9,72333
58rHq 225 9,76111
58rVy 228 9,72688
5BNAH 214 9,92273
5BNM2 175 9,73848
5BNZW 224 9,79824
5EgRs 225 9,76111
5a3Jt 202 9,89108
5aHpI 205 9,99449
5ahbD 210 10,03226
5bjZp 228 9,81881
5cgYz 213 9,91512
5cvmE 210 10,18131
5djIU 220 9,94562
5eKjd 168 9,75576
5eMZV 224 9,79889
5fFgj 225 9,74138
5fk5g 205 9,84988
5gae2 228 9,81997
5gdGN 204 9,87148
5gfkI 228 9,72842
5h7yB 226 9,78567
5hthQ 162 9,86793
5it8z 213 9,66634
5iv6f 222 9,81997
5jf5P 210 9,84982
5jhiG 229 9,72417
5kTmL 209 9,43239
5m3dr 226 9,99088
5nrtu 207 9,93092
5tAHf 221 9,81939
5tCcm 217 9,62889
5vVue 226 9,89495
5wbfo 205 9,87025
5ysGc 225 9,78567
5z4Yl 225 9,94717
5zTqB 222 9,72991
5zUaw 218 9,83686
5zi8P 217 10,25068
6QiiA 255 10,04213
6RqCE 291 5,10996
6Ux0Z 208 9,83602
6Ux0t 225 9,98901
6y6HH 219 9,90668
6yIqA 228 9,87148
6yeVE 225 9,76111
7G3A4 210 9,66634
7G3fp 211 9,99487
GkN2 224 9,73229
Mj6Q 223 9,76169
Tjrv 225 9,91577
UnTC 229 9,62972
UnYw 228 9,13048
A0A6J5M709 221 9,78516
A0A6J5MST5 214 9,84982
A0A6J5N6B3 228 9,76169
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_29729
1mI3z
7 44,1% 154 3.743E-39
2 phalp2_428
7zQ20
48 35,5% 214 1.311E-38
3 phalp2_25483
3axoQ
3 38,8% 206 1.442E-36
4 phalp2_21800
4e6HN
71 38,5% 148 2.761E-28
5 phalp2_11281
6RW1f
1 37,2% 137 9.610E-28
6 phalp2_40071
41qFE
139 32,2% 158 1.563E-21
7 phalp2_14972
vSj7
5 32,0% 212 7.355E-21

Domains

Domains
Representative sequence (used for alignment): 1MMvC (209 AA)
Member sequence: 5bQfF (229 AA)
1 209 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1MMvC) rather than this protein.
PDB ID
1MMvC
Method AlphaFoldv2
Resolution 91.06
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50