Protein

Protein accession
4bzmB [EnVhog]
Representative
3gmlB
Source
EnVhog (cluster: phalp2_24286)
Protein name
4bzmB
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MANQEVPKVNTPVSEAEMANALSQAARELFNIELTQPQLALLVAQNNLETASRKSTHNFNVGNITHSNDSFDYFIGGDKTKDKNGKWIPTKFKFRSYPTLKDGVKDYISNIKNRGHGIVWNNILTGDPAAFSKALKKTRYYEADEKDYTEGLKAHVSNFNKNYKAPPSNVAETTIDKITNFVNKFLKAIASMDVK
Physico‐chemical
properties
protein length:195 AA
molecular weight:21864,4 Da
isoelectric point:9,32
hydropathy:-0,64
Representative Protein Details
Accession
3gmlB
Protein name
3gmlB
Sequence length
210 AA
Molecular weight
23477,35510 Da
Isoelectric point
5,69097
Sequence
MMLEAEKTPVTPTDVARALRAAWLRLLEVVPAEQQLAVLMAQSALETGRWKSCWNFNLGNIKGGGKWGGDTCQFRCNEVINGKVEWFDPPHPQTTFRAYAHLTDAAADYLWLLRRRFAQAWEYVLRGDPVGFSQALKRQKYYTAPEPPYTKAVKSLFNEYLRLLMNGDSSPPPPPDEAPDTEPAGHGLEALALSIAAFDAVSASREDREP
Other Proteins in cluster: phalp2_24286
Total (incl. this protein): 71 Avg length: 226,1 Avg pI: 6,09

Protein ID Length (AA) pI
3gmlB 210 5,69097
14YSJ 234 5,67750
169jw 189 4,88920
174K9 247 5,66636
1AMsh 237 5,99051
1IeUX 227 5,76833
1IegA 216 6,65990
1Iekf 237 5,06063
1NCW6 260 5,48334
1Z7rx 238 8,39986
1puoB 233 5,42570
20MKN 225 6,42095
2570H 207 6,94512
25nbX 236 5,88206
2S36l 232 5,14583
2TwXU 230 8,71614
30uSg 253 8,31057
36iPZ 253 6,34678
3Lofd 195 4,64917
3f2Dl 240 5,10030
3fGu6 228 6,31671
3mTXZ 228 9,29932
3nsU 222 5,55819
42dYf 236 5,57502
4AEjY 259 4,72732
4DzM3 256 9,39706
4HZl4 234 5,11786
4Hklj 235 5,99540
4Hrrl 186 4,92700
4I0HA 225 6,41913
4IJPx 238 4,98600
4ItxD 204 4,96508
4MXw3 243 4,86004
4Ngn9 237 7,65197
4QlD3 230 5,55916
4b3SL 242 4,91989
4c1VN 201 5,70143
4cd9R 230 5,16026
4dP2K 242 5,09626
4eRW5 222 9,21403
4euxW 177 5,09302
4fbkD 233 4,97065
4fnd5 232 5,52290
4vY0g 189 5,47993
5DQak 196 4,63678
5H9rx 254 5,71393
5He3F 242 5,31504
5Id7i 231 6,11203
5Jwl5 270 5,14242
5kpss 178 5,44395
5nhBu 256 9,64345
6FVEl 249 5,43542
6G0i0 195 5,94157
6Gps4 233 5,31310
6LjfI 200 5,00867
6RO5I 241 5,05380
6T1te 252 6,41879
6U4ck 227 8,53047
6UcF9 233 5,50187
6VHq3 168 8,83895
6VMaq 232 5,82653
86EMj 215 4,65428
8atdV 190 7,01356
8dGMa 181 5,76696
8irlg 222 7,63736
bj0k 225 4,98997
fD3k 238 6,33791
hulY 243 6,66792
hxaJ 223 5,55922
nM8S 236 5,60179
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_35889
4RlWa
4 45,1% 186 1.414E-66
2 phalp2_12780
8asgj
7 49,7% 169 3.637E-66
3 phalp2_30410
4RtBC
33 47,2% 165 2.405E-65
4 phalp2_18002
3ao4M
1 40,3% 181 1.380E-44
5 phalp2_23921
1O9wW
65 35,8% 209 3.539E-44
6 phalp2_14454
4GF1h
2 33,0% 212 1.005E-41
7 phalp2_14445
4Ff3O
1 32,4% 188 3.523E-41
8 phalp2_10695
31VV0
4 31,4% 181 2.077E-39
9 phalp2_4683
4HkTR
2 30,1% 159 5.319E-39
10 phalp2_10542
8aiy3
19 29,7% 249 9.955E-39

Domains

Domains
GLUCO
Disordered region
Representative sequence (used for alignment): 3gmlB (210 AA)
Member sequence: 4bzmB (195 AA)
1 210 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3gmlB) rather than this protein.
PDB ID
3gmlB
Method AlphaFoldv2
Resolution 84.73
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50