Protein

Protein accession
4F1mZ [EnVhog]
Representative
4E9CJ
Source
EnVhog (cluster: phalp2_23125)
Protein name
4F1mZ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VRLVVLHTAEGARTYQSLGSYFSSPSSGVSSHTGIDDTPGVIGEYVRRSDKAWTQGNANPYSVAAELCAFAAWTPAEWQMHETMLANTAQWVAEECAAFGIPLVRLNAAEAQGGASGVCQHVDLGAAGGGHWDCGPGFPMDQVIAMAAGGAMGPIPTEQEDNMMLLDPRSGGYWVAFRDGAVHAYRGAPMLGGTNNDKYNRQKWPCIGIAGRRDGNGYTLGLDPGNRDINFYEFPYDGSARP
Physico‐chemical
properties
protein length:242 AA
molecular weight:25777,4 Da
isoelectric point:5,03
hydropathy:-0,31
Representative Protein Details
Accession
4E9CJ
Protein name
4E9CJ
Sequence length
243 AA
Molecular weight
25102,68890 Da
Isoelectric point
5,62015
Sequence
VTLARVWIASPNYSSRGGSGVRLAVIHTAEGALTYQSLGSFFANPSSGVSSHVGIDDTANTVGEYVTRGNKAWTQGNANPVAVAAELCAFAAWTPAEWDRHPNMLANCAAWLAEEAAAFGIPLVRLTPAQAQGAGRGVCAHSDLGSWGGNHSDPGVGFPMDRVISMAAGGAPAPGPSPTEEVEMFLAYATNDSTHDNTIKRNNQYVVRDTGISAVATPADSNALQAKLGPLVGLSGNMLNSLK
Other Proteins in cluster: phalp2_23125
Total (incl. this protein): 94 Avg length: 260,8 Avg pI: 5,31

Protein ID Length (AA) pI
4E9CJ 243 5,62015
11EQv 262 4,92558
11FBc 264 6,21974
15Gl6 245 5,85705
16iIo 304 5,34130
1NBPd 284 4,66804
25k6U 257 7,09188
25kgC 246 5,70978
3X1as 286 6,19519
3X3dM 304 6,37798
3Xk3j 304 4,81616
4E1jH 264 5,42906
4E6tW 263 5,59184
4E7F1 262 4,77416
4E7Kg 259 5,40615
4E8ev 293 5,62288
4E8qk 300 5,19789
4E8y3 253 4,68083
4E8yn 237 4,60978
4E9hV 240 5,01140
4E9p4 271 4,73568
4EDt6 267 5,83949
4EJl4 262 4,74136
4EJxC 261 4,96661
4ENOA 274 5,65368
4ENq0 287 6,31188
4EOOy 264 5,75554
4EOxM 265 8,15385
4EQBs 220 4,48098
4EQKJ 259 4,66173
4EQcd 291 5,49715
4EQkL 246 4,92819
4EQmF 287 4,56147
4EQtO 249 4,80985
4EU0k 256 5,27559
4EWiR 256 5,83295
4EXkR 244 5,09103
4EdBK 242 4,44034
4EdU6 266 6,93876
4EdzA 259 5,78515
4Efug 285 5,36864
4EgZx 295 7,04902
4Egr8 240 4,87675
4Eh1o 223 5,34715
4EhWE 262 5,65857
4Ehau 248 5,18993
4EhvR 240 4,59318
4EhzU 272 5,57610
4EmVV 254 4,99640
4EmYr 253 6,12801
4En3f 263 5,05653
4En8E 253 6,11397
4EnJ6 265 5,04437
4EnRL 257 5,43735
4EnRV 238 4,67162
4EnsH 258 4,83918
4EnwL 313 9,12410
4Es8O 256 5,37080
4EsyK 287 4,96354
4F0Qx 213 5,36454
4F0vU 238 4,69828
4F0wg 242 5,69012
4F13c 226 4,21554
4F1Iy 243 4,73744
4F1V2 264 5,27354
4F1cQ 259 5,46094
4F1zc 233 4,62439
4F2JQ 250 4,26971
4F2Vh 242 5,46822
5EE07 270 5,44042
5F79G 256 4,42897
5Fq0k 292 6,24237
6ENj3 277 5,04261
6HWpL 266 4,40942
6HZuG 249 4,45631
6I0MF 256 4,92217
6I3Py 267 4,87635
6I3vf 243 5,45537
6Ic4T 256 4,69021
6IcMZ 269 4,75824
6IdLv 279 5,36534
6Ieji 244 5,27667
6IgIt 259 5,28179
6Ih84 210 4,62160
6Kemy 263 4,83446
6RPHq 259 4,06651
6TOJR 320 5,65675
6XDkK 283 5,65113
6XUZq 282 5,23251
6XX5C 239 5,27281
jE74 274 5,34721
jEh8 213 5,15725
jFcR 278 4,59898
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32349
gl93
55 73,2% 168 2.867E-97
2 phalp2_10922
4ERj9
80 64,8% 185 1.442E-90
3 phalp2_34463
4EULK
4 69,2% 156 5.917E-82
4 phalp2_28710
4EhZZ
178 70,7% 164 5.356E-81
5 phalp2_20908
6XFXa
45 46,3% 207 1.168E-69
6 phalp2_13766
6Tcyj
5 41,6% 185 3.753E-59
7 phalp2_17988
1farV
1 52,4% 162 6.172E-49
8 phalp2_12038
4Xxfh
2 38,2% 188 2.303E-35
9 phalp2_19863
5Bmz6
51 35,7% 165 9.656E-34
10 phalp2_25881
5B1MM
2 32,1% 165 5.083E-29

Domains

Domains
Ami2
Disordered region
Representative sequence (used for alignment): 4E9CJ (243 AA)
Member sequence: 4F1mZ (242 AA)
1 243 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4E9CJ) rather than this protein.
PDB ID
4E9CJ
Method AlphaFoldv2
Resolution 81.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50