Protein

Protein accession
4Ms1p [EnVhog]
Representative
XX49
Source
EnVhog (cluster: phalp2_38720)
Protein name
4Ms1p
Lysin probability
96%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
VADATEFFAQLGEFSRRVFSMLGLQNTSSPWVTGVFPEGQPTGAVLHYTGSRDAYSTALWFMNPELESKVSAHVIIGTEWPQGVRERHAHDLLLVRELPTMVLQCVAPNVIAHHATWCNRHSVGLELVNWGEVRFNSELGWVVYPQGWTKRYQARGNDFPQSALKRYWEPYSAAQVRCAVEVLRWYRRWMGGRMRPEWVLGHEQVQGVATIGATGDKRDPGPLLPLHDMRLAALGDDVVEQLGWFQKFSSDPLYASKIRDGMLIDWYLGHPGAVVGERALDVAQKRFAAAVYELGTAGSWKAAFGALGKTCLRLLGYYI
Physico‐chemical
properties
protein length:319 AA
molecular weight:35832,5 Da
isoelectric point:7,86
hydropathy:-0,17
Representative Protein Details
Accession
XX49
Protein name
XX49
Sequence length
398 AA
Molecular weight
44837,00200 Da
Isoelectric point
6,62694
Sequence
MEGVMGSKAQAVEFFRQLDIFSDRFVKAELGLNTATSPWGSSLEADEPKGAVLHYTADDDLLRVLRWFLDPKWQSKCSSHAVVADRKLGTTQEMAKDLPLVAELPVTVVQCRLPSQEAWHATWCNASTYGIENVNVGEVRKAPDGSDGWVCWRPRDKSSPEWTLPWKSPYKTPVGLYGRFWEPYTSEQIEANVALLRYVRDYFGEGRLQRPWIVGHEAVQGVDTRGRGGSGPPMRTDKRDPGPTFPIHGIRYAVFDGWTPVGRYDWFNTYRGDPRWGQSDRDTMVVRVVRAMAGRPETPGAADPSPETAWARFKSGFQATLTNGETPFGVWGKLALWLLGFHVSSLKEGELRDPDLDSEDCQSVWIFQRLAGITTDGKPGKVTRENIWKRLQDRGFIA
Other Proteins in cluster: phalp2_38720
Total (incl. this protein): 12 Avg length: 369,0 Avg pI: 7,11

Protein ID Length (AA) pI
XX49 398 6,62694
11CTO 328 9,90172
1CiNq 394 6,77500
1CkGm 358 6,13863
2DhaN 368 6,55680
3XeZI 383 8,45040
4FL5z 366 6,31080
4G652 370 7,14923
4Gv07 381 5,77333
4RjRG 391 6,10055
k2uc 372 7,69972
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21299
1CkLW
10 25,0% 391 3.236E-37
2 phalp2_8546
1CIvy
7 26,5% 410 8.031E-37
3 phalp2_23755
Y34i
2 25,3% 407 1.013E-34
4 phalp2_18741
TfjB
5 24,7% 331 2.248E-25
5 phalp2_32422
RBb9
43 19,5% 363 8.039E-11
6 phalp2_9573
1lcx7
1 21,9% 282 9.724E-09

Domains

Domains
Representative sequence (used for alignment): XX49 (398 AA)
Member sequence: 4Ms1p (319 AA)
1 398 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (XX49) rather than this protein.
PDB ID
XX49
Method AlphaFoldv2
Resolution 88.19
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50