Protein

Protein accession
1fReJ [EnVhog]
Representative
2TfSz
Source
EnVhog (cluster: phalp2_40265)
Protein name
1fReJ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNTNGWADIGYSYGVCPHSDSSGKGYVLEGRGLRKEQAAQPGGNTTYYSVTLMLGEGEKPTDVQIKTVRELRAWLRGKGVGAEVKGHKDFISTSCPGSILYKMVKDGTFSKGSPAKSNSWPFSGNFALGKTDRAKYPTQTKKVQKRLNDLGYKPKLAVDGDFYVKTEKAVKWFQKKKGIKVDGLVGKVTWGKLFP
Physico‐chemical
properties
protein length:195 AA
molecular weight:21422,3 Da
isoelectric point:9,81
hydropathy:-0,63
Representative Protein Details
Accession
2TfSz
Protein name
2TfSz
Sequence length
223 AA
Molecular weight
24917,25220 Da
Isoelectric point
8,96164
Sequence
MYVTVLRSVLGWPDTKNIPNAPTNKGIVIHYDGGSRNLTAKEHPACLDYWKWCRDFHIKTNGWKDIGYSYGICPHGVLFEGRGFGREQAAQPGGNRDWLSVTLMLGKKESPTEKQIAAFNEFRTKLVRTKKIAEAVSFHSMFFATDCPGDIVRNKIAHGEFSKLLPVADADQYWSALQELPLLNDLGPNAKAVVKSMQRALNLDDDGDIGPNTWSAIIKKVLK
Other Proteins in cluster: phalp2_40265
Total (incl. this protein): 3 Avg length: 231,7 Avg pI: 8,31

Protein ID Length (AA) pI
2TfSz 223 8,96164
28yqU 277 6,15756
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36432
11r0n
207 27,9% 247 4.876E-34
2 phalp2_37752
4kTIO
51 28,0% 239 1.343E-26
3 phalp2_26920
4uKrS
7 27,7% 259 6.345E-26
4 phalp2_15099
bgFu
21 30,4% 161 5.565E-25
5 phalp2_784
240oa
2 25,9% 235 3.704E-22
6 phalp2_15382
1LWAy
5 30,2% 152 4.445E-19
7 phalp2_15057
7s7pP
172 30,0% 173 8.183E-17
8 phalp2_13951
8FOsh
28 27,5% 243 1.738E-15
9 phalp2_26274
RjMA
376 25,8% 251 1.738E-15
10 phalp2_39859
14dHo
34 24,5% 253 1.082E-14

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 2TfSz (223 AA)
Member sequence: 1fReJ (195 AA)
1 223 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2TfSz) rather than this protein.
PDB ID
2TfSz
Method AlphaFoldv2
Resolution 84.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50