Protein

Protein accession
53M90 [EnVhog]
Representative
4odrF
Source
EnVhog (cluster: phalp2_37668)
Protein name
53M90
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKKLTLTVLLLLLTGCGVAQADAPGVPPTNEMQGAPLPETTTTLPDLSFLNTTTTQPKPRPAPQPAIIQPQTSEPTEAQWDALAFCESSNRHDYPPVSGGFSGLLMFHDATWNGYGGQEYAPQAWQASRAQQIEIALRLWRARGWQPWPGCRAKLGFN
Physico-chemical properties
protein length: 159 AA
molecular weight: 17384,5 Da
isoelectric point: 6,71
hydropathy: -0,41
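The length and hydropathy values above can be recomputed from the listed sequence. The sketch below assumes that "hydropathy" means the GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the record does not say which scale was used, so treat that as an assumption. The names `KYTE_DOOLITTLE`, `SEQ`, `gravy` and `decimal_comma` are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Member sequence 53M90, copied from the "Protein sequence" field of this record.
SEQ = "MKKKLTLTVLLLLLTGCGVAQADAPGVPPTNEMQGAPLPETTTTLPDLSFLNTTTTQPKPRPAPQPAIIQPQTSEPTEAQWDALAFCESSNRHDYPPVSGGFSGLLMFHDATWNGYGGQEYAPQAWQASRAQQIEIALRLWRARGWQPWPGCRAKLGFN"


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def decimal_comma(value: float, digits: int) -> str:
    """Format a number with a decimal comma, as this record does."""
    return f"{value:.{digits}f}".replace(".", ",")


print(f"protein length: {len(SEQ)} AA")
print(f"hydropathy: {decimal_comma(gravy(SEQ), 2)}")
```

Molecular weight and isoelectric point are left out on purpose: both depend on the residue-mass table and pKa set chosen, which the record does not specify.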
Representative Protein Details
Accession
4odrF
Protein name
4odrF
Sequence length
182 AA
Molecular weight
20060,77890 Da
Isoelectric point
4,97787
Sequence
MSHLHTEEQEQTSMDATSTIRATVCAAAMIITLQSCTQQQPEAPAPQHEPIEFNGYDMTDNAGKAVQNLEIFLQSVTTTTTTAMQQNRNTATPTPQHSSVWDDLAQCETGGDWATNTGNGFGGGLQFAHQQSWSTWRAFHGEQYAPHPWEATREQQIEVAERVLQSSGWGAWPGCARKLGLR
Other Proteins in cluster: phalp2_37668
Total (incl. this protein): 27; Avg length: 171,6 AA; Avg pI: 5,58

Protein ID Length (AA) pI
4odrF 182 4,97787
11mSs 162 5,43338
1Jkx2 167 4,57170
1JvDY 164 5,06091
1JwcS 152 5,60321
1Jz9Q 163 5,79504
1LAN9 166 5,22779
2759U 182 4,54225
44lJB 172 5,21335
4Aumj 176 5,13400
4XnH2 166 4,86879
4l6Tn 184 4,74989
4rrnb 185 4,57079
589ZD 167 6,34615
5BTxx 158 5,72843
5eIjs 178 5,32248
5f9w7 160 8,86242
5gRNB 179 5,04238
5gyOS 159 8,50385
5mday 181 5,08728
5yr1X 158 7,71904
7XNo0 167 4,76950
882lG 211 6,89516
8mxRU 180 4,64076
8mzu1 180 4,54947
Sq1O 174 4,80053
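The cluster summary (member count, average length, average pI) can be cross-checked against the table. The sketch below parses the rows as listed, converts the decimal commas, and adds the current protein (53M90, 159 AA, pI 6,71), since the summary line says the total includes it. That the averages are plain arithmetic means over all 27 members is an assumption; the helper names are hypothetical.

```python
# Rows copied from the "Other Proteins in cluster" table: ID, length (AA), pI.
ROWS = """\
4odrF 182 4,97787
11mSs 162 5,43338
1Jkx2 167 4,57170
1JvDY 164 5,06091
1JwcS 152 5,60321
1Jz9Q 163 5,79504
1LAN9 166 5,22779
2759U 182 4,54225
44lJB 172 5,21335
4Aumj 176 5,13400
4XnH2 166 4,86879
4l6Tn 184 4,74989
4rrnb 185 4,57079
589ZD 167 6,34615
5BTxx 158 5,72843
5eIjs 178 5,32248
5f9w7 160 8,86242
5gRNB 179 5,04238
5gyOS 159 8,50385
5mday 181 5,08728
5yr1X 158 7,71904
7XNo0 167 4,76950
882lG 211 6,89516
8mxRU 180 4,64076
8mzu1 180 4,54947
Sq1O 174 4,80053
"""


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one table row and convert the decimal-comma pI to a float."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


members = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
# The summary counts the current protein too, so add it by hand
# (values taken from the physico-chemical properties of this record).
members.append(("53M90", 159, 6.71))

avg_len = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(f"Total: {len(members)}")
print(f"Avg length: {avg_len:.1f}")
print(f"Avg pI: {avg_pi:.2f}")
```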
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31660 (4ICMI) 8 38,4% 125 1.398E-17
2 phalp2_3298 (2S9RW) 1 33,7% 160 9.395E-15

Domains

Annotated regions: Disordered region, Transgly
Representative sequence (used for alignment): 4odrF (182 AA)
Member sequence: 53M90 (159 AA)
Axis: 1 to 182 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF06737

Taxonomy

Phage: Unknown from Metagenome (Taxonomy ID: UNKNOWN_ENVHOG; Lineage: no lineage information)
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4odrF) rather than this protein.
PDB ID 4odrF
Method AlphaFoldv2
Resolution 70.20
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
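The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities, so the exact boundary values (90, 70, 50) are not assigned to any band; placing them in the lower band is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score in [0, 100].

    Boundary values (90, 70, 50) fall into the lower band; the legend
    itself leaves them unspecified, so this is an assumption.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be in [0, 100], got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for example in (95.0, 82.5, 61.0, 34.0):
    print(example, plddt_band(example))
```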