Protein
- Protein accession
- 5SAgr [EnVhog]
- Representative
- 4SEzC
- Source
- EnVhog (cluster: phalp2_3635)
- Protein name
- 5SAgr
- Lysin probability
- 99%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MLRGIDISGYQNDMNPNAIDTDFMIIKATEGIDYINSSCHRFVSHCIDKKIPFGIYHFARPNDASMEAIFFFNAIKKYLGLGIPILDFEDTRCSNLWLNKFVEKFHSLSGIYPWVYMSSSFINQLGYGSQYIKDNCGLWLAGYPDNYSYYISDTNCPYNTSGWNLVA
- Physico‐chemical
properties -
protein length: 167 AA molecular weight: 19175,5 Da isoelectric point: 5,42 hydropathy: -0,14
Representative Protein Details
- Accession
- 4SEzC
- Protein name
- 4SEzC
- Sequence length
- 140 AA
- Molecular weight
- 16047,06770 Da
- Isoelectric point
- 6,54469
- Sequence
-
MAILKGIDISNWQKGFSLANTMPDFVIVKATEGLNFTDKCCDGFVQDAIKLNIPFGYYHFARSNDAEKEATYFYNQTKGYVGKGIPILDFEVLNSNTWLETWCKTFYQLSGVKPWVYMNSDYINNRGYGTTWVKANCGLW
Other Proteins in cluster: phalp2_3635
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_38994
84ePK
|
348 | 37,3% | 142 | 3.904E-26 |
| 2 |
phalp2_20117
23DDy
|
70 | 38,1% | 144 | 6.656E-25 |
| 3 |
phalp2_14322
3WLdQ
|
4 | 46,8% | 94 | 2.394E-21 |
| 4 |
phalp2_2231
4klcC
|
109 | 37,2% | 145 | 1.582E-20 |
| 5 |
phalp2_39219
3xEyz
|
21 | 41,7% | 127 | 7.628E-20 |
| 6 |
phalp2_19241
7XkuR
|
1 | 30,2% | 152 | 7.684E-17 |
| 7 |
phalp2_36349
n1oN
|
6 | 31,7% | 123 | 1.971E-16 |
| 8 |
phalp2_40151
8tQ4T
|
5 | 30,0% | 140 | 5.055E-16 |
| 9 |
phalp2_36157
6TiaL
|
25 | 28,4% | 130 | 3.655E-13 |
| 10 |
phalp2_20806
5TBA5
|
9 | 30,4% | 128 | 6.836E-13 |
Domains
Domains
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4SEzC)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50