Protein
- Protein accession
- 5geC2 [EnVhog]
- Representative
- 4bLvY
- Source
- EnVhog (cluster: phalp2_30232)
- Protein name
- 5geC2
- Lysin probability
- 99%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
LSNHSSGTAIDLNATKHPLGKSGTFPAEKVPMIRALAKKYGLKWGGDYRNRKDEMHFEIELSEAKVAALIGSLNKGDN
- Physico‐chemical
properties -
protein length: 78 AA molecular weight: 8495,6 Da isoelectric point: 9,35 hydropathy: -0,58
Representative Protein Details
- Accession
- 4bLvY
- Protein name
- 4bLvY
- Sequence length
- 87 AA
- Molecular weight
- 9979,25080 Da
- Isoelectric point
- 9,79251
- Sequence
-
AFRMTRSSDRVLSNHASGTAIDLNAIKHPLGKSNTFNKDQRNTINLLITKYGLNWGGNYKKRKDDMHFEIALSQYEVEQKIKELGLK
Other Proteins in cluster: phalp2_30232
| Total (incl. this protein): 16 | Avg length: 121,6 | Avg pI: 8,85 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4bLvY | 87 | 9,79251 |
| 4o5hV | 81 | 9,92905 |
| 5nsWA | 75 | 9,63108 |
| 6xvuQ | 107 | 8,87319 |
| GvGV | 69 | 9,72314 |
| SZfX | 62 | 9,69845 |
| A0A6J5MPT8 | 157 | 8,96673 |
| A0A6J5NL41 | 157 | 9,48815 |
| A0A6J5P9P7 | 155 | 9,09747 |
| A0A6J5Q6R5 | 153 | 7,77729 |
| A0A6J5QD47 | 161 | 6,58897 |
| A0A6J5S5Q6 | 141 | 6,98815 |
| A0A6J5S7G2 | 155 | 8,44402 |
| A0A6J5T520 | 153 | 8,90677 |
| A0A6J5T5N8 | 154 | 8,38336 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_39525
5vm8a
|
9 | 35,8% | 92 | 1.183E-09 |
| 2 |
phalp2_31920
4Q8kv
|
1 | 44,8% | 58 | 1.500E-08 |
| 3 |
phalp2_5848
5qUoi
|
18 | 39,1% | 69 | 6.776E-07 |
| 4 |
phalp2_5841
5lEAl
|
2 | 34,7% | 69 | 4.552E-06 |
| 5 |
phalp2_34659
5Brmp
|
2 | 36,1% | 72 | 3.056E-05 |
| 6 |
phalp2_555
GIx4
|
26 | 32,8% | 73 | 2.816E-04 |
| 7 |
phalp2_12185
6C5go
|
1 | 32,3% | 68 | 7.290E-04 |
Domains
Domains
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4bLvY)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50