Protein
- Protein accession
- 1gG6x [EnVhog]
- Representative
- 4CBwF
- Source
- EnVhog (cluster: phalp2_23118)
- Protein name
- 1gG6x
- Lysin probability
- 99%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MAKICIDPGHYGKYNRSPGVPEYYESEMVWKLSQLQKKYLEQLGDTVILTRNDPNKDLNLITRGKKSKGCDLFISNHSNAVGGGMNENVNYAAIYHLVEDTTTKVDDISKNFANVIAPVIANTMGISYRVLTRKAESDRNADGIKNDNYYGVLHGARLVNTPGLILEHGFHTNTKNVQWLLKDENLDRLARAEAEAIHKFFNKDEYSGSTEPAKPVEPVKEPEVTSKAFKVRVSIKDLNIRTGPGTNFAKVGYFIPTGVYTITEIQNGAGSKDGWGRLKSGVGWISLDFVEILK
- Physico‐chemical
properties -
protein length: 294 AA molecular weight: 32726,7 Da isoelectric point: 8,59 hydropathy: -0,47
Representative Protein Details
- Accession
- 4CBwF
- Protein name
- 4CBwF
- Sequence length
- 241 AA
- Molecular weight
- 26791,05320 Da
- Isoelectric point
- 8,61912
- Sequence
-
MQKKYLEQMGHTVIMTRTDPNKDLALVSRGKASKGCDLFASNHSNAVGSYMDESRSNVAVYHLVQDHTTECDDVSKDFAEKIAPVIGEVMGLESRVHERAAQSDRNGDGFKNDNYYGVLHGSRLVKTPGVILEHGFHTHSATVRWLLDDNNLERLAKAEAEFIDSYFGGSRPGIVVPSSDVPYKVRVTIKDLNIRTGPGVNYAKTGKYVIPGVYTIIEESCGWGRLKSKIGWIRLKYATKI
Other Proteins in cluster: phalp2_23118
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_19894
5T9s7
|
90 | 42,3% | 243 | 3.844E-57 |
| 2 |
phalp2_39497
5jS1N
|
14 | 29,8% | 305 | 2.338E-41 |
| 3 |
phalp2_17577
6kJnH
|
104 | 32,2% | 245 | 2.682E-27 |
| 4 |
phalp2_36719
6eCmw
|
993 | 32,9% | 243 | 1.494E-25 |
| 5 |
phalp2_29154
7mKv1
|
17 | 27,0% | 240 | 5.955E-13 |
| 6 |
phalp2_22979
3p5T3
|
17 | 27,8% | 187 | 1.978E-12 |
| 7 |
phalp2_9278
79Kdt
|
5 | 29,7% | 175 | 1.035E-09 |
| 8 |
phalp2_18497
6MPko
|
7 | 29,4% | 156 | 5.002E-07 |
| 9 |
phalp2_27233
3PKbz
|
18 | 26,2% | 183 | 9.208E-06 |
| 10 |
phalp2_37678
3Xhrk
|
501 | 25,9% | 162 | 9.335E-05 |
Domains
Domains
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4CBwF)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50