Protein
- Protein accession
- 4CQCh [EnVhog]
- Representative
- 4CQCh (this protein)
- Source
- EnVhog (cluster: phalp2_26945)
- Protein name
- 4CQCh
- Lysin probability
- 99%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MRVSQRRLKYKLKKYGVPTKFIDGWDSKKIDVYGTSPSVGVVLHHTAGTNSLNWIVNGNPYAPVRACHFLVNRDGTVEVVSGTSAYHAGSGGPVTFKRKTLPDVTVPKDQGNEYLYGIEIESLGTTAAINGTKGGMTLEQVISTALLSAALLNALRPFNLSYPVDRVIRHQDWTTRKPDVKQDLDWWHQVVGIARRNVIDSEKTRREVTAFVKANSKGVLVVKPTPTPEPEQIIVGKIPKSDPPAVVVCEVVPAKPKPAPKPVVKLSDLKPRQKNASVGTVQEALQKEVGLAKQVSPVFNAATKAAYAKWQKKLGYTGADADGIPGRASLIKLGKKNGFTVKAT
- Physico‐chemical
properties -
protein length: 344 AA molecular weight: 37345,7 Da isoelectric point: 9,91 hydropathy: -0,31
Other Proteins in cluster: phalp2_26945
| Total (incl. this protein): 13 | Avg length: 310,4 | Avg pI: 9,67 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 1569H | 309 | 9,80811 |
| 2ENvW | 352 | 10,08790 |
| 48Mla | 307 | 9,30010 |
| 49Ahl | 296 | 8,81736 |
| 4A5gp | 299 | 9,91255 |
| 4BgCk | 297 | 10,13690 |
| 4KPgX | 344 | 9,96419 |
| 5we5G | 291 | 8,94436 |
| PoKH | 297 | 10,02491 |
| Syr8 | 306 | 9,07052 |
| dPIb | 296 | 9,88985 |
| tRdO | 297 | 9,91029 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_30054
2ICDq
|
63 | 67,8% | 221 | 3.815E-75 |
| 2 |
phalp2_19071
1jII3
|
225 | 29,3% | 347 | 7.057E-28 |
| 3 |
phalp2_12291
7ggQn
|
288 | 26,0% | 330 | 1.761E-24 |
| 4 |
phalp2_13355
486o0
|
7 | 25,5% | 340 | 1.346E-20 |
| 5 |
phalp2_30251
4g4tR
|
4 | 23,1% | 255 | 4.927E-18 |
| 6 |
phalp2_27801
7zeFm
|
19 | 25,4% | 318 | 4.927E-18 |
| 7 |
phalp2_30257
4gSoB
|
83 | 25,0% | 271 | 4.188E-11 |
| 8 |
phalp2_37674
4urEs
|
29 | 26,1% | 237 | 9.884E-11 |
| 9 |
phalp2_25908
5JMoV
|
6 | 21,6% | 231 | 9.706E-10 |
| 10 |
phalp2_2829
8E642
|
18 | 22,6% | 234 | 2.218E-08 |
Domains
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50