Protein
- Protein accession
- C4NTF2 [UniProt]
- Representative
- 8Fz4V
- Source
- UniProt (cluster: phalp2_9404)
- Protein name
- Uncharacterized protein
- Lysin probability
- 100%
- PhaLP type
- endolysin (probability: 99%, predicted by ML model)
- Protein sequence
-
MLGPKTREGIAQAMKIKGVRDAKQLFTRGVRGIVWHWTAGAHGIIDLERDHYNWLFDRMGNVYDGNHSVQDQVNYDVRSGVGASHTKSMNTGWLGLSVDAMAGAIESPLNWGTNPLTWEGIDAMLDWTMDLCKEYDIPVSPWTTLSHAEVEQTLGVRQRFKWDYKVLPGDTRARDARKVGDELRNRMVNRHAA
- Physico-chemical properties
- Protein length: 193 AA; molecular weight: 21849,5 Da; isoelectric point: 7,07; hydropathy: -0,53
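The length and molecular weight listed above can be roughly cross-checked from the sequence itself. The following is a minimal sketch, assuming standard average residue masses plus one water molecule per chain; the database's own method is not stated on this page, so small differences from the reported 21849,5 Da are possible. The names `RESIDUE_MASS`, `WATER`, and `molecular_weight` are illustrative, not part of any PhaLP tooling.

```python
# Average residue masses in Da (free amino acid minus one water).
# Assumption: standard average-isotope values; the database may use others.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(seq: str) -> float:
    """Sum the average residue masses and add one water for the chain ends."""
    return sum(RESIDUE_MASS[aa] for aa in seq.upper()) + WATER


# Protein sequence of C4NTF2 as listed in this record.
SEQ = (
    "MLGPKTREGIAQAMKIKGVRDAKQLFTRGVRGIVWHWTAGAHGIIDLERDHYNWLFDRMG"
    "NVYDGNHSVQDQVNYDVRSGVGASHTKSMNTGWLGLSVDAMAGAIESPLNWGTNPLTWEG"
    "IDAMLDWTMDLCKEYDIPVSPWTTLSHAEVEQTLGVRQRFKWDYKVLPGDTRARDARKVG"
    "DELRNRMVNRHAA"
)

print(len(SEQ))                          # residue count
print(round(molecular_weight(SEQ), 1))   # approximate molecular weight in Da
```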
Representative Protein Details
- Accession
- 8Fz4V
- Protein name
- 8Fz4V
- Sequence length
- 256 AA
- Molecular weight
- 28260,67920 Da
- Isoelectric point
- 7,05636
- Sequence
-
mtipaswlpaatmkrvilhwsagthtasaldrahyhfilegdgnavrgdrsikdnegpirgnyaahtracntgsigvslacmagatespfrsgrypmmktqwdamidfvvelcrkyridptpetvlshaevennlgiwqrgkwdytrlsfdasikgaraigdrirtavrdrlaeglvereieeplggdraepvamalrgsttapwlnfrrapmgdiigglprgtsltiqdtddgwyqartpagylgwvsahyvcld
Other Proteins in cluster: phalp2_9404
Total (incl. this protein): 29; avg length: 266,6; avg pI: 7,38

| Protein ID | Length (AA) | pI |
|---|---|---|
| 8Fz4V | 256 | 7,05636 |
| 1ozsV | 271 | 6,52838 |
| 2B2Dj | 256 | 6,24225 |
| 2B2zm | 256 | 7,05636 |
| 2CiDn | 357 | 7,11075 |
| 2G98 | 294 | 7,78535 |
| 3Yi0p | 250 | 9,59543 |
| 3YikC | 251 | 5,67631 |
| 3Zbkf | 280 | 6,38924 |
| 4VXIr | 308 | 7,84988 |
| 6RlF6 | 281 | 6,79888 |
| 6RyGY | 248 | 6,59482 |
| 6VXe8 | 256 | 6,16961 |
| 726IM | 258 | 8,58598 |
| 7Rdyc | 257 | 9,19501 |
| 7lIZV | 258 | 7,82010 |
| 7lIZu | 258 | 7,70500 |
| 7lxqj | 280 | 6,17234 |
| 7og3Z | 278 | 6,96252 |
| 7pFgt | 285 | 6,71788 |
| 7rNQe | 279 | 6,59624 |
| 7w50p | 319 | 6,63592 |
| 7wDNY | 279 | 7,68437 |
| 7xExm | 263 | 9,29571 |
| 8D0k3 | 232 | 9,51490 |
| 8jfg | 248 | 9,64816 |
| j9cS | 306 | 6,86555 |
| A0A068CC96 | 174 | 6,64257 |
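
The table lists 28 members, while the summary reports a total of 29 including this protein. As a hedged check, the sketch below recomputes the average length from the 28 listed lengths plus the 193 AA of C4NTF2; it also shows one way to parse the decimal-comma values used on this page. The helper name `parse_decimal_comma` is an illustrative assumption.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '7,05636' to a float."""
    return float(text.replace(",", "."))


# Lengths (AA) of the 28 cluster members listed in the table, in table order.
cluster_lengths = [
    256, 271, 256, 256, 357, 294, 250, 251, 280, 308, 281, 248, 256, 258,
    257, 258, 258, 280, 278, 285, 279, 319, 279, 263, 232, 248, 306, 174,
]
QUERY_LENGTH = 193  # length of C4NTF2, which is not a row in the table

all_lengths = cluster_lengths + [QUERY_LENGTH]
avg_length = sum(all_lengths) / len(all_lengths)

print(len(all_lengths), round(avg_length, 1))
print(parse_decimal_comma("7,05636"))
```

With the query protein included, the mean rounds to the reported 266,6; over the 28 listed members alone it would be about 269,2.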
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_1021 (6Rshc) | 90 | 64,1% | 170 | 7.214E-91 |
| 2 | phalp2_19974 (6PE0f) | 7 | 55,2% | 192 | 2.140E-82 |
| 3 | phalp2_21458 (8eHRi) | 401 | 55,4% | 175 | 4.014E-82 |
| 4 | phalp2_4723 (4RvuF) | 8 | 54,3% | 186 | 1.441E-75 |
| 5 | phalp2_20043 (1rrOC) | 14 | 52,3% | 172 | 2.735E-69 |
| 6 | phalp2_22282 (7wBZw) | 14 | 41,9% | 181 | 6.844E-54 |
| 7 | phalp2_29754 (1BARz) | 1 | 31,2% | 163 | 5.688E-21 |
| 8 | phalp2_30152 (3nf4u) | 1 | 27,4% | 164 | 2.630E-15 |
| 9 | phalp2_24708 (6km5N) | 73 | 20,7% | 241 | 1.284E-09 |
Domains
Domains [InterPro]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Sulfitobacter phage EE36phi1 [NCBI] | 490913 | Schitoviridae > Aorunvirus > Aorunvirus EE36phi1 |
| Host | Sulfitobacter sp. EE-36 [NCBI] | 52598 | Proteobacteria > Alphaproteobacteria > Rhodobacterales > Rhodobacteraceae > Sulfitobacter |
Coding sequence (CDS)
CDS Source ID
FJ591094 [NCBI]
CDS location
range 63453 -> 64034
strand -
CDS
ATGTTGGGTCCAAAGACTCGTGAAGGCATTGCTCAAGCCATGAAAATCAAGGGAGTTCGAGATGCAAAGCAACTGTTCACTCGGGGAGTTCGTGGGATTGTGTGGCATTGGACTGCTGGTGCTCATGGCATTATTGACCTTGAACGTGATCACTACAATTGGCTGTTTGACCGTATGGGTAATGTATACGATGGCAATCACTCAGTACAGGATCAGGTCAATTACGATGTACGTTCCGGAGTGGGTGCGTCACATACCAAGTCTATGAATACTGGTTGGCTTGGTTTGTCTGTCGATGCTATGGCTGGAGCTATTGAATCACCTTTGAATTGGGGAACCAATCCTCTCACTTGGGAAGGCATTGATGCCATGCTCGATTGGACTATGGATCTCTGCAAGGAATATGACATTCCAGTATCACCATGGACGACTCTGAGTCATGCTGAAGTAGAACAGACTCTTGGTGTTCGTCAGCGTTTCAAATGGGACTACAAAGTTCTTCCCGGTGACACTCGTGCTCGTGATGCACGAAAGGTTGGGGATGAGCTTCGTAATCGAATGGTGAATCGACATGCAGCGTAA
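
The CDS coordinates and the protein length can be checked against each other: the range 63453 -> 64034 spans 582 nucleotides, i.e. 194 codons, which is 193 residues plus a stop codon. The sketch below does that arithmetic and translates the first and last codons of the CDS shown above. It assumes the standard codon assignments (the bacterial and phage table 11 uses the same assignments for elongation); `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the "CDS location" field (inclusive on both ends).
cds_start, cds_end = 63453, 64034
cds_length = cds_end - cds_start + 1
residues = cds_length // 3 - 1  # subtract the stop codon

print(cds_length, residues)
print(translate("ATGTTGGGTCCAAAGACTCGTGAAGGCATT"))  # first 10 codons of the CDS
print(translate("GCAGCGTAA"))                       # last 3 codons, ending in TAA
```

The sequence is listed starting with ATG although the strand is "-", which suggests it is already given in coding orientation; no reverse complement is applied here.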
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0001a41c23_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
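
The confidence bands above can be expressed as a small lookup. This is a sketch only: the legend does not say which band the exact values 90, 70, and 50 belong to, so assigning them to the lower band is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend uses strict inequalities throughout.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 80.0, 60.5, 30.0):
    print(score, plddt_band(score))
```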
The structures below correspond to the cluster representative (8Fz4V) rather than this protein.