Protein
- Protein accession
- 5knE4 [EnVhog]
- Representative
- 5knE4 (this protein)
- Source
- EnVhog (cluster: phalp2_28879)
- Protein name
- 5knE4
- Lysin probability
- 98%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MADPFGNFMSGIVAGTDRSQVPQSLRQRVALALLMKKQAYPKTLGEGLASIGDSLGDAWATRAIEREAAAAEAAGAAARSDVTGAPPAAPTGYAPPDSVENEPGPAAINRAAPIQPPAALPPPPPGALAPPPDMTNTPAGQQPPSTMFSPQRMQGAPGLTSSRDQIAGMLDPRQMTNQRVPPPDQQIPPGASPADGGYNMIDAQAGFKRPTPGYIQDAITRNVADPDRQAYLGSLVGGEAPRGPADVSPTGADGPFQFTRGTGRQYGLLGPQGDQRRDLDASVQAADRLTNDNAAAFQKINGRPPSPAEMAVLHQQGGVTGSRMIAGTGNAPAGNLAVNNIPPGASPDQAVAKIKSYYGMPDQATGTQRNAITQQLVAGQQQGGPDQRLAFNGPTPPAPPTAAPPPPQQPITAAPPQQPITSAPQATPPPGYVRDIPPEPVPPPTMTPLMQRIQQKIDSTPASQRDSVKEALAPTLANEQAKLAQEQAIYKDKVTQRHEAIKQMEDQKATAAGRVVEVAKETEAVTKARDENRLRAQFANLPPEEVFKKVNESHKIAKSGQDALVASAAAMKAFKEGAITGYGADQKLNVAKLFTALNLTDKGNLIANTETFKSAMQPVVAAILHQTSGTSQLSEGELAFAKQAAAGNITLDAKTIPRLMEIIDKRSREVIKDHQTLTGAMFGDDPKAKAVYGVDMPTQAEAPREFRSEAEVAAAGLKPGTKIRVNGRNATVQ
- Physico-chemical properties
- protein length: 733 AA; molecular weight: 76718.5 Da; isoelectric point: 8.37; hydropathy: -0.51
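The record does not say how the hydropathy value is computed. A common convention is the GRAVY score: the mean Kyte-Doolittle hydropathy index over all residues. The sketch below assumes that convention; the function name `gravy` is illustrative and is not part of the database.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the record's "hydropathy" field is a GRAVY-style average.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("sequence is empty")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Calling `gravy(...)` on the full sequence above, together with `len(...)`, would give values to compare against the listed hydropathy and protein length. A negative score indicates a protein that is hydrophilic on average.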
Other Proteins in cluster: phalp2_28879
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_38769 (1eVZS) | 13 | 28.5 | 494 | 8.700E-48 |
| 2 | phalp2_15004 (6XBZb) | 1 | 23.1 | 859 | 1.147E-43 |
| 3 | phalp2_26961 (4F1Fi) | 2 | 24.6 | 746 | 3.537E-34 |
| 4 | phalp2_15654 (2TjbM) | 18 | 24.5 | 729 | 6.210E-34 |
| 5 | phalp2_16451 (6Eklt) | 7 | 23.5 | 756 | 1.149E-25 |
| 6 | phalp2_35818 (4Cb1M) | 1 | 24.3 | 538 | 1.865E-24 |
| 7 | phalp2_14465 (4HQ71) | 1 | 25.9 | 754 | 1.539E-18 |
| 8 | phalp2_26959 (4F0NP) | 9 | 21.8 | 688 | 4.930E-09 |
| 9 | phalp2_4803 (588t7) | 6 | 22.4 | 467 | 6.500E-09 |
Domains
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
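
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the function name `classify_plddt` is illustrative. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning each one to the lower band is an assumption.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, because the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


def band_counts(scores):
    """Count how many residues fall into each confidence band."""
    counts = {"Very high": 0, "High": 0, "Low": 0, "Very low": 0}
    for s in scores:
        counts[classify_plddt(s)] += 1
    return counts
```

For example, `band_counts([95.2, 88.0, 61.5, 42.3])` puts one residue in each band.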