Protein

Protein accession
4RKvr [EnVhog]
Representative
4RKvr (this protein)
Source
EnVhog (cluster: phalp2_40572)
Protein name
4RKvr
Lysin probability
93%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MIDSSKLMNRKEGQTKFFFKANKKIISINTFLQRSLNDKRKKSAKKRNEEFLRRRKKQEDRLEDRKSDTDGGVKKFGGKLGQLTGATNLVNGFKNAIGKLLFGFFAIRLLKYLPVLKNILPLINGAANFISSIGIGLIDGFANIINIGYQAYDASKVFMKQIGGEDMEASFDKFMGAVTSVIDILLLATLIRASDSFGGPLKKPRGGGGGGFFFRRGRGPRKPKDPSDPKKPDPDKPGRGFPAGGVLAGVLAAALIFAATRGKVKLTPAQALNSIKNARNLREVLKKSRREVAKSRLNARRAVEAEAEVKQTRRARRKDFSQQRFESQQRVEAEAEVKQTRRERRKVFSQERVEAEVGGRGGTATSGQRTPPSGSKKLPYGRGVNPAEMRSEVGDEMFEQRKRADDAYKNITGKEPPKKSKGSQGSTSTSKPGKMQPARTVVKPAIITDKALANRILTDEKFGSLFLEHLREKFNLRNPQGRITKGVTLDEYAALSGTTTDQLIKSELAKTGPTSFKGRLKLTQQQVVPLKPQIKVPRTKEGFIKPKLPTAEMLQRIVPKNPKQKISFLKRIFGFTDNVFSRIPVIGPLVDFGINLAFGDPLPKAATKAIFAAIFGGIGMAAGSVVPGAGTLIGGILGGLAGDIIGGVLYDMVVGNISPSELPDPKEYNPYVDPKLTSKALNLEQKQNVQRATDLRSAIRQGESDGNYSATYSGYLKGFPRAGEDLTKMTISQVIQYQKDYIDYQRSLGIPPNKRSAAVGAYQMLYPEKAAAYVGIPLNAKFNKENQDKMLDYYLDMAGRKQYERGEITAEVYNDRLAGQFASLKTVTGEGVYDYDGINRATKSVLDLIRKEYPDRSQGLNQETSYGEQASNTFIIQRRNLMQSLGQGESTQAPSIFLSASDSYDNKNILYEIG
Physico‐chemical
properties
protein length:914 AA
molecular weight:100670,2 Da
isoelectric point:10,01
hydropathy:-0,53
Other Proteins in cluster: phalp2_40572
Total (incl. this protein): 5 Avg length: 894,4 Avg pI: 10,09

Protein ID Length (AA) pI
226GP 913 10,07791
3Hr6e 881 10,08519
8p0Xn 880 10,16281
8qi1N 884 10,13238
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10405
1UQEg
7 29,3% 702 6.344E-50
2 phalp2_843
7YTVM
6 26,4% 699 1.584E-45
3 phalp2_40363
3BWeg
13 26,9% 657 4.576E-44
4 phalp2_26438
1Phed
42 23,6% 865 5.442E-28
5 phalp2_21201
16ol5
2 23,5% 859 6.679E-24
6 phalp2_28457
1Sn2p
7 23,3% 633 9.750E-19
7 phalp2_474
8zFll
8 25,1% 673 3.540E-17

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4RKvr
Method AlphaFoldv2
Resolution 56.08
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50