Protein

Protein accession
53VMD [EnVhog]
Representative
4grXe
Source
EnVhog (cluster: phalp2_27297)
Protein name
53VMD
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
VVAQAAVLIVPSLKDFQAKLEAQLTAIEKAQDAIAIKIEGDTKKLLADYAAAKGWIESQEIEINAKVDTRYLTEIRQKYENLSRQFKNGLMLSVKVSGISLLPQLAQAAAAANDSMVELSRTALLLPGILSGIGASVASLVTGMGGLKDAFKEYSDAQKNAAQEGLKARNAAANVRDAYRDLGRSVKDARRELEDLNAQLRDAPLDEAEAIIRLQEARAEAAVTFEKSALQQQKDALAVIRAENDLADTRLRNSRTIEDVATANAKGVAASDSVVEATERLAKAVDDAATKNTKLSDSFAQLSPNAQQFVEAITGMDSQWTALKNAVQDRLFDGLAGEVTRLAASDLPMLQSGLSAIADEINGNFKTAIQSLQTSENRGFLSQILGNTAEAQGEFDQVIESLTNGFLQLSAAGSDYLPRLYRGLGDVMERFERFVAVSDADGSLDRWIESGIESFRDLGNALLNVGSILNSISDAFTGGGGKGMLELLEESSGRMAEFLRSDAGQQRLLDFFRETRGQLAEWKPFLVEIPGLMQSIASAGESWASKLLPFLTVATDFLNSHPGLVSAVLQAYLSWKVISPVISGISGLLSGDLVKALKGASGEVGMGFKTGGLTGAFSTLAGMIGVGGVAYSAVTAMIALLAYKYIDVQNEAAQAAQSHADALRNIANQVDETTGKLTQSGLIEQLKELSNYQNFNLAGDPNFDLMAKARGIGITEDKIVAALQPGAAGQSEFEAIQNRNREVVSKALQGQKDSLYWGAKSQLDKMGITFDDVVSAILGDEAQNKRLTDAGQGFNAVDLLQGINGPLGIGSVPGLGEDSIIAANLNRGLIEKRRNTQAQSDRVNSANVALNGNATMRPGTVFDQLGNPQASATTSGGVKISVDTPAPEIKLKFPEIDELIQEKGGSVSADANGSTIEFLDPRTAREFATFDKYAAGGRISGPGTGTSDSILARLSNGEFVINAAATRQHLPLLEKINNGELPGYAGGGLVLPDLPRFAEGGGWWDQVKEFGGGIVQGAKDAASSLKEFVTNPVESVKAMAPLVGLGGEGAPGVKDSWVAAGKSTIAYDDWSQGREATGAGKNVFGILSAVIPGAAATKAVQSLKAAPVAAKAAEAAAPTVSARALPPSAVGSEAVKVSGLSMDQLRANTGRTQVATTGGTVDAIHAYLDAASVGTRSIDVGSGRGFASKYGDTQEPNPKPNVNPTYRDLSEVAERAYDRVTNTNVLNVIPDVAERAALVQHIGRILSDKGRAVITTRGKDVADAKNKVPLSSEPNAWMIGKGKNQTFQKGYEPEELREFVQSVLGEGFSVEVAKVGSAPAAIQIARRLDSRAAADGTPARNRQIDNEKFEKTWARAIAADRPKVNQRTLVEKKLNDDFFDSPLVQIAKAQNRGDAMPGLDGVHMAGMRGTSEDILDIIKHHVEGNLFLNPNMNLKEIRRAPPEEIGGAYATANHASGLVLINSEFVDDPERFIEANNRLVAEGFFPELGPNNVYGSTVAHELIHIAHEDRLALSRNSQEMALRNAFIDLNPEFGLNPLDDKSTLGPAAFASVSKGRGMGSSSHLAIEKKFRGFLGDTLPGYSFFNKDALADPEAVFSDFYNSAEALASGGEDVYNMANSAGTGSKVLYGLLRDYWIKTNATRMAFLRAAGGDPNAPDSGVGKIRYPYIKPYRWGGLVRFATGGLNWFKPQSPPTPSPSPSPGGTTKAVGTLGSAASLGELYGVGAPPPAAPASPPALSVPVVPPKPPASSSIPLSTTLGVSTPVPAASPTTSGVPLPAEAGFTPTAVPQVRGTSIGGALAPLLAKAAPYGLPAESAISYGQPGFPSWVYNLGAPFKMLASTYKNHQAGSGYNRGIDWAPDGVAWNTPEGAGIMTRFAKYLASLGVMEQVIYQNPFTGETVGVYNGKLVGPGTDMPWYYAKDWAGHQNHIHTRTSFGIPNPQELSVLLNSGGLSGRKSNNSVLNSVLSQLGALAPSSRSNMPMAMTPGGIGLPQYTFGSGAAGSDVQKPLTGEQILQNYISIVSDSWSNIIENLVKNAGQIALRFIGSFFGLDFSQIINAANSVMGDVGGGIADLFDPNFGEDESEALDPLANLPADATVDQVIQSSQFGMLPPQAQADYYSMYESAAAGGEQFNAVEALRDVSAQAEQMLSGQAPGGVGQAAGAVAAYDPTKGAEQWRPVVQAILNNVAHKYGITNKKAWEDDIIGQINLESRGNPNVDNPNDSDGKGGTQQVFGLGQFHPETFAKHNKTGGDIRDPIAQIYAMIDYLASEKYGVIPDGGVNWKGVGWRNGKGYAAGGAISGPGTGVSDSILARVSNGEFIVRASETRKHLGLLQAINEGTLPKFNNGGLVQPMPLVPGFQPPAPPPPPPPPPTPPVVPDATKVPNAAVPEPQPTAPQVTPEPLAPTPDPVTDPLEQVSGIGDQIGSALGGIGGVTQGAQAPAGADPAGDPRAVMAQAPANLDHNHPAVSGAISAGAGAVAGAITTAMQAASIAGMGMSSGASAAAGPAIGAASGLVSGVAQAAGGAINGAVNILSSLMVGTLSSSPGGTGGAYGTPLLPQGKNPQGFNGGGGQVVNNWGGVTTSNPDEFYKIQQRKELQNASPFLARR
Physico‐chemical
properties
protein length:2609 AA
molecular weight:272662,9 Da
isoelectric point:5,19
hydropathy:-0,18
Representative Protein Details
Accession
4grXe
Protein name
4grXe
Sequence length
2291 AA
Molecular weight
235981,91410 Da
Isoelectric point
5,38364
Sequence
MAEYVVAQASVLIVPSLKKFQKQLEIELRRNQALTKPVTIDVEANTKKFIAEGMAAKKALEANPVNIRFRVLDATKQITEIRHKYEDTAREMKKGLVLNLKVAGMSLLPQLASGLAAVNASIVQLSQSAVLLPGILAGVGSSISTIMTGLGGVKDAFKEYGDAQKNAAQEGLKARNSAINVKNAYRDLGRSIRDAQRNLEDLNAQLRDAPLDEADAIIRVAEARAEAADKAQKSGLQQQKDLIALQRAENDLVTTRLRNSRLVTDAAEANAKGVAGADSVREATDRLSKAQDEAATQATKLSDSLKELSPNAQKFVTTVTGMADEWSAFRMSVQDKLFAGLDEEITRLGQVSLPTLEKGLGGIADALNGNVKAAFKALGSETNQGFFEKIFGDTAKAQEQLSDAFDPFVDSFLRLASNGTGFLPRLTEGLTDLLTRFDNFIVRAEGDGSLDRWTNDGIDALKQLGNSLINVMSIMDSLSEAFTGSGGKSFLQLLEEGSKRLADFLNTDEGQDRLKKFFVETREELAKWKPFLEQLPGLMQNVAAAGQQWANILVPILTDIGKLLAGQPALAGAVLTAFLAWKGIFPILKGLYSGIGTVNKVFDLFKTQLDNSGQRTTGFKNKVGELSSALISPAGLVGAATLAATWIGTNLANAHIDAEAAAQRQKDVVDQLRQSLDDVTGSATKATNALVAKDFREGINAATGKTNGDLLKNVSDPNALINKVAAGDLDGALSLTKGATGSDIETTEFWSKFGPSVREFGMTSDDVAKAVNGEPDAKKRFEDWNSQQTLTAGPFGTGMDLPWLNQAPGWLVQGAENAGAVTTAPDLLDIQNQLPDQVRTGSQLQGMVYEKTLGLNQAQSDIRTDNTRGFGRFRLKPGSPFEALGVVADPGVNQDRGGLVVKSQPEGPALDEFRNNGVTFTPDGTDRFIVSISPDSVLKYFERFNTGGLISGPGSGTSDSILARLSRGEFIVNAKSTQKHLPLLEQLNGGGEVPGFSDGGLFGGSFGAPPVVPAPKPQIPLTATGMLNHVNSQKPPAPSFIGGGGAASAGETAAAASIPKMEIGSSKPSRLGAAGSAVGNFFKGALGLGAPGYEETSTAKVTFRNNMSVSDSAWNNLNAPLPSGDKPNPSAALDSRDKVISGLYPWLKKGWWEPDAYGKSAPKPAAPPVKPPVAPPKPVPKPSPTVSSPPVKHGTVGAPGAGNGVPHLGETMGLGPADYGRTPTPSNLPAVSGVPAALGAPIDGLSGVLTDYQGLPVGSAINYGGEGFPAWVYEAGNAFGVSASTYAGHQEGGGTNKGIDWSPTGLDPHTPEGAARLTAFAKHLASLGTMEQVIYKNPFTGELVGIANGKPVGPGTDQPQYYAADWDGHTDHVHTRQSKAIPTPAQLRSLSNMPDGATPTGVGLPQLSYTQGSGGGLPSVLGGDSSGKGGLRLPSPQEYADYVAQSWMGTLQNMVRNAGSIGLNFLGSFFGLDLSQITGTANSIIGGIDLPKEGEGDDELSLPADEGVANILGGLGGLPPGYEDALASASGAKYDPKGGAEQWRPVVRKVLAERAAVYGIKNIKAWEDALIRQINTESGGNPGAANLNDTDGNGGTQQVFGLGQFLPTTFAAHNVTGGDISDPVAQIYAMIDYVASKYGMDSSGGPNQIGRGVGYASGGRIKGKGNGRSDSIIARVSNGEFLVKAPMAQKHLGLLQAINADQLPGFSDGGLAGFVDGGDLLDFGKGIWDGGADLVSGLDSAIKDPIGTVKGMAPLAGLGGAGAPGVGDSWLNLGKSAIAYDEWTGGQQAHAAGRNTFDIATAFLTGGYGAAAKGGAKAATVAAKPEVKAAEAVVKPEVKAAEAAAAPRISLSRNLPYTKGNIPVDPNNIYRGIAVDDFNSVEGLFGKDGIMTGAGKGGTDRLSFSHGFPLDIYTDGIVKSNIMLEGKSSLIGDGAKASKPGQYGYVDQLELSKVLSGETPIRVWQRKAAPGQSITDQHAWEAIFDSFPKKASGGSISGPGSGTSDSILARVSKGEYIVKASSAQKNLGLLNAINSGLPGFADGMLWPTTAPAPVTPPPPPPVPAPAPPAPVAGAPDPVGPQAAAEAGVPAPPTDPASSAPAEGEQAALSDIGAALGGLGGALGDGAAAPEGGTPEGDPRSALGAAPQNLDHNKPAVSQGIQAAAGAIAGAVSTAMQAASVAANASAPGSGQGVSSAAGIVNGMIAAGGSAVSGAVNILSSLGVGSVTPSASTAGAYGSPLLPSAQGQDPYRGPAVVNNWNGGVHTSNNDEFYRIQQRRELQAAAPSLPPGA
Other Proteins in cluster: phalp2_27297
Total (incl. this protein): 6 Avg length: 2132,5 Avg pI: 5,70

Protein ID Length (AA) pI
4grXe 2291 5,38364
4O3xI 1930 5,02453
A0A2I2MPD9 1991 6,16756
A0A649VC61 1983 6,22929
A0A0B5A360 1991 6,22929
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27883
8IrIz
99 30,2% 2367 3.912E-258
2 phalp2_11861
8iHQw
8 23,8% 2546 4.332E-139
3 phalp2_24482
4HJjP
2 23,7% 2065 3.355E-123
4 phalp2_383
7hFrk
4 24,5% 1767 1.226E-79
5 phalp2_22229
75u16
6 23,6% 1462 1.614E-79
6 phalp2_14941
8N0Ls
31 21,3% 2123 4.102E-51
7 phalp2_36889
7ukoa
30 22,3% 1854 9.351E-51
8 phalp2_36243
7zfxz
7 21,7% 1760 8.524E-32
9 phalp2_24934
8MTbO
5 20,9% 1683 1.008E-30
10 phalp2_6553
11DGv
4 20,7% 1912 1.855E-28

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4grXe) rather than this protein.
PDB ID
4grXe
Method AlphaFoldv2
Resolution 45.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50