Protein

Protein accession
5BmpD [EnVhog]
Representative
4M8sC
Source
EnVhog (cluster: phalp2_5727)
Protein name
5BmpD
Lysin probability
89%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MASLFSVFARVHLDTSAVEKEAQQSGTKSGKAYADGFYKDSSGRLRQANGRFATDAQKAALTSGSGAGTKFGNEFGNKAGKGMTSKFKDQLKGLAGFSAAAFVPLGIAGAVAAIGKIGIQYENNLNIFKAVSKATGEQMAQVAAKARELGADVKLPGVSAAGAAEAMTELAKSGLNVQQSMDAARGALQLARAASISEAQAAEITGNAINAFGLQAQDSTFLVDELAAAANSSSVEVQDVSNAFKMAAAVFSSFQSPTVGAKESVTELNTAIAILGNAGIKGSDAGTSLKQALLQLTGPSMQAKDQMALLAQRAAGANISLEEQNDVLRGSKSVREKALAAIAKHNRGLQDTGDIAFTSAGKMRALPEIIDLVTRGTKGMTDEEKDYAITQIFGADASRAVIALMKGGLPAYNKMRTAIMQQGAAAQVSAAKNAGMGGAIDNVKSQIENASIAVYNAVKGPITTALNAIAAALPGIFATIGGFFSFLGQHIGLIRDLAIEVGIMVAAFKTYQLTLLAVTAATKAYAAVQTLIAFVQLAAQVRNFAGAMALLNATFLANPVGIVVVAIAALVGGLILAYKHVGWFRAAVDGAWKGIKIAISATVGWITGTVWPALKTAWDAIAAAAMWLWHNVFEPVWKGILAAVRFAIAFITAEINIGKAIFNGIATVVSWLYNNIFKPVWSAIATAVHNAWTIIQVVFAIMKIGVSALFAWYKAAWDNVLHPVFNAIVSVVKWVWNNGIKPQFELIKAAWTAVAGWFTGIYNSKVKPLFNTLSLYIAAIIIVLKAKFELVKAAWTTVWSAISTFYTTNIKPIFTKLIGFLQDYVVKGFTNTVNNIKSAWAKVQDAAKKPVTFVVNSVINPLIRGFNKIAGAVGVKDRIAEIGGFAAGGRIPGAPSAVDNRLAQGPGGLLKVASGEYIVNARDTAKALPLLKWVNAGMKGGPGKAAEYLGRKVTDYPGDGSEGWAFSHGGRIPGYADGGLVGWVKNLGKDVWGALSNPTSLIKAPLEAALSKIPGSGMIRNFLVGTGKKLINGLTGFASKIVGGGGSTPVGGRVGAAASFIKAQDGKPYVWASAGPNGYDCSGIVSAAYNILKGRNPNQHTFSTANAESYFNQHSLSGPLIAGWSHPGQSPASASVGHMAGQIAGMPFESRGSRGVIVGSGARRVGQFAQRGAAGLAHGGLVGNRQVQLMDQGGAWPTGTVRANMSGHTEQVLTGGPGGDINTLADLLTAILDSIRALGPEVAAALERPTRRAVQLGRSRGAAT
Physico‐chemical
properties
protein length:1264 AA
molecular weight:131319,5 Da
isoelectric point:9,86
hydropathy:0,21
Representative Protein Details
Accession
4M8sC
Protein name
4M8sC
Sequence length
1277 AA
Molecular weight
133953,69320 Da
Isoelectric point
10,42146
Sequence
MVALASAFVRLRPQPDSTEFRKQGAKMGTEAGTAAGRTFGDGFSKDATGRLRTANGKFATDAKKALGAAGGASGSSFGDGFYRDASGKLRQANGRFATDAQKRMLEGGGAAGASFGDAFNREGGGRIRKGFSSLKNELGPLLVPLGVAGAVAAIGKIGIEYENNLNTLKAVTNATGAEMAKVSERARALGADINLPGVSAAGAARAMTELSKAGFTVQQSMDAAKGTLQLARIAAIGEGEAAEIAANAVNAFGIKATEVNKVVDQLAATANSSSVEIKEVSYSFKQAAAVFSGVQGPAIGAQESITELNTAIGILGNNGIKGSDAGTSLKQMLLQLTGPSEQAKDQMAFLALNARSANISLEQQNQVLHGSKKVRGEALAQIAKMNPGMQDMGDIAYDANGKMRGLREILDLTAKGTKNMTQEEKNYAVTQIFGADASRAVLALLKGGLPVYDQMRAKVLQQGSAAKVAAAQNAGLGGAIDNVKSQFENAAISIYDQVKGPLTQGLNALAGVLAPLAGGIATFGAFVRENGETIKQWAIAIGVVTLALKINSAMLAVQAAGGLLAYVKGIRIVTGTTRVWAGAQALLNATLLANPIGLAIVALAALGAAIYLAYKNSETFRRIVQVVWGAIKTAIAATVDWFVGTAWPSLKRAYDQLAAAAMWLWKNIILPVWNGIKAVIGVVVTVVRGYISLIVAEFRLIASVATWLWKNIFTPVFAAIRKVVEIWWFAMRVAFAAFVKIIRAAVMPVVRSAGSLFSTIFRVIRAAVSAWWADMKRLFGLFRQYVVGPLMNALRTMLAGWRTIFSAVSRVVRGWWTGTVSPIFATVRRGWQALAAAFSAVYNGKIKPLFSAFIGFMRNNVAGAFKTAVNTIRAAWAKVQEAARKPVAFVVNRVINPFIGGLNAAAKVVGVKDRIAPISGFASGGQVPGSRPPGGADNHFARVKQTGKLFAIGSKEFITNSKSTEANLPLVKAINAKRGKVTRDDVDPFLDGNDGHGRGDGIGDLFKKIMAPFRGAASVIADPGKAMRNVANSLLAKIPGSGQLVSMIRSASSRLISGAANWLSNLGLSGGAIGGGGVAGGWKGMQRLISGRFPGLGMISGFRRGARTLSGNRSYHSLGRAVDYPPNRALAAWIRGTYGGKTKELITPFQDLNLHNGRPHRYTGAVWNQHNFAGGNAHVHWAARDGGLVTGKTGMPVKLFDQGGFWPSGTAGINLSGRTEYVEPNRDGKAGADSGPVRLHPADIDLLGRVISREMATTVGAANYAAGRRTGLYIQGA
Other Proteins in cluster: phalp2_5727
Total (incl. this protein): 15 Avg length: 1201,5 Avg pI: 10,05

Protein ID Length (AA) pI
4M8sC 1277 10,42146
2eC8I 1329 10,23244
2eCbd 1215 9,75428
2kO6o 1249 10,03433
3gGNW 1244 9,80205
4AjHo 1170 11,36483
4N5p8 1246 9,98114
4qAom 1092 10,33804
5keps 1256 9,87328
6TKlR 1195 9,84788
6UfTI 1250 9,82165
7yFHT 1070 9,56242
7yFHb 1085 9,79953
bhU7 1080 10,11040
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39889
1d7HT
37 28,0% 1125 6.506E-140
2 phalp2_6983
383zW
1 25,0% 1014 5.809E-104
3 phalp2_9509
11Acs
224 26,1% 876 5.498E-85
4 phalp2_812
3M7B9
1 26,3% 862 1.963E-80
5 phalp2_24626
5tJdD
1 23,9% 1019 2.113E-63
6 phalp2_9232
6ThaE
1 23,4% 1303 7.129E-51
7 phalp2_3959
7qClx
89 21,8% 1078 6.521E-50
8 phalp2_32490
1ep70
5 21,9% 920 1.251E-44
9 phalp2_13205
2G5Se
2 23,5% 835 4.478E-37

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4M8sC) rather than this protein.
PDB ID
4M8sC
Method AlphaFoldv2
Resolution 61.51
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50