Protein

Protein accession
4HBmX [EnVhog]
Representative
4EkvC
Source
EnVhog (cluster: phalp2_21881)
Protein name
4HBmX
Lysin probability
95%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
VASGEFNAGAGGLLGTNDLQQSIDTFKSAVDDLKSTVANLGTSMGSGMLRHPAMGMLSEGQTFTRGSLNGMAMGGNGSFLTSSQQRGAVILGQGQGGSQGSSSPTPTMNDTASGGGANGGGSSFGGPRGGGSGSGSSGSSGSSGSASPPPTMGSTAAGGGANGGGATFGGGSGGGPGGPGGPTGPIGGPRGPGMGATPASFLSRAGGAAMGGFAAMGMSQYGNQLLFNAYGQQMAATYGYGAQSAQNNAFGNGNGMVNAAAAGSAQGAASEFTTLNQMAGGGAIGYGNALQAANAFAFTNQGLGAAGGAQLASSVYNPVNSLRMQMLTGVSSINPQSGQMNSMASVIGGLASDNYLGGGGYNSRTGTFRQSALNASFTPGRGTSYMNLQSLGYNPTQISDIQSMTSQANQAANRGHTSLSNVLGTMGHAEYGSEGQIQSADAQLKKWGINLSTIQKQGNLSSLGMAQSQSQSSTFNEAISGATSVMTKFTQAVNYFLDKTGLNKAMGLGGGISAGAASTGLSGIMSGIAGTAMSIASIGGGAAGVSTAQPMTSQGSRPGSSLSGVSGSAATAVKDAEGQVGKPYVYGGSNPQTSFDCSGLVMWAYGQAGVKLPRTSQQQWAALKNKSVPANQAREGDIVFQAGSDGTAQAPGHEALMISSKQIVEAPYTGANIRIRAYNPNEWQHVGRPVGKGGGSGSSSVSGTQTGSTSSSLGAGNSGVGNYGSTEEVDAVASALGGGTSGGATHLMSTTTNGSATSGSKGNLNVGTAGGGGAGANKALAKKMAAQMYGWSGSEWSQGLDPLWTQESGFRSNAQNPTSTAFGIAQFLDSTWGPYGPKTSNPGLQIKYGLEYVHNRYGNPLAAEAHEKQYNWYGTGTRSAAPGMALVGDRGPELIDLGGGGQQIHNAQATSNILRGNSALPAQSPWTASPGQQLLLDTMTPANNHARGSGGGVTITLTVEKGAFQLTGTGISGSSDVQSLADAFGQAVESRLQKSELIRNIARGNTG
Physico-chemical properties
Protein length: 1007 AA
Molecular weight: 99437,9 Da
Isoelectric point: 9,10
Hydropathy: -0,29
Representative Protein Details
Accession
4EkvC
Protein name
4EkvC
Sequence length
1120 AA
Molecular weight
110217,01660 Da
Isoelectric point
9,18276
Sequence
VLGTNNLQKSIDQFDKAVTRLEKVAQQMGGSGSGSPFGSSKNNGGGTGNGGGSSFGGRPTRAGYGIPGATDYDSPAGPTLPEGGYYGHSSIRNPDGTFKNGGNPKFGGQARNADGTFASGGGGAGGNGGGPSFGGSGGQYAKGNGGHGLSSGGGSLLAGLSGIVSGLGSMGESSMGAQVPMSTMVQQGLLLSPNGTSNAVGTNRMLAAAYGSNSTSAPTNNYAFNAADASTGFSALQQMSGSYLPSMNSTGRFGLGAAASFAGANPSMSYTDAVAMAQQMAMPSMSLKMRQMGYGTSPLQMGTGQANSMGGVVQGMMQRWYGSSSTSNTGLAAALAPGQVGYTNLQQLGLSGSALSGTISTMEAYNRVVSGSGGKINDQQVQNLFQQAQGGNKSAISTLNKYGVTQSDISAQKNVAAQKTEGQSDIYNSFTAGLDDASTALGKFQSILNNIVSTPGINQIVGGGMGALGGGQSMMGNMAGGAEMLGGGLMAAKALKLLGGGGGASGLLSKLGMGGGEAATMDAASVGGGGLLAGGGSAAAALGPAAAAAAIPLTLANIHHSNGKNWLQQGPGSPTGGWNSFSGLWNDIKTGWGLWGGSNGTASPSTDPRGVQSRYARGGVITGGVRGLDTQPAMLAKDEAVLNANAAHALGYDNIERLNNAHSPGGIGGNTQMRGGVLYAASGAAILSDAKKYAGHKYVYGGPSNPQSGWDCSSFASYVLGHDMGLQLPGGSWASTTGSGASHGAVASSFAHLPGAHKVSNKPSDIQAGDVLVWSTHVGFGVGPGTMFSAYDTAQGTLQTPKNLSNAGGPGGETLTIMRYGAGGGTGSGSGSGGGTGSGTTGGARGGSGGGSGGYTSTSEASNVSAALGGGGGVSIGAGQYGAATTSSSGSGPTSGNVSTTGGGSAAQNKVLAQKMAQQMYGWSGSEWTNGLLPLWTQESGFNSSAQNPTSTAYGIAQFLDSTWGPYGKKTSNPGLQIKYGLEYIKGRYGDPLKAEAHEKSNNWYATGSTSTRSGLAVVGERGPELVNLPAGARVTDAGTTAKLMGAQGVAQAPWSASSGGMTSGGVSLSFGDINITMSGSGDVNSRQNASNAASEIVSQVKTMLDREGIYAAIRSGNKG
Other Proteins in cluster: phalp2_21881
Total (incl. this protein): 6; Avg length: 1110,8 AA; Avg pI: 9,21

Protein ID  Length (AA)  pI
4EkvC       1120         9,18276
3QQy8       1169         9,22641
3f49b       1173         9,22641
3fZGM       1188         9,41633
4CNJn       1008         9,09670
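
As a quick consistency check, the cluster summary can be reproduced from the members listed above plus this protein (4HBmX, 1007 AA, pI 9,10). The sketch below is illustrative only: the helper name `parse_decimal_comma` and the tuple layout are assumptions made for this example, not part of the database.

```python
from statistics import mean


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '9,18276' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) as listed on this page; 4HBmX is this protein.
members = [
    ("4HBmX", 1007, "9,10"),
    ("4EkvC", 1120, "9,18276"),
    ("3QQy8", 1169, "9,22641"),
    ("3f49b", 1173, "9,22641"),
    ("3fZGM", 1188, "9,41633"),
    ("4CNJn", 1008, "9,09670"),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(parse_decimal_comma(pi) for _, _, pi in members)

# Prints: 6 1110.8 9.21
print(len(members), round(avg_length, 1), round(avg_pi, 2))
```

The rounded values agree with the reported totals (6 members, average length 1110,8 AA, average pI 9,21).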
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_20458 (3AFCX)  1          26,7%         1217              5.298E-72
2  phalp2_2629 (6QN8H)   1          29,1%         1103              1.476E-62
3  phalp2_24523 (4U9ON)  1          26,7%         1087              1.632E-41
4  phalp2_6148 (5zWfk)   51         27,4%         1107              2.194E-32
5  phalp2_21244 (1jpge)  68         27,9%         1051              7.955E-31
6  phalp2_23188 (4O9u9)  9          23,5%         953               6.378E-20
7  phalp2_13767 (6TgWs)  10         21,7%         986               6.150E-14

Domains

No domain annotations available.

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  [NCBI] UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EkvC) rather than this protein.
PDB ID: 4EkvC
Method: AlphaFoldv2
Resolution: 52.44
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
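
The confidence legend can be expressed as a small lookup. This is a minimal sketch, not the database's own code: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly at 90, 70, or 50 is an assumption (here they fall into the lower band).

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) is assigned to
    the lower band, since the legend leaves those cases unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Example scores chosen for illustration only.
for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```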