Protein

Protein accession
4Xz3j [EnVhog]
Representative
4EoQg
Source
EnVhog (cluster: phalp2_31864)
Protein name
4Xz3j
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MDATAAKADALGKKNPTITVKADTSGASSDLDKAAAKADALGKKTETVTVKADTSDVSQKISDLTSGPFSLLKTAIVGLGPAAVPVLGTVTAAVLPLSAAVGSAAAGLAAFGTVSKLVLTQAGTAATAASTAQATYTAAIAKAAGQYTIASAAAKTHTQQVAALAAEHKADATAAVAQAAALSTAYSGMSSQQIALSKQVGDLDKAYKGMLTSVTPLVSSALVPWMKDVQNLLQYIKPLISPVAALFKSWGDELNRSLTGNTANITKFMDAFAGRSATNIRLFGDALVEFAKGAGSLIHDVAPNLAGAADGVLKLAQSFAAWAGSQKAADDIKAFFSWAQTNAPTVRKFLDDLNTTIGHLVIALASAGPSDLKVLTAGLSALAALPSGAIVAIADGYVALSLGLRAAAVATTLYEIASKDAAGYTLLMRIQFGLLQVQEKAVALWTGLVTAATWLADAAETAYVAVMLLADAASLPLIVVIGGIVLAVAALALGIYELATHWNTVWHDILAVSQDVWGWIKNNWPLLTAILLGPIAVAVLEIVQHWDTIKDAFSAVSAFIAADWKAFTSSLLAPVAAIVHGISVAWAAIEAPFEAVIDWIKTHWQDTLVLLTGLLGVAILVIIHFWPQITTAVETVFNGIASYFTTWWATATSRFTAAANILKGVLTAAWSAISTALKAAWAAIAAYFTSWWNAEVAYFKGPITAIENALKAAWSAITSVAKTAWGAISSYFSSWWNSELSTFKNDAAAIGNALKAAWDAIESTAKSVFGGILTYFGGFWSALKTGFATAVSGVATAWNKLETIVKTPVQWVVTNVYDKLIMPFWNATAGAIGLTKLPKLAGGGMVPGGYSPVDNQLVWMRSGEGVLQPGAVMALGGPAFIDAANRTYGDVPVGSSGAGHYASGGTAPGNVGQGLHTGTTAEPSGSAPAGLGSIGGILSGLVHDAAGVISEVETGVFDALSSVGTRLIDGAVKLIPGSGGVATAMRDYPPKIWSAFLSWVTAHAGNGGDIVKYAESFLGKIPYVWGGTSLSSAGADCSGFTGSVLSHFGYDPPRTSEAQGSWVKRAGPQAGGLAFYHSPAGGADPGHVAIVASGSQVVSQGGGMGPKMMALHAMPLLWTGIPPGGFKSGSGGGSTAMAGGGSASANQALGQRMAAAMGWTGAEWAALNAVEMREAGWSTTARNASSGAYGIAQFISGPSEYAQYGGSSTTAGGQIAGFLNYVKERYGDPIAAEAHEAQYGWYDQGGWLQPGMTMVMNATGHPEPVLNPDQWDRLTSAVGSGDGIGDKLDRIAGLLMAGPRATAGGVSEVLNGTAHTASFRARYPRNN
Physico‐chemical
properties
protein length:1327 AA
molecular weight:135870,0 Da
isoelectric point:6,33
hydropathy:0,31
Representative Protein Details
Accession
4EoQg
Protein name
4EoQg
Sequence length
1191 AA
Molecular weight
124971,71380 Da
Isoelectric point
7,77916
Sequence
VAFDAGSIEAHLTVNRDDFDRTLTQAKADADRWARDPVTVKIDADDDPAKAVFDAVDKRKATTGKDVAFKIGADGGQATTEADRIQARKDKLAADVNFKVRADTAEATGDLVKLQAEKDKAGTDTSFKVKADVDTSGLDDIRKKVTELSAGPFGLLKTAIVGLGPAAVPVAGAVIAAFAPLAPAIGSAAVGLEAFGQIAKLALTPAATAATAVYKAQNTYNTAVASGTKQATAYATEQKAIATAYAGMSAQQITLSKQVGVLEAGWRSTLKAVTPLVSSALTPWLKDVQGLLGYIKPLIAPIATDFQAWGVQLGRALDGNQAKIKSFIDLFAARSAGNIASFGDALISFAKGAGALIHDIAPELGGAANGIAGLAASFDGWASSQKAADDIKGFFTWARAETPLVRQFLDALTGSVGNLLKALAFSGGGDLRVLTTALQAISALPPGAIVAIADAYLAISVGLRAATLAMGAWNIASTIAKGIQAALSDEIKVTGAALVIQKIATLAIAAADAVVDAAETIYIALMLVADAVSLPLIAVIGAIVLAAAALAFGIYELVTHWTTVWHAILAVTQVVWNWIKGNWPLLVGILLGPVATAAVLIYQHWNTIKNDAIDVWDAIKSFFTGWWSGEVASWKSTLNTVTGIFRSAWNTVSSDARSVWNAIKSFFTGWWSAEVSGWKTIISQFTGFFRAAWNTVKSDAQSIWNAIKSFFSGWWTTEVAGWRATIATVTGIFRNAWNAIYSDIKSSWGQIKSWFGSFWSSLESGFNTVVGRIKTIWKGFQTDISAPVNWVISNVYDKYIVRFWNDVAGAVGLPKLKGLAEGGIVPGGYSRSDNQLMWMRSGEGVLQPGAVDALGGPGFIHWANAKYGDLGAGQNAPGHFQFGGIFHDIGSLLSSPIREVLSIGKTLEHAVDDGVLAPFKAITRIMVNDLAKIPGPKGDGMVGIMQKMPLKMWDGFVSWVGNHLPFASSGSGGASGPGSAKGNAITDFAERYLGTPYVWGGTSPSGWDCSGFTEFVYDHFGWTPPRTSEEQFGWVKRIAAPVPGALAFFAGSPIDPPPGHVGIVTSPNTMIDAYGTGYGTIYNTINGSSGTVMGFGIPPSGFKFDDGGWLQPGASLVVNTTGRPEPVLTGEQWGAVIGGGPATERLDAIIAHLGALVNATAAGPSATAGGLGQVLNSTARNAAFRNRYRTR
Other Proteins in cluster: phalp2_31864
Total (incl. this protein): 2 Avg length: 1259,0 Avg pI: 7,05

Protein ID Length (AA) pI
4EoQg 1191 7,77916
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26986
4JjfT
15 27,1% 956 2.000E-48
2 phalp2_10370
1Ifz3
14 26,8% 967 2.000E-48
3 phalp2_7749
6XVyZ
2 23,8% 1351 3.481E-48
4 phalp2_31829
4v9VQ
2 21,8% 1402 1.290E-43
5 phalp2_33432
725yQ
26 26,9% 1074 6.219E-39
6 phalp2_28099
7ucAa
137 25,1% 982 2.975E-37
7 phalp2_5099
1ICO0
13 22,9% 1161 2.711E-36
8 phalp2_26448
1QZgj
1 22,1% 946 8.865E-31
9 phalp2_3859
6Ic0G
5 22,7% 1011 1.168E-30
10 phalp2_23567
7wB1x
1 22,4% 1034 2.673E-30

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EoQg) rather than this protein.
PDB ID
4EoQg
Method AlphaFoldv2
Resolution 49.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50