Protein

Protein accession
6DqfE [EnVhog]
Representative
4EsFG
Source
EnVhog (cluster: phalp2_28711)
Protein name
6DqfE
Lysin probability
84%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MTSSSIVYRLIAHDSASRTFNTVGRSASSTERTLAKLGQTAVKAGAAMAAGLAVGLGESAKKAAEFQSEMTRISTQAGGTAKDVKVLSDQVLKLGTSTQQGPQHLAESLYHLKSVGMDNVSAMKALKESADLAAVGHANLEETTNALAGAWRTGIKGATSFHEAVSTVNAIIGAGNMSMDAFNAAIGTGILPSAKTFGLSMKQVGAALALMTDEGIDSASAATRLRMSFSLLGAPSKAAEKQLGKIGLTGLNLADAMRGPKGLIGAIGLLKEHLDKSGLSASKQSQLLSRAFGGGRSSSGILLMLNNLDVLEKKQEQINRSTGKFDDAVKMQRKTAEAQWHLLTSNLEVMGIRVGTKVLPPVTSFIHFLATDAMPTAARFGRAMAAIVPVDQIKQGFSQAQGMVGDFLKGFSGTKKATSDLLGGLFDTSPHLGSSKASVASKGPALAPMPHFGVGQVAPTTGVQGPALAPMPHGGSGLVASLVSPKAKPPKSAAQKIGETIRRAISGGFKDINWGNLGSILGKGLGDAIGWVGKHTADFTKKIAKVFAGLDFVEIGKGFGASAIPLAIGFIKSVFDPLFSLDFWKKHWLDTILAVISIIPIGRVAGVLGKVFEHIPFLKVFEPLLKGVGKLGGWIEKAFGKAVKFFGTNLWRGLAKVFPEAASVVERESGLLTTRIGVWGIKLMDKGKAAIHFLGNGIRDGAGWVIAKVGEIVGLVVKPFVKAGGWLIRKGADVARGFGGGIARGGRSIGGFAKKWVIDPVVGAFSRAGSWLLSKGRALVSGFRGGVSAGAKAIGGWVFDRILVPVANWMARSGRWLIGKGSALVSGFKSGVSAGAKVIGSWTISHIVSPVLSRFTKAGGWLVSKGGSLISGFKSGIVGTMKGIGSWIKKTMVDPVVNAVKHFFGIRSPSRVFMGIGGHIVSGLMKGMAKTSGTAIAKKVFGSLPKALGSIVKKGLVSITKLPGKALKALGGLGGDVLGLLGLGGSGGGSSANQKIGEALAAARGWSGPQWAALKNLWNGESGWNERALNKSSGAYGIPQSLPASKMGSAGSDWKTNASTQIKWGLSYIKSVYGNPLNAYSQWLARSPHWYAQGTGGAARGLAWVGEKGPELVNFKGGEDVMSNPQSMAFAKANNIKLPGYASGTITNAADRVRRDHQRVQDAKDDVARAKRRHKGVQAAETRLRAAQKELQAANIALKNAQRSAKTSIANTIATGLLKTLSTGTSSAIASAIKSLATKLLNAGYNRTAASMQKKGGRLEKLADKRASVQKTIAAANQYASDQASTIKDFLSISGTSATDIGGLISQMSGQQKTASSFVGLTRSLKARGASKDLLQQLSDAGPGSQLATILGQRNVTTQDISKLNGLVASGGKLATSFGKDMADLMYDTGKHAGEGFLAGLKATEKDLQKQIDKLAKGLIAAIKKALKIKSPSVVMRDEIGKNVVLGWVAGMDMHSHLVGGAAQRLADTASGVSVRRRYVPTVASQGGAREDALWERLASALEQQASRDSHLTGELRLDSGELLGVIQGAVKPQIKASANMQAYRAKVGRRSGG
Physico‐chemical
properties
protein length:1554 AA
molecular weight:161572,8 Da
isoelectric point:10,32
hydropathy:0,00
Representative Protein Details
Accession
4EsFG
Protein name
4EsFG
Sequence length
1172 AA
Molecular weight
119054,11830 Da
Isoelectric point
10,06463
Sequence
MLGAVRVAVLPDLAGFTPAVRRGVAGANLGAEGKKAGKAYGSGLSGALKSLKGTLAVAGIAGLAIGIDKSVKAASEFQSQMEKIHTQAGGTQRDVTSLSKAILGMTNAQQGPQQLAQAMFHLKSVGLDNVNAMKALKAASDLAAVGGADLEDTTNAVAGAWRSGVKGAQTFGQAAATVNSIVGAGNVKMGDLVNAIGTGILPSARAFGVSFKSVGAALALMTDEGIPAQQAATRLRMSLSLLGAPSNAAEQQLKSIGLTGLKLANDLRSPGGIVAAIGDLKSHLEASGKSASAQAQILSHAFGGGRSSSAILTLINNYGVLQRKQDQVNAGISKFGEDVAAQKQTAEAQFHILGANVERLGISIGNVLLPAAAGFTKFLNTFLVPGLGHLASALTGSGKGSTILRDSLFGVAAAFAALKSIGLVTRLISSSAASFGRLALVFRSMKDAAAAGTGIKTLASGVGTLAARQGTAVAGAGALGAAIGGISLSAAGATLGLGAVAAGVGYYIYTQRNSLTLSQQINRQFDTERAKLGFNAAAYDKLAGSIGKQARAQEQVSQGGGRLDAVVQNNTARITTYATQQNKAVTAGKNLSDALNSIQIRMNTSRGDAINLAQAAGVSAKQLAAGGEAGRKAAAKVAAYGQANEAAVQPVKGLNTDLQVTSNKALSATDRINGLTSALGKLLDPLTSNSDNLVTFYNDLKTATGNLRASGGAMGYLTQKQRDSRGSFNAVLGDLESLITNTHQSSTQLDANRKMVEKEIPRLYDLAGSNKSARAEVAALAKAVRGQTGDIGAGHGNRADLTKQTGQAGQTASTAKGDITRLATAIRHIPSFEGFKLQMTGSGAYTIHGPGANFIGPATGRGGVGSRPRAGGGPVTGPGGPRDDRAGLFALSNKEWVIQARSASKYGSTAMSAVNAGTAQIVVPGLAAGGLVEAGSQAVLSGQYAVNMTGTFRKDLTHSMASAMREAIKKDKAAAAAAFGAGEQGDTGARTRSAAVAQAYAASQLGRFGWGPGQMGPLIMLWNQESGWSAYAVNQSSGAAGIAQSLGHGPVTLGDYVGQINWGLGYIKSTYLSPAGAWAHEQAFNWYGNGLRDGVFDRPTLIGVGERGRERVNIEPLSRPQSPATGGSAAGARQPQTIVGAQFTGPVYLQDKTQALLLGQQTSWAISKRNWG
Other Proteins in cluster: phalp2_28711
Total (incl. this protein): 57 Avg length: 1362,6 Avg pI: 10,06

Protein ID Length (AA) pI
4EsFG 1172 10,06463
15Nab 1004 11,20050
1Ncaa 1168 9,88747
1dYr0 1029 10,59430
1qXxZ 990 10,47600
2S3pY 1479 10,29517
2SnJP 988 10,66837
3yJJB 1259 6,40623
4E3Um 1263 9,99732
4E4g7 1144 9,68762
4EAko 1604 9,47519
4EKEw 1140 9,67259
4EKJ7 1084 10,34984
4EKRe 1141 9,65628
4EL03 1189 10,32875
4EL4E 1396 9,65706
4EaDa 1321 9,79998
4Elu8 1020 10,32244
4Etgf 1296 10,53312
4Fbsy 1504 9,74577
4FwX2 1223 9,13570
4HDAa 1474 10,43796
4KBXd 1485 10,18235
4LtDi 1553 10,23669
4XMz1 1336 10,17635
5nEd2 1224 9,81281
5z3Li 1210 9,26142
5zEB1 1486 10,14605
6CDb9 1560 10,20323
6DXZS 1336 10,39941
6DhV8 1532 10,05038
6DhXs 1547 10,26435
6Dift 1554 10,21896
6Dinr 1554 10,21877
6EJUx 1550 10,19653
6EZjr 1554 10,31406
6Ewu1 1651 9,90952
6F71q 1558 10,18815
6Fh0h 1362 10,12039
6H151 1562 10,26351
6HASP 994 10,30110
6ISjY 1340 10,09996
6ITLs 1495 10,26377
6J11L 1121 10,04122
6Kl7h 1554 10,28008
6KlZj 1551 10,33984
6RUZf 1589 10,30535
6RXV5 1582 10,30148
6SIIe 1599 10,31412
6SJWz 1147 9,92576
6SK3Q 1082 9,70270
6SLGB 1524 10,23605
76lrH 1485 10,15894
7p6IC 1475 10,00538
ghjk 1515 10,06559
kRn2 1557 10,23096
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15332
1p6ht
1 23,8% 1283 2.350E-84
2 phalp2_3559
4Edib
5 29,9% 751 1.859E-73
3 phalp2_283
6Eo3G
9 23,8% 1238 2.072E-69
4 phalp2_26956
4EAD1
1 22,7% 1248 2.670E-52
5 phalp2_11162
5EZIV
1 24,1% 1072 1.508E-33
6 phalp2_12237
6UhJU
1 22,7% 1137 2.859E-31
7 phalp2_4612
4jjYp
24 21,8% 1042 4.515E-30

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EsFG) rather than this protein.
PDB ID
4EsFG
Method AlphaFoldv2
Resolution 56.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50