Protein

Protein accession
7AuK7 [EnVhog]
Representative
7AFQR
Source
EnVhog (cluster: phalp2_3994)
Protein name
7AuK7
Lysin probability
62%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MRFSRSEWRWIPLADDVIYVPVAPSAKGFTATLVKEASGAARSASAAMDKEFARGGRESGRSAARGVDAGLSRGDIGVNSVKARMAKLAASMRSIGRRAGSEGAAGINAGLRDVDYHKVGDEHGRRYGHGFVSGMRSGLKGMIATVALVNGSITAMVRHVGTAATVIGWASRMMRHFAGSVLVGATAMQALGAAGLGKLAGWLKVVASLASRLARDVARAAAAVLVFSAAVRTLGRINRISRIIGALTVGLAALIGIASTAAPALAALGAAIATVGSAAGGIGIAGVSALGAALGSLKAGLSGVGDAFKEMGTSAGGGAAKAVDNTKNIERAQRNLAKSVEAEKDAQKDVARARDDARKKLRDLDLQLRGAALSEKDAQLSLREARRDLAKGGFESGIDRERAVLAVQDAELRLAEVQRDNRDLSKEAADTRAKGVNKADEVVEAQKRLRDATQETKDAQEELNEARNPKDSGTSGGVDKQAEALAKLSANARNFVETAMGIKPAWDAVQRGGQDTLFAGLPERLQGLADTWLPRIGGAMNTVNAGFNRGAKGVADWLTSAQGIPIMSSWLRTSSTMAAQAGTALGGIVPGLAAIAAGAGEAFAPMVAGATDGAKALSDTLIQAQKSGRIKTFFVDAFNQVKQIIQNVTAVMGPLISAFMQLGRTSAAGLAPGLKSMGDAIRQATPGLVQMAERLMPALGQALTNLSPIIPGIVHAFSPWATILSVMAPHIATVMSHLGPMAPILLTLAVTVKAITMAMTLYNAVMAVASVAQGVFYAATGRSTAGLRGNMIALAAHRVATIAGTVATTLFGAALAVATSPITWIVVAIGALVAALVLFFTKTEVGRKIWAAAWGGIKAAVGAVWSWLQNTVWPAMMTAFRAVGAVAMWLWRTIITPVWNGIKAAIGVAWTIIKAYFTAWSMIFRNVIGPVVMWLWRSIITPAFNAIKAVIGVVWNVVKFIFNGWMLLFRVVGAVVMWFWRTITMPAFRAIGAVISWAWNTLIKPAWDAFKRGLDVIGNAVKWLWNSIIKPTWDALGKGIAWVVDNIIMPAWDAFKRGIETIKTVFETVVDGIGKAWDKLKSFVAKPINFVIGTVWNKGLLPAWNTIAKFLPGLNPMQPLAEVKFADGGPVPMGRGAQRGKDSVHALMMPDEHVWNVRDVRRAGGHSAMYRMRSMVESGRPFTWTPGGVGSASEGGPLPRFEKGGAVSGGDKLAPLSGEGGLKPIAVMMRRLIFRMWKQITDIGGYRQDAYPEHPSGRALDIMVPNAKTGDEVNGWVHANSKKFPIEHTIWQQRWRPQGNPKGEPMEDRGSPTQNHMDHVHSWWKEQNVDPNKVPGGLVGYDGLSDADKLNIIKKKISEILDKAINPIKQGMASIIGSAPPEWLNIPGKALDITKTKAIDTAFSLAGKLGDKLADAYDKAKDITKVVTNVVTAPIRSIGGLFRDQGGYLPKGLSLVRNETGKPEAVLNWEQLDNVKAMMEAFRAVFSGQSPEEIGTRQQKLSDEMTARHENEVKGLSGQQLANVQKRHDAERKALEDSAARAEGYRAGAETIRDTPKTAVESMAKDTADFFGFGRLFEAVSGLIQPQDATTSSTAGAAGTSGATDPVYGDGSGVESGEEPSTTKMPDFEPEGADRYPWAIADQAKKMNLPKRAAIIGVATGLVESGDPMKMWANNAVPESLKFPHDAVGSDHDSIGLFQQRSAGWGSVADRMDPHRSAASFYNALLKVPGWETMDMGAAAQAVQRSAFPGKYAQKMGRATELVDKFGIYDQGGWLKPGGLALNLSKRPEPILNGAQWASIDAMLDALPSASEFKSVADLGASVMRSTGRAPSDPDDGQSTAGRAGGPLVYVENQYTHDPDEAARKTGREVRRATRSEQLVGGW
Physico‐chemical
properties
protein length:1883 AA
molecular weight:199934,3 Da
isoelectric point:9,76
hydropathy:-0,05
Representative Protein Details
Accession
7AFQR
Protein name
7AFQR
Sequence length
1863 AA
Molecular weight
197527,62450 Da
Isoelectric point
9,62392
Sequence
MAEDTVYIPLAPSAKGFMATVVKEASGAARAGSAAMEKEFARGGRESGRSAARGVDDGLSRGNIGVNSVKARMAKLAAQMRGIGRTAGHQGALGINDGLNHIDYTRVGDEHGRSYGRGFVRGVRNGLVGIAATFGLVNAGVRGTVRHIGTIATATMWASRIMRGFATQVMAGAVAMQLLAGQGLAKLAGWLKTVAFLAGRLARDVARATAAVLVLSAAVRTLGRVMRVTRVIGMLTVGLAALIGLASTAAPALAALSAAIVTLGSAAGGIAIAGLSALGATIAGLKVGLMGVGDAFKQMGTSGAGSAAKVVDNTKDIARAERGLTKAVEAEKDAQEDVSKARDDARKKLRDLDLQLRGAALSERDAQLSLREARADLAKGGFETGTERERAVLAVQEAELRLAEVQRDNNDLAKDAASTRRKGVEGSDEVVAAQERLRDATEATRDAQEALADARQPKDTGASAAADKQAEAMAKLSTNARSFVESAMGVKPAWDAIQRGGQDTLFAGLAQRLPQLADTWLPRLGAAINTVNGGFNTGARSVVDWMNSAQGIPIVSSWLRTSSGMAAQAGTALGALAPGLASIAAGAGEAFAPMVAGATEGAKSLSNMLVQAQQSGRIKQYFTDAFNQVKTVIQNVTAVVGPLWAAFMRLGQISASGLAPGMRSVGAAITQATPGLVQMAERLMPALGQALTNLAPIIPGIVQAFSPWATILAVMAPHIATVMSHLGPMAPLLLTLAVTVKAITMAMTLYNAVMAVASVAQGVFFAATGRSTAGLQGNMIALAAHRVAMLAGAVASGIFAGALALATSPITWIIVAIGALVAGLVWFFTKTELGQKIWTTVWNSIKSAVQSVWEFLKPVFQWIGNAFGTVVGFIRDHWRLILPIIMGPLGLLISVVSKYWTQIKTAFSVAFQAIGAVVMWLWRNVVTPAFNGIKMVIGVAWNVIKFFFGLWVGLFRNVIGPVVMWLWNTVIGPAMRGIGGVIGWVWNTLIKPAWDSFRRSLDILGEAFKFLWNNVIKPTWDALGAGIRWVVDNIITPAWDALKSGLSAVGGFFDTIVTGIGNAWDKIKSFVAKPINFVLGTVWNKGLLPAWNTIAGFLPGLNPMKPVAEVAFKDGGPVPMGSGAKRGKDSVHALMMPDEHVWDVRDVRRAGGHGAMYRMRNMVDSGRPFTWTPGGLSPVSEGGPLPRFEKGGAVAAGQKLSPMPGEGGLQAIGQLMRRIIFKLWPKIKDIGGYRQDNFDEHPSGRALDVMVGSDKKLGDQVNAFAHANNPKFPLQHSIWQQAMWYPPKMRREPMGDRGSPTQNHMDHPHLWWKPQNVNPNVVPEGLVTDGFGGPSTAEMLNIVKKKISEIIDKALNPIKQGLTSIVGSPPPEWLGIPPKIFDITKTKAIETAFNLAAKLGDKLKGAYDAAKKVTSIVTNVVKQPFKAIGGLFRDQGGYLPKGLSLVRNETGKPEAVLNWDQLTTVKDMMEAFRAVFSGQSPEAASAAQQRISDEMTARHEQEIKGLKGRQLDEAQKRHDMERKALEDSTARIEGYRAGATAIRDTPLVAAESMAKDTADFFGFGKIFDTIAGLIPRPGDAASAGTAGGAGTSALSTTTTPSATDPVYGDGTTIEQGQTPSTTVMPDLNHEYDPKGGAEQWRPMAKEAMKRVGFDYNNTAQVDAMIKQIESESGGNPGIVQGVQDVNSGGNEAVGLLQIIPGTFATHRDPSLPDDRRNPMANMVASLRYYKSRYGMDLTTTWGHGHGYDSGGWLNPGLTMAVNKTLKPEAVLTAGQWASIDSMLESLPSAAEFKSVADLGAAAMRSSGRMPNEDEDAQSSSGHRDAPLVWVENQYTHDPDEAALKTGREVRRATRSEQLVGGWG
Other Proteins in cluster: phalp2_3994
Total (incl. this protein): 49 Avg length: 1786,6 Avg pI: 8,58

Protein ID Length (AA) pI
7AFQR 1863 9,62392
1Yf8R 1914 6,00728
1dMtS 1816 9,23988
4YHgR 1815 8,29968
6FC4Z 1687 9,35792
6Qpjo 1390 9,26786
72ukV 1691 9,43226
7Aebk 1877 9,65545
7Am7j 1875 9,58840
7dhSx 1830 9,73597
7gRaR 1915 6,15665
7jGYT 1704 9,84853
7jHbo 1847 9,69580
7mBce 1937 6,59323
7pb0M 1705 5,40109
7qGY7 1739 9,40357
7tGoh 1680 9,50111
7tGpR 1607 9,50588
7vaZ0 1717 5,68313
7vnjh 1749 5,59559
7w0kW 1919 6,11891
7w0kn 1911 6,23679
7xjmf 1793 5,94765
7yeUa 1956 5,81545
7zSjR 1875 9,59091
8Itva 1704 9,69432
8Itvf 1847 9,68961
8Itvi 1847 9,66589
8Itvp 1830 9,60490
8MGSJ 1840 9,75918
8MHYs 1872 9,68562
8MI2F 1873 9,71108
8MIY4 1839 9,69264
8MJnK 1966 6,26777
8MQ28 1829 9,74809
8MmBC 1840 9,68375
8MpF3 1847 9,68420
A0A7T0M0T0 1863 9,62392
A0A2P1N2Q9 1830 9,60490
A0A160DCU0 1829 9,74809
A0A1B3AZ86 1863 9,62934
A0A345L307 1824 9,75434
A0A4Y6EFQ2 1829 9,61599
A0A514TZV4 1830 9,62366
A0A649V4K6 1828 9,60503
A0A8T8IZ31 1158 5,02459
G9FHZ7 1180 4,92450
A0AAE8XA19 1678 9,38758
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39181
3bZcI
8 35,2% 1237 6.551E-233
2 phalp2_18324
5inLv
5 27,2% 1706 5.412E-194
3 phalp2_3995
7AGUw
47 32,8% 1284 1.589E-186
4 phalp2_33467
7m6oK
34 25,4% 1713 6.404E-106
5 phalp2_12266
72uge
6 23,5% 1906 1.656E-103
6 phalp2_24330
3Pu9x
29 26,3% 1216 1.943E-92
7 phalp2_21847
4rAYx
3 25,1% 1298 4.898E-90
8 phalp2_36905
7yda0
31 23,9% 1343 5.318E-82
9 phalp2_36243
7zfxz
7 24,8% 1747 4.395E-80
10 phalp2_9474
HzvZ
51 24,2% 1581 7.282E-71

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7AFQR) rather than this protein.
PDB ID
7AFQR
Method AlphaFoldv2
Resolution 50.97
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50