Informace o publikaci

SoluProt: prediction of soluble protein expression in Escherichia coli

Autoři

HON Jiří MARUSIAK Martin MARTINEK Tomas KUNKA Antonín ZENDULKA Jaroslav BEDNÁŘ David DAMBORSKÝ Jiří

Rok publikování 2021
Druh Článek v odborném periodiku
Časopis / Zdroj Bioinformatics
Fakulta / Pracoviště MU

Přírodovědecká fakulta

Citace
www https://academic.oup.com/bioinformatics/article/37/1/23/6070085
Doi http://dx.doi.org/10.1093/bioinformatics/btaa1102
Klíčová slova SOLUBILITY; WEBSERVER; TOPOLOGY; ACCURATE
Přiložené soubory
Popis Motivation: Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins. Results: A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info