Mining metabolites: extracting the yeast metabolome from the literature

Chikashi Nobata, Paul D. Dobson, Syed A. Iqbal, Pedro Mendes, Jun’ichi Tsujii, Douglas B. Kell and Sophia Ananiadou

Metabolomics. 2010

Go to article

Abstract
Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies are able to supply the macromolecular parts list, the metabolites are less easily assembled. Most metabolites are known and reported through the scientific literature, rather than through large-scale experimental surveys. Thus it is important to recover them from the literature.

Here we present a novel tool to automatically identify metabolite names in the literature, and associate structures where possible, to define the reported yeast metabolome. With ten-fold cross validation on a manually annotated corpus, our recognition tool generates an f-score of 78.49 (precision of 83.02) and demonstrates greater suitability in identifying metabolite names than other existing recognition tools for general chemical molecules.

The metabolite recognition tool has been applied to the literature covering an important model organism, the yeast Saccharomyces cerevisiae, to define its reported metabolome. By coupling to ChemSpider, a major chemical database, we have identified structures for much of the reported metabolome and, where structure identification fails, been able to suggest extensions to ChemSpider. Our manually annotated gold-standard data on 296 abstracts are available as supplementary materials. Metabolite names and, where appropriate, structures are also available as supplementary materials.

DIY Light Box

In Sheffield we're interested in algae for producing biofuels. We sometimes grow them in conical flasks stood on light boxes, like the ones used in hospitals to look at x-rays. I was astonished at how expensive (and dim) they are. As ever, Instructables to the rescue! I flippin' love Instructables.


IKEA do a good line in plain, semi-transparent plastic tubs that diffuse light nicely. I balked at the price of their bright LED arrays, although perhaps the light would be less spiky than the fluorescent tubes I went for. We'll see. Here's what I used...
The Samla box doesn't go with the lid (but sort of fits) - I picked it because it is more shallow than the Snålis box. I lined it with foil, fitted a plug to the light and put it in. Because the lid doesn't quite fit you needn't even drill a hole in your nice box, although if things start getting a bit hot this might be a good idea along the sides.

The light is more uniform than it looks in the photo above, but more lights can be daisy-chained onto the same power supply if you need it to be brighter and widely spread. For £15 I'm quite pleased with it.

Now to save the planet