EDIT is an EU funded Network of Excellence program with the goal of reducing the fragmentation of biological taxonomic research and coordinating an effort to facilitate taxonomic research using the World Wide Web. Within EDIT there has been much discussion about this can be achieved and I want to explain what my NHM colleagues and I have been doing as part of this project.
Integrating diverse sources of digital information is a major challenge for biodiversity informatics. Not only are we faced with numerous disparate data providers, each with their own specific user communities, but also the information we are interested in is heterogeneous and often specific to each community. Coupled with this we know that our resources are limited so decisions on how to achieve this must scale to the needs of many research communities. Also the technical abilities of these communities is limited so we have to do this in a way that respects this fact. Coming up with a single "standard" specification or approach for biodiversity data integration that solves all these problems might at best be described as difficult, and based on the products of past efforts is arguably futile. What unites us is a common goal to share our data to as wide an audience as possible, and it is generally agreed that in one form or another, we should do this on the Web. So how do we achieve this?
Over the past few months my NHM colleagues and I have been tackling this in three ways:
In the context of EDIT we have created a template CMS ("Scratchpads") that we have crudely adapted to the needs of biological taxonomists. Using the Drupal CMS we have inserted modules handling various data types (e.g. bibliographic literature, images etc), and are offering them as templates for communities of taxonomists to build content. Users obtain sites through an electronic registration procedure. To date we have 8 such sites, one of which (http://www.milichiidae.info/) is being used by an EDIT exemplar group. These are the taxonomic groups fortunate enough to receive core EDIT funding. Functionally these sites have very significant limitations. However ever they do allow communities to gain an initial web presence and proven popular with those that use them, though decidedly less popular with two expemplar groups that don’t.
In the context of EDIT we have mounted one such system at the NHM in recent months. Phasmid Species File is an extensive database of biosystematic data (taxon names, classification, images literature, ecological and geographic data, keys etc) on stick insects and their relatives. It is based on the Orthoptera Species File model and is the first of several systems developed by David Eades and colleagues that will be mounted at the NHM. The next will be on cockroaches and will be mounted at the NHM in two months time. Phasmid Species File (http://beach.nhm.ac.uk/) is currently only visible to researchers inside the NHM domain but will be accessible to others shortly. In the coming year the external collaborators (lead by Paul Brock will be adding a further 20,000 references and 3GB of images to this database.