| | | | | |

Data flow

The data architecture of the DNA Bank Network is based on the GBIF infrastructure. Since many institutions joined GBIF applying different database structures each, the installation of wrappers (BioCASe, DiGIR) has become a standard to combine different sources and integrate data easily into networks. The basic principle of GBIF as well as of the DNA Bank Network is to record all data sets only once. Stored at only one place they can be used as a linked reference for different applications.

The DNA Module is one of key components of the networks database system. To find related specimen data of a DNA sample the module sends a query to the respective specimen database via the BioCASe Provider Software (wrapper). A copy including few specimen attributes is as well stored in the DNA cache (speed up queries). By following the BioCASe and DiGIR protocol it is so possible to connect any GBIF compliant specimen database worldwide.

The DNA Module is currently used by three of the four project partners associated with their own specimen databases. The DSMZ in Braunschweig applies its own system for DNA data input.

Another BioCASe wrapper using the new DNA extension for ABCD has been installed on all three DNA Modules and the database in Braunschweig separately to offer all DNA samples and its related specimen data on the central webportal (www.dnabank-network.org). Specimen data are recalled by the same wrappers which are used by GBIF (see above).

 

Fig.1. Dataflow within the DNA Bank Network. Specimen data (green lines) are transferred by BioCASe wrappers (grey boxes) to the DNA Module and the web portal. Wrappers for specimen data are the same which are used by GBIF. DNA data (black lines) are transferred to the web index and web portal via a second BioCASe wrapper installation on the DNA Modules and the database in Braunschweig.