Data flow |
The data architecture of the DNA Bank Network is based on the
GBIF
infrastructure. Since many institutions joined
GBIF
applying different database structures each, the installation of
wrappers (BioCASe,
DiGIR)
has become a standard to combine different sources and integrate
data easily into networks. The basic principle of
GBIF
as well as of the DNA Bank Network is to record all data sets
only once. Stored at only one place they can be used as a linked
reference for different applications.
The
DNA Module is one of
key components of the networks database system. To find related
specimen data of a DNA sample the module sends a query to the
respective specimen database via the BioCASe
Provider Software (wrapper). A copy including few specimen
attributes is as well stored in the DNA cache (speed up
queries). By following the BioCASe
and DiGIR protocol it is so possible to connect any
GBIF
compliant specimen database worldwide.
The DNA Module is currently used by three of the four project
partners associated with their own specimen databases. The DSMZ
in Braunschweig applies its own system for DNA data input.
Another
BioCASe
wrapper using the new
DNA extension for ABCD
has been installed on all three DNA Modules and the database in
Braunschweig separately to offer all DNA samples and its related
specimen data on the central webportal
(www.dnabank-network.org). Specimen data are recalled by the
same wrappers which are used by
GBIF
(see above).
|
|
 |
|
Fig.1. Dataflow within the DNA Bank Network. Specimen
data (green lines) are transferred
by BioCASe wrappers (grey boxes) to
the DNA Module and the web portal. Wrappers for specimen data
are the same which are used by GBIF. DNA data (black lines)
are transferred to the web index and web portal via a second
BioCASe wrapper installation on the DNA Modules and the database
in Braunschweig. |