Key IT-results of the BioEnergieDat Project
Transcription
Key IT-results of the BioEnergieDat Project
EnviroInfo 2013: Environmental Informatics and Renewable Energies Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 Key IT-results of the BioEnergieDat Project Clemens Düpmeier1, Oliver Kusche1, Liselotte Schebek2, Andreas Ciroth3 Abstract The BioEnergieDat project had been carried out by a consortium of German research organizations for the German Ministry of Environment as one important part of the German Sustainable Biomass Strategy. The main goals of the project were the development of a comprehensive database of LCI data sets describing important Bioenergy process chains in a flexible and customizable way and the development of an associated open source software infrastructure for LCA modelling of Bioenergy technologies for German framework conditions. One of the main requirements of the project was that the provided data sets can be used in different LCA modelling situations and shouldn’t restrict users to only one usage context, i.e. performing only a certain type of LCA study. Therefore, new methodological approaches and enhancements to the underlying data set model / format had also to be developed and implemented to fulfil these requirements. Both, the tool environment developed in BioEnergieDat for LCA modelling of bioenergy technologies and key IT-aspects to support the new modelling approaches are described in further detail. 1. Introduction The increased use of Bioenergy technology is an important part of the German Climate Policy. Therefore, within the German Sustainable Biomass Strategy, the assessment of existing and novel Bioenergy technologies based on a life cycle assessment approach has been acknowledged as important contribution to verify that the new technologies meet underlying policy goals. To carry out such assessment studies the “need of an adequate database” of biomass related LCI data sets had been identified as one focus of the funding program of the Ministry of Environment (BMU) in the last years. In this context the project BioEnergieDat had been carried out by a consortium of German research organizations from 2010 – 2012 (Schebek 2010). The main goals of the project were the development of a comprehensive database of LCI data sets describing important bioenergy process chains in a flexible and customizable way and the development of an associated open source software infrastructure for LCA modelling for German framework conditions. One of the main requirements of the project was that the provided data sets can be used in different LCA modelling situations and shouldn’t restrict users to only one usage context, i.e. performing only a certain type of LCA study. Therefore, the technical and methodological concepts behind the BioEnergieDat database are based on the idea of a methodological core that specifies fundamental modelling information and additional information chunks (deltas) which can be switched on (or off) if needed for certain modelling situations. The methodological core ensures that necessary basic information for all applications of the data set for a broad variety of life cycle approaches is provided by all data sets. The ‘deltas’ define additional information needed only for specific usage perspectives (Ciroth 2011b). 1 Karlsruhe Institute of Technology (KIT), Institute for Applied Computer Science (IAI), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany, {clemens.duepmeier | oliver.kusche}@kit.edu 2 Technische Universität Darmstadt, Institut IWAR, Fachbereich Industrielle Stoffkreisläufe, Petersonstraße 13, 64287 Darmstadt, Germany, l.schebek@iwar.tu-darmstadt.de 3 GreenDelta, Müllerstrasse 135, 13349 Berlin, Germany, ciroth@greendelta.com Flexibility in data modelling was also needed to ensure that the described process chains don’t limit modelling to only one specific technical implementation of the underlying process. Therefore, wherever possible the unit processes and process chains were implemented as more generic processes that are fully parameterized and can be easily adapted to describe more specific technical implementations of the underlying process by tuning the parameterization of the data sets in the process chain. To support these new modelling concepts and a seamless workflow for users to edit, manage und use the BioEnergieDat data sets for LCA studies in a collaborative manner the BioEnergieDat project also developed an Open source software infrastructure consisting of an enhanced version of an Open Source LCA modelling tool and a service oriented LCA database software for accessing data sets via the Internet right from within the LCA tool. The BioEnergieDat web portal complements these tools with supplementary online accessible documentation and direct data access via the web browser. BioEnergieDat data sets and tools are provided to the general public at the BioEnergieDat portal www.bioenergiedat.de since April 2013. Time to summarize key IT-concepts and lessons learnt so far. 2. The BioEnergieDat IT-infrastructure The basic IT-infrastructure (see Figure 1) built for the BioEnergieDat project consists of a central web portal (http://www.bioenergiedat.de) as main access point to the results of the project, a service oriented LCA database application for the management of the BioEnergieDat data sets based on soda4LCA and an optimized version of the Open Source LCA modelling tool openLCA, which understands the complete data semantics and methodology behind the BioEnergieDat data sets. Figure 1 IT-infrastructure of the BioEnergieDat web portal soda4LCA (service oriented database application for LCA) is a Java-based web application developed by the Institute of Applied Computer Sciene (IAI) of the Karlsruhe Institute of Technology (KIT) that exposes most of its functionality through a RESTful service API which can be used by LCA tools and other software to access and exchange LCA related data sets with an soda4LCA instance via the Internet (soda4LCA 2011, Düpmeier 2011). Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 Its advanced data management capabilities, like version management, management of different physical and logical data stocks and data set and data stock release management, makes it an ideal fit for BioEnergieDat production and development database management [Düpmeier 2012]. soda4LCA allows it also to exchange data sets and synchronize complete data stocks between different soda4LCA instances, a functionality which is very important for the upcoming ILCD data network of the EU (Wolf 2010, Kusche 2012). Because the BioEnergieDat project and also the ILCD network partners of the EU will use soda4LCA for data management the BioEnergieDat database is a good candidate for a future node of the ILCD data network which could specialize on energy related data sets in the German and / or European context. openLCA (see Figure 2) developed by GreenDelta GmbH is a freely available Open Source LCA modelling tool for professional Life Cycle Assessment and Footprint modelling which supports all common modelling options, like parameterization and complex mathematical models for unit processes, describing complex process and product models graphically as connected process chains, supporting different kind of allocations and system expansions and even uncertainty calculations. Calculation capabilities include LCI and LCIA result calculations for different common LCIA methods and visualization options of the results, like Sankey diagrams to visualize hotspots. Because openLCA is based on the Eclipse rich client application platform it can be easily extended by writing new Eclipse plugins for the platform (openLCA 2011). Figure 2 openLCA LCA modelling tool showing the process chain of a biogas block power station All parts of the BioEnergieDat IT-infrastructure are highly integrated. Data sets and even complete product systems modelled within one instance of the openLCA tool can directly be stored within the central Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 database system over the Internet. Another user can just open its local openLCA software instance, search for and import every data set or even a complete product system directly from within the tool with a few clicks. A modified and enhanced data exchange format based on the International Life Cycle Data (ILCD) format used by openLCA and the database system ensures that every BioEnergieDat modelling aspect is transparently saved in the database and won’t get lost in between exchange actions. Users can directly browse, review and download data sets with their favourite web browser via a data navigation portlet integrated into the central web portal without the need to open an external LCA tool. Data sets within the database can also refer to external documentation files which are managed in the web portal in an integrated document management system. Hypertext linking between data sets and their document-based documentation then allows direct navigation from data sets to associated external documentation, and web pages within the portal can group related data sets and documentation together in a single web page. Figure 3 Data navigation portlet within the BioEnergieDat portal showing metadata of a process data set Both, the openLCA tool and the LCA data set browsing and navigation portlet in the web portal (see Figure 3) use the REST API of soda4LCA for accessing the database (Düpmeier 2011, Ciroth 2011a). The REST API uses the ILCD format to transfer complete LCA data sets between communication partners. The choice of data format is very important here for several reasons discussed in the next chapter. Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 3. The importance of enhancements to the data exchange format The flexibility of BioEnergieDat data sets depends largely on the fact that modern LCA tools support the usage of variables, formulas and the parameterization of unit process data sets to build more complex and more flexible mathematical models of data sets. When such parameterized data sets are combined into a bigger process model (product system) some of the parameters on unit data set level can be overridden in context of the product system to fine tune certain aspects of the unit processes within the modelled product system. Other Boolean parameters can just be used to switch certain functionalities, like the use of substitution, the use of a special form of allocation or a scenario calculating the impact of the data set for 2020 on or off in the product system. Figure 4 shows as an example the mathematical model and parameterization of a biogas block power station modelled in BioEnergieDat which has a parameter “gutschriftth” to switch the crediting for co-production of heat in a concrete realization of the power station on (value of parameter set to 1) or off (value of parameter set to 0 = default). Figure 4 Mathematical model and parameterization (as shown in openLCA) of a process data set describing a biogas block power station Because more traditional LCA data exchange formats, like EcoSpold1, don’t support parameterized data sets or even the notion of formulas, much of the flexibility of the BioEnergieDat data sets will get lost, if they are either exported or imported using one of these older formats. The complex mathematical model of the data set will then fall back to just numbers which are pre calculated within the data set when the variables and parameters in the formulas are all substituted by their default values. Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 The XML-based International Life Cycle Data (ILCD) data exchange format (Europe 2011) for LCA data was the first data exchange format developed for LCA which fully supports complex mathematical unit process data models and parameterization of data sets. Because of this, the service oriented database system soda4LCA and also the openLCA tool use the ILCD format to exchange data sets. Other LCA tools, like GaBi, support the ILCD format too. But neither older LCA formats nor ILCD allow it in their current forms to exchange complete product systems between tools. In fact it’s possible to calculate the LCI or LCIA results for a complete product system and exchange these as aggregated process data sets or result data sets in EcoSpold1 or ILCD. But current formats are lacking the possibility to attach machine readable information about the underlying process chain to an aggregated process data set so that the process chain can be reconstructed if needed in the importing tool. This was unsatisfactorily for the BioEnergieDat project in several aspects. First, if a parameterized product model is only transferred to another tool as aggregated result data set, it can only be used and viewed as a black box. This means that the user of this dataset cannot take advantage of the fact, that this product model is parameterized. If the whole process chain would be available to him he could adapt the parameterization to his special modelling situation and just recalculate the result data set so that it better fits to his application scenario. Another disadvantage is that aggregated processes as black box often don’t specify how they are modelled internally. If the underlying process chain is unavailable such data sets are lacking transparency, especially for reviewers of the data set. But transparency and flexibility of data sets for users were very important requirements for the BioEnergieDat project. <pm:productModel name="BHKW"> <!-- nodes (processes) --> <pm:nodes> <!-- each node has a unique id --> <pm:process id="p1" uuid="0001" name="BHKW"> <!-- parameterization --> <pm:parameter name="efficiency" formula="overallEfficiency * 0.6"/> <pm:parameter name="otherParameter" formula="1000"/> </pm:process> <pm:process id="p2" uuid="0002" name="TractorUsage"/> <pm:process id="p3" uuid="0003" name="Lubricate, at farm"/> <pm:process id="p4" uuid="0004" name="GrassCropping"/> <pm:process id="p5" uuid="0005" name="CornCropping"/> <pm:process id="p6" uuid="0006" name="BiogasPlant"/> </pm:nodes> <!—connectors between nodes --> <pm:connections> <!-- each connector has a unique id --> <pm:connector id="c1" origin="p2"> <!— mapping between flows of nodes --> <pm:product uuid="F0000" name="TractorUsage"> <pm:consumedBy processId="p4" flowId="F0000"/> </pm:product> </pm:connector> <pm:connector id="c2" origin="p2"> <pm:product uuid="F0000" name="TractorUsage"> <pm:consumedBy processId="p5" flowId="F0000"/> </pm:product> </pm:connector> <!-- some more connector descriptions here for whole model --> </pm:connections> </pm:productModel> Figure 5 Extension to the ILCD format describing the process chain of a product model Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 Therefore, an extension to the ILCD format was developed and implemented in soda4LCA and openLCA in context of the BioEnergieDat project (see Figure 5) which supports the exchange of aggregated process data sets and complete product models including a formal description of the underlying process chain so that the process chain models including their parameterization could be easily reconstructed in the receiving instance of the openLCA tool. Because the ILCD format has foreseen generic extension points throughout the format where foreign namespace extensions could be integrated into data sets without breaking the format, this extension was easily implemented conform to the existing ILCD format version using a separate namespace. Such enhancements can then be reconsidered for integration in the ILCD format itself when a new version will be created sometime in future. The XML pseudo source code in Figure 5 shows an abbreviated and simplified version of such a process chain description as extension to an ILCD format process data set. As illustrated in the pseudo code a description of a process chain <pm:productModel> consists of a list of nodes <pm:process> which represent the LCI processes (unit processes or other aggregated processes) contained in the model and a list of connectors <pm:connector> describing the connections between these nodes. Because processes may be parameterized a node description can contain a parameter assignment <pm:parameter> which overrides the default value of the corresponding process parameter within the context of the higher-level product model. The value part of such a process parameter assignment could be a formula containing variables defined in the mathematical section of the product model itself (not shown in the above example), i.e. to illustrate parameter assignment the parameter “efficiency” has been assigned the value “overallEfficiency * 0.6” which is a formula containing the product model variable “overallEfficiency”. Connectors describe the flow of products between processes in a product model. The “origin” attribute of a <pm:connector> tag defines the producer, i.e. where a product originates. The product, that flows, is identified by the “uuid” attribute of the <pm:product> tag wrapped by the <pm:connector> element which contains the uuid of the corresponding ILCD flow data set of type “product flow”. Lastly, the <pm:consumedBy> tag within the <pm:product> tag defines the process data set which consumes the product (consumer). Note, that for multi-output processes, more than one <pm:product> tags could be wrapped in one <pm:connector> tag. 4. Summary and Outlook Within the BioEnergieDat project a comprehensive database of flexible and customizable LCI data sets describing important bioenergy process chains has been developed. Flexibility and adaptability of the data sets are largely due to using more complex mathematical models and parameterization in the modelling of the unit processes. Such unit processes can be chained together to more complex parameterized product systems which are also highly customizable. But users can only take advantage of this flexibility if the used tool chain fully supports parameterized unit processes with complex mathematical models and the transparent exchange of complete parameterized product models in a seamless manner. Therefore, a second main result of the BioEnergieDat project is an Open Source IT-infrastructure consisting of a service oriented Internet accessible LCA database for storing and providing data sets and an enhanced version of the openLCA LCA modelling software that fully supports the exchange of parameterized data sets and complete product systems between database and tool instances directly over the Internet without losing any modelling information. For this goal the used ILCD data exchange format had also to be enhanced and expanded. Luckily this was possible in a compatible manner through the use of extension points within the format. In the future it would be nice if the IT-results of the BioEnergieDat project could be feedback into the used LCA standards like the ILCD format and will also find their way into other tools. Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5 In order to ensure continuous provision, updating and support of the BioEnergieDat database, a host organization was implemented which takes over the operational responsibilities for the database and will be active in future revisions as well as extensions of the database and the IT-infrastructure. The organization will be complemented by an advisory committee, consisting of BioEnergieDat partners as well as additional scientific organizations in the field of LCA and Bioenergy. Data provision will be based on collaboration with internationally leading data providers as to development of data sets and services. Bibliography Schebek, L., Ciroth, A.; Düpmeier, C.; Eltrop, L.; Simon, S.; Viebahn, P.; Wagner, H.-J.; Zschunke, T. (2010): Life Cycle Inventories for bioenergy: A German database for policy support. EcoBalance 2010: Towards & Beyond 2020; Tokio, 2010 Wolf, M.A.; Chomkhamsri, K.; Düpmeier, C.; Kusche, O.; Pant, R.; Pennington, D. (2010): International reference life cycle data system (ILCD) data network. International Conference on EcoBalance, Tokyo, Japan, November 9-12, 2010 European Commission - Joint Research Centre (JRC) - Institute for Environment and Sustainability, Karlsruhe Institute of Technology (KIT) - Institute for Applied Computer Science (2011): International Reference Life Cycle Data System (ILCD) data format and editor. (Accessible via http://lct.jrc.ec.europa.eu) Ecoinvent Centre (2011): Ecoinvent LCI Database, Version 2.2, http://www.ecoinvent.org/database/ Düpmeier, C.; Kusche, O. (2011): Concept and implementation of a service API for the remote access to life cycle data assessment databases. Pillmann, W. [Hrsg.] Innovations in Sharing Environmental Observations and Information: EnviroInfo 2011 - 25th Internat.Conf.on Environmental Informatics, Ispra, I, October 5-7, 2011 Aachen: Shaker Verl. 2011 Part 2, p.729-736 ISBN 978-3-8440-0451-9 Ciroth, A.; Srocka, M.; Düpmeier, C.; Kusche, O.; Schebek, L. (2011a) - Interacting with the ILCD data network from an LCA software - the example of openLCA. Life Cycle Management Conf.2011 (LCM 2011), Berlin, August 28-31, 2011 Ciroth, A., Schebek, L. (2011b): Perspectives in Life Cycle Inventory Modeling, presentation, LCA XI Chicago, October 3-6 2011. Düpmeier, C., Kusche, O. (2012): Managing LCI Data from Different Workgroups within the same Instance of an LCA Database, EnviroInfo 2012, Dessau, 29. – 31. August 201226th Internat.Conf.on Informatics for Environmental Protection, Dessau, D, October 29-31, 2012 Dessau Kusche, O.; Düpmeier, C. (2012): Creating LCA Data Networks with soda4LCA. Pillmann, EnviroInfo 2012: 26th Internat.Conf.on Informatics for Environ-mental Protection, Dessau, D, October 29-31, 2012 Dessau OpenLCA (2011): OpenLCA project homepage, http://www.openlca.org soda4LCA Open Source Project (2011): http://www.iai.fzk.de/www-extern/soda4lca Copyright 2013 Shaker Verlag, Aachen, ISBN: 978-3-8440-1676-5