An Introduction to ISO 15926
Transcription
An Introduction to ISO 15926
ELEMENT 8 WORKFORCE & TRAINING An Introduction to ISO 15926 November 2011 ™ An Introduction to ISO 15926 November 2011 ACKNOWLEDGMENTS Fiatech has produced this book at the request of and for the benefit of its members, and in cooperation with the POSC Caesar Association. We wish to acknowledge the contributions of those individuals whose efforts and input have significantly influenced this publication. Notably, this book was produced by a project team who voluntarily carried out the objective to provide an introductory document for the Capitol Projects Industry. This team consisted of: Manoj Dharwadkar, Director, Product Management, Bentley Systems; Glen Worrall, Solutions Architect, Bentley Systems; Onno Paap, Data Integration Manager, Fluor; Faith Junghans, President, Faithful Engineering; Chris Schwander, Consulting Engineer, DuPont; and Alan Johnston, President, MIMOSA. In addition, Ian Glendinning, Glenco, President, GlencoIS; Hans Teijgeler, Fluor (retired); and Matthew West, Information Junction; provided considerable technical expertise and content during the writing process. I wish to acknowledge the special efforts of Gordon Rachar, WorleyParsons who served as a consultant to the project and was the primary author of this publication. Additionally, Daril Bentley provided copyediting services and Dallas Peters, Dallas Peters Design, assisted in graphic design production. Among the staff, Fiatech Project Manager, Sharon Bickford, contributed significantly to the development of this publication and her efforts are appreciated. Publication of this report is made possible through the contributions by Fiatech members. Raymond E. Topping, PE Director, Fiatech PREFACE Fiatech (www.fiatech.org) is an industry-led consortium, housed at The University of Texas at Austin that provides global leadership in identifying and accelerating the development, demonstration and deployment of emerging technologies and innovative practices to deliver the highest business value throughout the life cycle of all types of capital projects. Fiatech is member-driven, comprised of approximately 85 companies and partner organizations that include owners and operators from the industrial, power, and retail markets; leading providers of engineering, design, and construction services; software and equipment suppliers and technology providers, research institutes and universities. Fiatech is a clearinghouse for innovative ideas where members can quickly learn about new processes, methods, and materials; but it also collectively funds and executes development, demonstration and deployment projects. Project teams are formed to identify and accelerate the adoption of technologies and systems; demonstration projects are conducted to validate and perfect new approaches or methods; and teams are formed to aid and facilitate the deployment of those breakthrough initiatives that have been identified. This publication was developed at the request of Fiatech members and in cooperation with the POSC Caesar Association, for the benefit of the entire capital projects industry. TABLE OF CONTENTS INTRODUCTION1 Introduction to ISO 15926 1 ISO 15926 Is Like a Babel Fish 3 About This Book 6 Why We Need ISO 15926 8 The History of ISO 15926 8 How Does ISO 15926 Work? 8 Current Events in the World of ISO 15926 9 Getting Started with ISO 15926 10 Appendices11 CHAPTER 1: WHY WE NEED ISO 15926 The Role of Context in Information Exchange The Value of the ISO 15926 Dictionary A Confederation of Applications Public Extensible Dictionary An Example Project How ISO 15926 Makes Life Easier in the Near Term How ISO 15926 Makes Life Easier in the Long Term CHAPTER 2: HISTORY OF ISO 15926 12 13 18 22 27 29 31 36 40 How We Use the Internet to Find Information 41 How We Know and Understand Things 42 Open Ways to Store and Exchange Data 44 Markup Languages 46 Evolution of Product Information Standards 48 Summary67 CHAPTER 3: HOW DOES ISO 15926 WORK? Integration Versus Interoperability The Parts of ISO 15926 Are Like the Parts of Human Speech Putting It All Together Levels of Compliance with ISO 15926 Versus Ambiguity Wouldn’t It Be Better if We Agreed on Common Definitions? Why Don’t We Use Industry Standard Definitions? Would It Help If I Told You How I Was Using the Data? If We Use the Semantic Web, Couldn’t We Automate More of This? Shouldn’t We Add Query, Workflow, and Security to the Semantic Web? CHAPTER 4: CURRENT EVENTS IN THE WORLD OF ISO 15926 Using ISO 15926 in Production Development of Enabling Infrastructure Continued Development of Standards 70 70 74 91 93 95 97 99 101 103 105 106 110 113 CHAPTER 5: GETTING STARTED WITH ISO 15926 Key Preparation Implementing ISO 15926 Join Fiatech or the POSC Caesar Association APPENDIX A: THE PARTS OF ISO 15926 ISO 15926 APPENDIX B: COMPLIANCE WITH ISO 15926 Semantic Precision Representation Technology Referencing Technology Interface Technology Industrial Standardization Subject Matter Scope Metadata Scope Functional Capability APPENDIX C: EXAMPLES OF DATABASE MAPPING 116 117 118 123 124 124 128 129 129 130 131 131 132 132 133 134 Example Set 1: Let’s Just Exchange Raw Data and Figure It Out Together 134 Example Set 2: Wouldn’t It Be Better to Agree on Common Definitions? 141 Example Set 3: Why Don’t We Use Industry Standard Definitions? 147 Example Set 4: Would It Help if I Told You How I Was Using the data? 151 EMCA156 Using the RDS/WIP 158 APPENDIX D: OTHER RESOURCES Information Modeling Resources Origin of ISO 15926 Organizations Responsible for ISO 15926 User Groups Other Organizations RDS Browsers Business Interface Definition Guides GLOSSARY OF CONCEPTS 160 160 161 161 162 163 164 164 167 Artificial Intelligence and the Semantic Web: Difference Between 167 First-order Logic, Semantics, and Reference Data 167 Ontology and Taxonomy: Difference Between 169 Integration and Interoperability: Difference Between 170 Strong Coupling, Loose Coupling, and Encapsulation 171 Abstraction172 Semantic Web Technology 173 INTRODUCTION Introduction to ISO 15926 If you are new to the concept of information interoperability, you will be forgiven for thinking that ISO 15926 is a new phenomenon. In fact, the search for easy ways to exchange and reuse engineering information stretches back to the mid 1970s—led by increasingly frustrated end users who resented the inability to transfer their information from one computer-aided design (CAD) system to another. This bit of history is especially poignant for your humble author, who at that time was just starting his career as a piping designer. Whereas large users—heavily represented by the U.S. military and aerospace organizations—were just starting to confront compatibility issues among competing CAD systems, your author had just enrolled in technical school to learn how to draft with a pencil. A few years later—while the U.S. Air Force was hosting a conference that led to the CAD drawing exchange standard known as IGES (which we introduce in Chapter 2)—the author’s idea of high technology was drafting on Mylar with (gasp!) plastic lead! In this introduction to ISO 15926, we will look at the need for information interoperability, some of the things that have been done to address interoperability issues, and some of the things we should be doing instead. We will take a brief historical look at the forebears of ISO 15926, look at the different parts that make up the standard today, and end with some of the things you can do to get started implementing the standard. The obvious question is: Why do we need ISO 15926? The short answer is: So that we can exchange and reuse complex plant and project information more easily and more cheaply. A slightly longer answer is: To mitigate the current high costs of rekeying and reformatting information to move it from one proprietary system to another. For example, take the task of designing, specifying, and purchasing a process instrument for a plant modification. Imagine how many times information has to be rekeyed after the instrument is basically designed, until it is installed and commissioned in the target plant. • After design, enter the information into the project’s engineering design system (which may be a database or spreadsheet). • For quotation, a procurement officer assembles several sets of data sheets and sends a set to each bidder. • Each bidder will have a sales engineer read the data sheets and enter some of the data values into proprietary software to make a selection, and then compose a quotation and return it to the engineering, procurement, and construction (EPC) contractor. • During the design of an instrument, the engineer will usually specify only those properties necessary for operation under process conditions. However, there are many other properties that must be known—which are dependent on the manufacturer. After vendor has been chosen, the design engineer has to enter this information manually into the engineering design system from the vendor’s quotation. • Data sheet turnover to the client will likely be something like an Excel file for each data sheet. • After receiving a load of boxes filled with CDs from the EPC contractor, the owner will review each data sheet. Critical data values will be rekeyed into an asset management system. This can take months. Introduction 1 The situation is improving. A few years ago engineers would have faxed the data sheets to the vendors, who would manually add their information and fax them back. Now they E-mail editable electronic files back and forth. And more recently, some owner-operators are trying to streamline the final handover from an EPC contractor by specifying data requirements. However, configuration costs and the lead time required speak to the complexity of the issue. What we need is a way for each participant’s software to be able to communicate complex information to the other participants without having to know in advance things such as database structure or format. If you have ever read The Hitchhiker’s Guide to the Galaxy, by Douglas Adams, you will know exactly what we need—we need a Babel fish! In the book, if you wanted to listen to Vogon poetry spoken in the original dialect you would use a Babel fish. The Babel fish would listen to the Vogon speaking, and then rearrange the syntax and translate all of the words on the fly. ISO 15926 acts like a Babel fish by acting as an interpreter between two otherwise incompatible systems. Let’s compare the process of specifying and purchasing an instrument in the previous example to doing the same thing with tools that support ISO 15926 protocols. The initial data entry is the same. • After design, enter the information into the project’s engineering design system (which may be a database or spreadsheet). However, thereafter tools written to support the ISO 15926 standard extract the relevant information automatically. • For quotation, a procurement officer will expose the Request for Quotation on his company’s public (or secured) ISO 15926 interface and then send a link to the bidders. • By connecting to the EPC contractor’s ISO 15926 interface, each vendor will pull in the relevant information for each instrument. At this point, the vendor has a choice. He can have a human sales engineer read the information and manually make decisions in the same manner we use today. However, because it is in ISO 15926 format the instrument information will be rich enough that analysis, decisions, and composition of a preliminary quotation will be able to be done by a computer program. In this case, the sales engineer will only have to review the quotation before submitting the bid to the EPC contractor. • After selecting the winning bidder, the engineer will point his engineering design system to the vendor’s ISO 15926 interface and pull in vendor-supplied information. • Data turnover to the client will simply require exposing the plant information database on the EPC contractor’s ISO 15926 interface. Introduction 2 • The owner will open the link to the engineer’s interface and import the information. You can see that if we use tools that support ISO 15926 protocols we are removing many opportunities for human error. Thus, in addition to being able to transfer information faster by removing the labor-intensive tasks the entire process will be more reliable. (At the same time, the routine parts of the sales engineer’s job are removed leaving more time for more innovative tasks and talking to customers.) One of the reasons ISO 15926 will make it easier to share information is that it is worldwide. If everyone uses a common standard, a number of things happen. • We can exchange information without having to know anything about one another’s data storage configuration. • Information can be transferred directly from machine to machine without having to be rekeyed. • The information can be transferred with high fidelity. We will not need human beings to review every data value to make sure nothing is lost or added. Everyone will still have their own data stores (perhaps in a proprietary format, perhaps not) but will employ a “Babel fish” (an ISO 15926 interface) when we exchange information with others. This will enable a number of interesting scenarios. • A consortium of EPC contractors will be able to collaborate on designing a plant, each using its chosen engineering design system with proprietary work processes. They will be able to share information without having to know anything about one another’s data storage format beforehand. • During design, vendor and EPC contractor software will be able to connect to each other— and thus passing information back and forth will be much easier. • Information turnover from EPC contractor to owner will be a non-issue. Owners will be able to receive the plant data by connecting to the EPC contractor’s “Babel fish” (the ISO 15926 interface) and then store it in their own format. • After information turnover, any of the owner’s computer systems will be able to use the information. For instance, a plant operations system will be able to access the pieces of information it needed. A plant maintenance system will be able to access just the pieces it needs. Each application will take the pieces it needs and ignore the rest. To help you imagine what it will be like when ISO 15926 is mature, let’s look at three metaphors. ISO 15926 Is Like a Babel Fish We have already mentioned The Hitchhiker’s Guide to the Galaxy. In it, Arthur Dent hitches a ride on a passing Vogon spaceship just moments before the spaceship blows Earth to smithereens to make way for an interstellar bypass. When he is discovered by a Vogon, Arthur is given a “Babel fish” to put in his ear. The Babel fish enables Arthur to understand what the Vogon is saying. From the book: The Babel fish forms a symbiotic relationship with the person who carries it in his ear. The Babel fish feeds on the brain wave energy from around its host. It Introduction 3 absorbs the unconscious mental frequencies from this energy, more or less, as food. Its excrement is a so-called telepathic matrix of conscious thought frequencies combined with the nerve signals from the speech centers of the brain which supplied them, which is picked up by the mind of the host. Basically, then, if you stick a Babel fish in your ear, you can instantly understand anything said to you in any language. Bringing this metaphor into the field of plant design, ISO 15926 is similar to a Babel fish in that it translates the descriptions of plant objects from one company’s database to that of another. The important thing to note here is that the meaning of all the terms is maintained. You do not have to rely on the context of the information to know what individual terms mean. The metaphor of the Babel fish is a pretty good one, but there is a slight difference. The Babel fish translates thoughts directly from Vogon into English in one operation. ISO 15926 will do this in two steps using a middle, neutral layer. If you used ISO 15926 to translate Vogon to English, it would first translate Vogon into intermediate standard descriptions and then from these standard descriptions into English. Using a middle layer of standard descriptions is an important step we will examine in more detail, but briefly the middle layer is what makes it all work. Each organization’s “Babel fish” (which we will call an ISO 15926 interface from now on) only has to understand these standard descriptions, not the descriptions in the proprietary operations of every business partner. ISO 15926 Is Like HTML In case you don’t know what Hypertext Markup Language (HTML) is, you can rest assured that you are a part of a very large majority. HTML is the common language of the World Wide Web. Every web page you have seen is written with some variant of it. If everyone involved in plant design, construction, and operations were to use ISO 15926 to exchange information about plant objects, we would have an equivalent to the HTML experience—but between machines. For instance, if you want to look at the web page of a pump manufacturer you don’t need to know anything beyond the web site address of the company. When your browser connects to the web site, it assumes that what it finds will be encoded in HTML. Of course, it will be—if the manufacturer wants to get any business through the web page—because HTML is the standard format of the World Wide Web. In addition, it does not matter which browser you use. Internet Explorer, Firefox, Safari, Opera, and Netscape are all written to understand HTML. Imagine the hassle if you first had to contact the pump manufacturer and ask for the encoding format, and then instruct your IT folks to write a translator program, before you could access the web site? Of course, you would not do it. And of course the pump manufacturer would not make a web page like this in the first place because no one else would do it either. This metaphor does a good job of describing what the average user will have to know about ISO 15926 as well. In the same way that most people who use the World Wide Web do not need to know about HTML, most users of ISO 15926 will not have to know about it to exchange information. When ISO 15926 is mature, it will simply be built into the software we will all use. Engineers will be able to exchange information much more easily than they do now, and very few of them will need to know that the standard exists. On the other hand, many web sites today are actually written in HTML. This metaphor implies Introduction 4 that a large proportion of plant information will actually be stored in an ISO 15926-compliant data structure. Although this is certainly possible, it will probably not be the case. Most companies will maintain their plant information in the proprietary format they currently use. Instead, they will write a public interface to render the information in ISO 15926 format when a business partner asks for it. In this regard, ISO 15926 is more like the case today whereby a database is exposed to the World Wide Web. When a user queries the database (via her web browser), a program dynamically searches the database and renders the results in HTML “on the fly.” There is another similarity that may appeal to your geeky side. HTML and ISO 15926 are standards that were developed along similar trajectories. Although the underlying infrastructure that enabled the Web started to form in the last quarter of the twentieth century, most of us only discovered it 20 years ago. At first there was some controversy as we speculated about its possibilities. Some of the ideas caught on and some didn’t, but over time “surfing the web” just became part of our lives. Now many of our young people cannot imagine life without it. Similarly, many people are just now finding out about ISO 15926—even though the standards that led to it first started to appear in the mid twentieth century. As with any new technology, there is some controversy and speculation on its future. However, the demand for the interoperability of information is strong and over time ISO 15926 will work its way into the way we do business. ISO 15926 Is Like English on Your Cell Phone If you and I were not close together but needed to talk about something, we might decide to use our cellular telephones. There is a great deal of complexity hidden from the view of the casual user. One of us could be in a digital roaming area while the other is in an analog area. One, or both, might be in a vehicle traveling at high speed down a highway—moving from one cellular area to another very quickly. We could be on different continents. All this is handled automatically by the software and circuitry that makes up the cellular telephone network. None of this would do either of us any good, however, if you spoke Mandarin and I spoke Cantonese. Mandarin and Cantonese are dialects within the same language family, but are far enough apart that it is difficult for native speakers of one to understand native speakers of the other. Thus, to communicate with cell phones we would have to agree to speak the same language. In that China has one of the highest populations of English speakers of any country in the world, it is quite possible that we both speak English. To talk to you using English, I would first translate the words and sentence structure from Cantonese to English. When you heard me speak, you would translate the words and sentence structure from English to Mandarin and (hopefully) understand what I said. Introduction 5 Use a mental dictionary to translate Cantonese to English Hello Hello Use a mental dictionary to translate English to Mandarin In this metaphor, ISO 15926 takes the place of English. ISO 15926 is a common “language” for exchanging information about capital projects. It would not matter how either of us stored our plant information, at the interface we would “translate” it to and from ISO 15926. This is quite a good metaphor in that each of us would continue to think and work in our native language (me, Cantonese; you, Mandarin) but would encode/decode the message to the common language of English more or less on the fly. Similarly, organizations that use ISO 15926 to communicate with each other can continue to use proprietary systems internally. This is a good metaphor in another way as well. The complexity of managing the call is hidden from cell phone users. You and I can contact each other by simply calling each other’s cell phone number. You don’t have to call a different number if I am away from my office. I don’t have to use a different protocol if you have a digital phone or an analog phone. You don’t have to know if I am at home or at ball game. The cellular network figures out where we are and directs our calls through the closest transmission tower. Similarly, by using ISO 15926 we will all be spared the detail of matching our own information systems to those of our business partners. A major difference is in what people will have to know about ISO 15926 in order to use it. This metaphor implies that users will have to know ISO 15926 almost as well as they know their native tongue. In fact, most people will not even have to know how to spell ISO 15926—it will simply be built into whatever computer system they are using. To extend the cell phone metaphor, it will be as if an intelligent English translator is built into both cell phones. I would speak my native Cantonese into my phone and you would hear your native Mandarin in yours. About This Book This book is intended for those who know a great deal about capital project work but not a lot about the software by which projects get done, those who know a lot about the software but not a lot about capital project work, and the poor folks in the middle who are just trying to make a living using software. This book is intended to be the first book you read after hearing about ISO 15926. Introduction 6 Although it describes some complex technology and includes many three- and four-letter acronyms, the explanations are functional (“How does this help data exchange?”) rather than procedural (“First press this button, then that one…”) If you have a background in any part of engineering projects, you will have a knowing, wry smile when we talk about our past and existing ways of dealing with data exchange. But even if you do not have such a background you will still be able to follow the discussion. Managers Managers; engineering managers, construction managers, and plant managers generally know a great deal about engineering, construction, and plant operations; but typically not a great deal about the computer systems that make their operations run. As such, they may view the claims of ardent proponents of ISO 15926 with a certain amount of suspicion. This introduction to ISO 15926 reviews the practical issues with information exchange today (to show where money is being wasted), describes the theoretical and practical work that has been devoted to the development of ISO 15926 (to show that it is viable), and shows how ISO 15926 will make everyday tasks easier (to show where the opportunities lie). Computer Professionals Depending on their area of expertise, computer professionals may already have heard about information exchange and the Semantic Web. What they may not be aware of, however, is the business context of the capital projects industry where their computer systems are used. This book describes several situations project personnel encounter in real-world scenarios, shows traditional solutions, and contrasts them with the way ISO 15926 would be used to make their life easier. Casual Users Because of the way it can be implemented in software, when ISO 15926 is mature many users may not know it exists. But in the transition there might be something like an “ISO 15926” button to push. This book will show what happens under the hood when it is pushed. As well, users will see the growing opportunity for discipline professionals to get involved and will encounter a few ideas for doing so. How This Book Is Organized This book is divided into five main chapters and several appendices. • • • • • • • Why We Need ISO 15926 The History of ISO 15926 How Does ISO 15926 Work? Current Events in the World of ISO 15926 Getting Started with ISO 15926 Appendices Glossary Introduction 7 Why We Need ISO 15926 Traditional ways of exchanging and reusing information all involve some variant of having someone read one document and then decide which parts to transcribe to a different document. This is true whether we are moving a single value from one data sheet to another or mapping entire databases together. Even with modern computer technology, we still rely on highly skilled people to interpret information and to discern which data values are important. For this we rely on a surprising amount of context. For instance, when we look at a data sheet we rely on visual cues and our experience. When we map databases together, we deal with attribute names that are usually inadequate in themselves to make fine distinctions between similar terms. To understand what our data means, we need more information than is contained in the data itself. This affects us whenever we attempt to exchange information. An obvious large information exchange is the handover phase of a project. Even though a fully electronic handover is quite possible today, the documents that are handed over are often still formatted for humans. Currently, when we hand over information we still have to think about how it is to be accomplished. This adds friction, making many exchanges uneconomic. When ISO 15926 is mature, the mechanics of information exchange will fade into the background—allowing more exchanges to take place economically. The History of ISO 15926 The reason information exchange, as envisioned by ISO 15926, is possible now is because of the convergences of four areas of study. The study of how we know things, ontology, gives us techniques for embedding meaning into data. The Semantic Web gives us the tools to do so. Open ways of encoding data make it practical to do so. But the most important is the evolution of product information. With the advent of CAD software in the late 1970s, we have continually played the role of Oliver Twist: “Please sir! Can I have more?” At first, all we wanted was to be able to exchange CAD drawings without having to redraw them and got CAD exchange standards such as the Initial Graphics Exchange Standard (IGES). Then we wanted the product information that is behind the drawings in many manufacturing systems and got a standard called STEP, which we will become very familiar with in the coming chapters. Later, as we tried to apply the principles of STEP to long-life process plants STEP itself evolved into ISO 15926. The core of ISO 15926, the data model (called Part 2), and the dictionary (called Part 4) have been developed in the heat of battle on several large, world-class projects. Dictionary-level information exchange is now routine. We are poised for an even greater increase in productivity with the development of techniques for embedding meaning into our information exchanges. How Does ISO 15926 Work? “Your computer can talk to my computer and neither of us has to know anything about each other’s systems beforehand” sounds somewhat magical. Readers are excused for being somewhat skeptical. But when you consider what actually needs to be exchanged the claim of ISO 15926 becomes more believable. The engineers, fabricators, constructors, and plant operators Introduction 8 who need to exchange information all work with the same physical objects—just in different ways. Each job function needs different information about the same object. For instance, engineers need to know the size envelope and functional performance; fabricators need to know the material and fabrication methods; constructors need to know the mass, envelope, and delivery; and operators need to know how to maintain it and where the spare parts are. There is some overlap, but because the computer system each uses is optimized for particular job functions it is not surprising that the computer systems cannot understand each other even if they actually contain the same information. What ISO 15926 does is to capture a view of everyone’s data so that each can pull out what they need. Imagine the situation of two people, each with their own native language, together learning a new foreign language. They will first learn the names of objects they are familiar with, essentially building a new dictionary. Then they will learn to master a new way of expressing ideas, which is learning new rules of grammar. While learning the new language, they will practice writing it on some type of media (such as a whiteboard, paper, or, for the sake of argument, a stone sculpture). Eventually, if they master the new language well enough and others seek them out they may need a way to regulate access. The parts of ISO 15926 are like the parts of human speech. Part 2 is the data model equivalent to the rules of grammar, and Part 4 is the reference library, equivalent to the dictionary. When any two people use the same rules of grammar and use the same dictionary, they can communicate freely. Similarly, when two machines encode an information exchange using Parts 2 and 4 they can communicate freely. This is the core of ISO 15926. Part 7 contains what are called templates, and is like a phrase book that allows new users to construct a meaningful sentence a bit sooner. Part 8 is like the writing media, and Part 9 is like a web site or the postal service. Current Events in the World of ISO 15926 ISO 15926 is being used around the world more every year. Parts of ISO 15926 are being used in real projects, development of infrastructure to support ISO 15926 information exchange is ongoing, the standards themselves are continuing to be developed, and educational materials are being created. Plant Operations The Norwegian Continental Shelf is one of the major oil-producing regions in the world. The harsh environment has led to a strong need to automate information exchanges to and from offshore facilities. Integrated Operations in the High North (IOHN) is a multi-year initiative to implement this, combining the efforts of all operators in the area. ISO 15926 is part of the data layer. The safe start-up, operation, and maintenance of an operating plant is often limited by a lack of early information about plant equipment and controls. A large bitumen upgrader, planned for startup in Canada in 2015, is part of a pilot project to develop strategies to improve information handover. The Operations and Maintenance SIG (O&M SIG) of Fiatech and the POSC Caesar Association (PCA) are reviewing all relevant information standards, including ISO 15926, and will use them to develop best work practices for information creation and handover. Introduction 9 Development of Enabling Infrastructure One aspect of information exchanges, as envisioned by ISO 15926, is that the definitions of terminology be publicly accessible in real time during the exchange. This will allow the participants to validate terms, which will remove ambiguity and reduce costs. Until now this has not been possible for production work due to lack of facilities. The Joint Operational Reference Data (JORD) project is developing the infrastructure and funding to bring a public reference data service (RDS) into operation. It builds on several years of experience in operating an RDS that supported standards development work. We have used dictionary-level information exchange for years using ISO 15926 Part 4. This works well in high-value situations that can afford the brute-force approach of understanding equivalent information objects on each side of the transaction and mapping them together. But with the pace of business increasing we need to be able to exchange information easier without having to do so much preparation. To do this we will have to use the full specification of ISO 15926. The iRING user group is developing tools, available free of charge with an open-source license, to implement ISO 15926 parts 7, 8, and 9. The tools will more or less clip on to the side of commercial software to allow information exchange between any similarly equipped software. Continued Development of Standards The Geometry Special Interest Group (Geometry SIG) of Fiatech and PCA is developing a reference library for the geometrical shapes used in 3D modeling software. Current tools for exchanging graphics strip out all of the meaning. (This means we can exchange the surfaces of plant objects but not their identity.) The geometrical reference library will be issued as ISO 15926 Part 3, which will allow us to exchange graphics as easily as we exchange other data. One barrier to the wide use of ISO 15926 is that best practices for using Part 7 templates are not yet common knowledge. The Instrument Special Interest Group (Instrument SIG) of Fiatech and PCA is creating recommended templates for describing industrial instrumentation. Because the management of instrumentation is crucial for the safe and profitable control of a plant, the Instrument SIG is working closely with the O&M SIG. Many information standards exist today for various niches in heavy industry. Two such standards have recently been harmonized with ISO 15926. Under the guidance of Fiatech’s Engineered Equipment Life Cycle Application Tools project (EELCAT), what is known as the AEX XML schema (developed by another Fiatech project, Automated Equipment Information Exchange) and the Hydraulic Institute’s Standard HI 50.7-2010 for Electronic Data Exchange for Pumping Equipment have been brought together. HI 50.7 is a dictionary of data fields from many wellknown standards along with the AEX schema. The EELCAT project has recently examined the two standards and has demonstrated how they can be further harmonized with ISO 15926. Getting Started with ISO 15926 With any new technology, there is uncertainty about what can be realistically accomplished. Implementing ISO 15926 is easily within reach of most organizations, probably without having to hire additional staff. At the introductory level, implementing ISO 15926 is similar to traditional methods of exporting information from one application and importing it into another. Introduction 10 Open-source and commercial tools exist to assist in both dictionary mapping at the introductory level and the more advanced use of ISO 15926 Parts 7, 8, and 9. Appendices The preceding fulfills the promise of an introduction to ISO 15926. For those who wish to know a bit more, we have included several appendices: • Appendix A: The Parts of ISO 15926 • Appendix B: Compliance with ISO 15926 • Appendix C: Examples of Database Mapping • Appendix D: Other Resources • Glossary of Concepts: Contains extended explanations of key concepts, including key terminology Introduction 11 CHAPTER 1: WHY WE NEED ISO 15926 It is difficult to overestimate the value of being able to exchange information with anyone without fear of transcription error, while maintaining the precise meanings of all terms, even though you know nothing at all about your partner’s internal work processes and methods of data storage. When information is transferred correctly, the quality and reliability of your end product increases. When you know for sure that information will be transferred correctly, you can move faster because you do not have to check for transcription errors. A significant part of designing, building, and operating capital assets involves transferring and accessing information about those assets. In its oft-cited 2004 report Cost Analysis of Inadequate Interoperability in the U.S. Capital Facilities Industry, the National Institute of Standards and Technology (NIST) reported claims of some companies that 40 percent of engineering time was spent finding and verifying information. Overall, the study showed that the lack of interoperability among computer-aided design (CAD), engineering, and other software systems costs the American capital projects industry more than $15 billion every year.1 ISO 15926 makes exchanging information between applications easier in two ways. First, when your information exchanges go beyond manually rekeying data or using point-to-point custom mapping (which we discuss shortly) you need to create some type of data dictionary containing definitions of all objects in your facility—along with their attributes. If your organization is sufficiently large, this requires a significant effort. Instead of developing your own dictionary, ISO 15926 offers a dictionary2 that you can use free of charge. Because the ISO 15926 dictionary has been developed by a great number of people from many industries in many parts of the world, there is a high probability that the definitions you need are already there. Second, when we exchange information today we need people to manage the transactions because we mistakenly assume a consistently applied background of rules and jargon of the trade. We call these background rules “context,” and without context we are lost. We need to be able to apply these rules consistently in order to correctly match terminology in a universe of very similar terms. If we use the full specification of ISO 15926, we can make information exchanges easier by essentially building the context of the information into the information itself. When we model the information, we can capture the precise meaning of each term and embed it with the term. This makes it easier for machines to talk directly to each other because implied meanings that participants are “just expected to know” are eliminated (or at least minimized). These two issues are intertwined. To fully understand the value of simply being able to use an existing data dictionary, instead of having to develop one from scratch, we will first discuss the sources of information exchange costs. This will leads to a discussion of “context” and 1. The report is available online. Search for “Cost Analysis of Inadequate Interoperability in the U.S. Capital Facilities Industry.” 2. Truth in advertising: Data exchange in the manner envisioned by ISO 15926 requires more than just a dictionary. We are simply starting with a description of how the dictionary component works as a way of easing into the topic. Chapter 1 12 what it means in information exchanges. After understanding the role of context, we will return to the way we manage information exchange today. We will discus the way small, ad-hoc manual exchanges can develop into organization-wide networks of linked applications. The value of the simple existence of ISO 15926 will be apparent when we discuss the issues of managing this organization-wide network. We conclude with some examples of information flows today and show how ISO 15926 will make them easier. The Role of Context in Information Exchange Information exchange today requires the skills of experienced people because we rely to a great degree on the context in which we find information to understand the precise meaning of the information3. Figure 1.1 shows someone with a bright idea. To achieve some business result, he has to pass the information to someone else. If he just sends the information without context, he is just throwing it all in a bag and hoping the person on the other end can figure it out. Fig. 1.1 Putting information in a bag. When we encode information using the full specification of ISO 15926 (including all of the components we will discuss later), we include enough context that other ISO 15926–enabled tools will clearly understand what we mean (Figure 1.2). We no longer have to know anything about the partners with whom we exchange information. 3. The “information in a bag” diagrams are courtesy of Robin Benjamins. Chapter 1 13 Fig 1.2 Putting information in an ISO 15926 bag. When your humble author started his career in plant design, computers were not commonly used by designers and engineers. Drafting was done by pencil on paper. Specifications were written with a typewriter. When information had to be transferred from one document to another, the only way to do it was for a human to read the original document, find the value to be transferred, and then write it by hand on the target document. If the target document was something like a specification, it was usually given to a secretary for typing. Transferring data to the owner at the end of the project was cumbersome but conceptually simple: you would take all of the specifications and drawings, sort them into some logical order, perhaps bind them into books, and move them to the new location. Data turnover to the client at the end of a large project was similar to the last scene of the movie Raiders of the Lost Ark. In that scene a forklift carried a wooden box down a long aisle of identical wooden boxes and put it on one of the piles, something like that depicted in Figure 1.3. In the real world, it sometimes took years for the owner to review all of the boxes and categorize the binders of information. Fig 1.3 Data handover the old way. Chapter 1 14 No one really liked this (as in “I really liked that piece of chocolate cake, may I have another!”), but that was just the way it was. Everyone just built such manual tasks into their plans. Things started to change when computers made their way into the design office. Binders of data sheets gave way to spreadsheets burned onto CDs and then DVDs, graphite pencils gave way to electronic pencils (i.e., CAD), and rolls of Mylar drawings gave way to CAD files burned onto more DVDs. These days, when the owner receives the material it is physically easier to sort through boxes filled with DVDs than to sort through boxes filled with paper but the documents still have to be reviewed individually. On a moderately large project, it can still take a crew of people many months to index it all and bring it into the owner’s system—and until a document has been received by the document management system it is essentially invisible. There have been improvements, but things have not changed much conceptually. In our work processes for plant design or plant operations, a large proportion of an engineer’s activities still involve finding information and manually transferring it from one document to another. For instance, after the engineer chooses, say, an instrument from a manufacturer’s catalog the only practical way to record the information about the instrument is to read the manufacturer’s data, interpret it to decide which of the data values to transcribe, and then figure out where to put the data values in the engineering design system. Some of the operations are simple transcription, such as transferring a model number from one spot to another. However, some involve calculation—such as changing from one unit of measurement to another. Other operations involve interpretation ranging from ignoring the data value altogether to decisions involving judgment, such as orientation or handedness. The work is done on a computer, but often the only real difference is that engineers do the typing themselves instead of giving it to their secretaries. To reiterate, in most cases today information exchange still requires the skills of experienced people because we rely on the context in which we find information to understand the precise meaning of the information. Earlier we defined “context” as “the rules and jargon of our trade.” Design engineers learn these rules and jargon by a combination of education and experience. For an example of how we rely on context, suppose that you have to transfer information from one data sheet to another and you see this: 1034 This means nothing. So, you “back up” and look for more context. Pressure 1034 Okay, so you know a bit more—but still nothing usable. So, you back up again. Pressure: 1034 Pressure Units: kPa Now you expect other values to be in SI units, but you still really do not know what is going on. So, you back up some more. Chapter 1 15 Seal Flush Pressure: 1034 Pressure Units: kPa You still have questions, so you continue to back up. Tag No: P-101 Service: Chemical Injection to D-101 Seal Flush Pressure: 1034 Pressure Units: kPa Now you are getting a clearer picture. When you “back all the way up” and read the entire data sheet you can finally put the initial value, 1034, into context. Centrifugal Pump Data Sheet Client: ABC Chemical Company Tag No: P-101 Service: Chemical Injection to D-101 Seal Flush Pressure: 1034 Pressure Units: kPa Even with such a trivial example, without context we are lost. Experienced engineers may exclaim at this point “But this is the way design is done. You have to consider the entire data sheet to understand a particular term!” That is the point: If we want to use machines to transfer information to other machines, we want the information to be self-describing. Chapter 1 16 The Data Sheet Problem Consider a more realistic example of selecting and specifying a centrifugal pump. After selecting the proper make and model, the mechanical engineer has the pleasant task of figuring out which data values to transfer from the manufacturer’s data sheet to the owner’s data sheet. Figure 1.4 shows a small portion of both data sheets. Different Name CONDITIONS OF SERVICE, EACH PUMP Normal Flow Rated Flow Disch. Press. kPag Temperature C Suct. Press. Max. kPag Specific Gravity Differential Pressure kPag Vapor Pressure kPag Differential Head m Viscosity NPSH Available m Corr./Erosion Caused By Corrosion Allowence mm Cold Start Temp. C @Sp. Gr. & Viscosity CONSTRUCTION Nozzles Size Rating Facings Suction Discharge Different Name Same Definition Rated kPag Location Same Definition Same Definition Different Definition Convert to Imperial Convert to Metric OPERATING CONDITIONS Rated Max. Normal Capacity (gpm) Suction (psig) Discharge (psig) Diff. Press. (psig) Diff. Head (ft) Power (hp) Different Definition Min. Different Definition @ Minimum S. G. Rated Max. Normal Min. Op. Time (hr/yr) NPSH Avail. (ft) System Design Stand Alone Parallel Operation Series Operation with Service Pressure Min/Max (psig) . Service Continuous Starts per Day System Control Method Speed Flow Level Temp. Pressure Pipe Friction Resistance Only Fig 1.4 Comparison of two data sheets. The most notable difference is that one data sheet expects metric units, whereas the other expects Imperial units. But beyond that the data sheets are organized differently: the data are grouped differently and the groups are arranged differently. These two excerpts only have eight data spots in common. However, looking closer we can see that of the eight spots only three are obviously identical: Discharge Pressure Rated Suction Pressure Differential Pressure Chapter 1 17 The rest require some interpretation: Metric Data Sheet Imperial Data Sheet Comments Normal Flow Capacity: Normal Probably the same Rated Flow Capacity: Rated Probably the same Max. kPag Suction: Max. No data entry spot Differential Head Diff. Head: Rated Possibly the same NPSH Available NPSH Avail.: Rated Possibly the same Most mechanical engineers with a little experience will have no trouble figuring everything out because they have the rest of the project documents, and perhaps have experience with the pump manufacturer. Again, that is the point. We need humans to guide even what seem on the surface simple transcription tasks because our work practices depend to a great degree on context. However, when machines start talking directly to other machines the problem of implied meaning based on context becomes a serious barrier. The reason information exchange works these days is because we exchange entire sets of data (for instance, a complete data sheet) in which the context is preserved. The disadvantage, however, is precisely that: we have to exchange entire sets of data and we must have humans interpret them item by item. What we really want is to be able to let machines exchange information directly without having to rely on context to retain the correct meaning. What we really need is a “cut-and-paste” tool for plant information. We want to be able to just “cut it from that database over there” and “paste it to this database over here.” However, it’s not that simple. The first and most obvious reason we cannot use a simple cut-and-paste tool is that the data values we want to transfer seldom map to the same (x,y) coordinates on any two data sheets. In order to know which data values to transfer, we have to first know enough about the data sheets and underlying databases to know which values are required, which values can be ignored, and whether or not something is missing. The second reason is, as we have seen in this example, we cannot rely on the name of the attributes. Even when attributes in two databases have the same name, we cannot be sure there are no subtle differences in their meaning. We need a human expert to confirm that the attributes match. These actions are trivial if you have the right context. We have many thousands of design engineers doing this all day, every day, and generally they are good at it. However, we rely so much on context to convey meaning that we cannot trust machines to make the right decisions on their own. Finally, this is all carried out in the context of the same written language. What would happen if the project were engineered in English but the client wanted all of his data sheets turned over in Russian? The Value of the ISO 15926 Dictionary When we want to exchange information between two software applications, the traditional way is to map the respective databases together. We preserve the context by manually examining the terms in each database to determine which are equivalent. Directly mapping two applications together is often the easiest short-term solution. Chapter 1 18 Let’s say you are an engineer working for a construction company planning the installation of some piping spools. Part of your job is to plan the hydrostatic testing of all piping systems before commissioning the plant. So, along with a great deal of other information you need to know the design temperatures and pressures of all lines in the project. Fortunately, the design engineer has given you a “line list”—a list of all line numbers and their critical attributes. Your first job, then, is to simply get the design engineer’s line list into a form you are used to dealing with; that is, your company’s construction management software. You might decide to bite the bullet and just rekey the information manually. However, this gets old quickly and after a few dozen lines you will be wondering why you cannot just download the data from the engineer’s design software into your construction software automatically. After all, the design engineer did not (in this day and age) have a secretary type the line list onto a piece of paper with a typewriter and fax it to you. It almost certainly exists in an electronic database somewhere. It is in the design engineer’s best interest to make sure the construction contractor (you) has the correct information. It is easy to imagine that your IT folks and their IT folks get together to give you a connection over the Internet to just “suck in the data.” In addition, because the “lexical scope” of a line list is only a few terms it is easy to imagine that the data will “go right in.” (I mean, computers are pretty intelligent these days, aren’t they?) Artificial intelligence is the study of how to make real computers act like the ones in the movies.4 In a perfect world, everyone developing an application for some part of plant design would use the same column names and ensure that the meanings of the content of the columns were the same. (In database jargon, the “schemas” would be the same.) Figure 1.5 shows what this would look like. Engineering Application Line Table tag_no dia len temp press ifc tag_no dia len temp press ifc Line Table Construction Application Fig 1.5 A perfect world. 4. Port 2000 Newsletter: The Information Technology Newsletter for Port Washington Educators, December 1996. Chapter 1 19 Of course, in the real world things seldom work out so nicely. Even in the rather small scope of a line list (where there the lexical scope is only a dozen or so terms) there is usually ambiguity. For instance, in Figure 1.6 two columns have names that do not match. One of them, tag_no in the engineering application, is most probably equivalent to the id column in the construction application. But what about the column cl in the construction application? For that matter, what does temp really mean? Could it mean “Temporary”? It is possible. It is not unheard of to have temporary lines erected for commissioning, which are then removed after a plant has been put in production. In addition, if you are being a bit paranoid (and if you have to sign off on the actual hydrostatic test pressure you better be at least a bit paranoid) what do you make of press? It almost certainly means “pressure,” but what type of pressure? • Operating pressure? • Maximum allowable working pressure? • Design pressure? Engineering Application Line Table tag_no ? dia ? ? id len dia temp press ifc temp press ifc ? cl Line Table Construction Application Fig 1.6 Real-life situation. If you have to specify the hydrostatic test pressure, you have to know for sure. The reality is that we are inferring everything, based on context. The only solution is to start digging. Fortunately, the lexical scope of a line list is only a couple dozen terms. To determine if temp means “temperature” or “temporary” you will have to look at a few rows of data. If the data values are y/n, or 0/1, it points to “temporary.” However, if the data values are numbers in the range of, say, –50 to 1,500 it probably means “temperature.” The column name press will take a bit longer. You will have to look for clues in other documents and use your engineering judgment to determine which type of pressure it is. You may have to contact the design engineer. What looked fairly simple just a while ago is looking a bit more complex. Chapter 1 20 There is always an easy solution to every human problem— neat, plausible, and wrong.5 If you only have to import the data once, one option is to simply copy the information column by column. However, if you have to import information from the engineering application several times you will want to have someone configure a custom mapping application. In Figure 1.7, someone has examined the data coming from the engineering application and determined which columns matched those of your construction application. In software development jargon, they have “mapped the databases together.” The red box represents software that uses the map to transfer information from the design application to the construction application. (An interesting side note is that you may not have to actually transfer very much. For instance, if you knew that both applications expect pipe sizes based on ASME B36.10M you would only have to transfer numerical digits—such as 6 or 12.) Engineering Application Line Table Custom Map tag_no dia len temp press ifc tag_no id dia dia len cl temp temp press press ifc ifc id dia cl temp press ifc Line Table Construction Application Fig 1.7 A custom solution. So far in this simple example, writing the mapping software would be relatively easy. But the real world is usually a bit more complex. One common problem, from the point of view of a lonely construction engineer on a remote project a long way from help, is that the data is often mangled across many fields. Take, for example, a “simple” line number such as what might be represented by the column tag_no. As a construction engineer, you are likely to look at a line number as “a label representing the name of this run of pipe, from here to there.” You expect the line number to be one string of text, something like: Line Number 150-HCL-250-1C200-35 However, the design engineer needs to be able to sort and filter on the pieces. Her idea of a line number probably looks more like this: 5. H. L. Mencken, 1917 (Variously attributed to Albert Einstein, Winston Churchill and others.) Chapter 1 21 Line Number Area Fluid Code Nom Dia Service Class Insulation Thk 150 HCL 250 1C200 35 In fact, it may not even be that nice. It could be something like this: Area WBS Service Class Nom Dia Fluid Code Testing Insulation Thk 150 7395-132 1C200 250 HCL 3 35 Many engineering procurement and construction (EPC) contractors have proprietary work processes for plant design supported with custom software. It is quite possible that the database the design engineer used to create your line list has the columns in a different order, and with other columns mixed in. The line list sent to you could have been created with reportwriting software that had been configured to select just the right columns and put them in just the correct order just for your project. This added complication may not reveal itself until you get “under the hood” and look at the database dump. If you are the one transferring information from one application to another, it is up to you to figure out all the pieces and to put Humpty Dumpty back together for your construction application. It is a good thing that the lexical scope of a line list is only a few dozen terms! A Confederation of Applications If linking the first two applications together works out well, you may be tempted to link in another one. Figure 1.8 shows how the engineering application might be mapped to a procurement application with a second custom database map. Engineering Application Line Table Custom Map tag_no dia len temp press ifc tag_no id dia dia len cl temp temp press press ifc ifc id dia cl temp press ifc Line Table Construction Application Custom Map tag_no tag tag dia dia len cl temp press ifc ifc dia len code price ifc Line Table Procurement Application Fig 1.8 A confederation of applications. This is the beginning of what might be called a “confederation of applications” whereby many applications get information from each other, with custom-built database maps between each Chapter 1 22 pair of applications. Conceivably, you could link all of the construction applications for your project together in this manner to form an enterprise-wide solution (Figure 1.9). Fig 1.9 An enterprise-wide solution. If you managed to do all of this before your construction project was finished, you would have at least a short period of relative bliss. Instead of having to manually export a list from one application, massage it a bit, perhaps do a unit conversion or two, and then import it into another application all you have to do is push a button. The advantage, in terms of lower labor costs and higher reliability, is obvious. In addition, if mapping databases together point to point on a project is good why not connect all applications in your company together the same way? Of course, many organizations do this. Although there may not be a theoretical limit to the number of applications that can be connected in this way, there is a practical limit. Every piece of software you link together is subject to change. For the most part, this is good. As hardware becomes more powerful, we can make software do more things. However, the natural cost of this is that eventually the database structure of an application will change. If you use point-to-point maps to link databases directly together, the link will break when either of the databases changes. If you have many applications daisy-chained together, they may all go down like a row of dominos. The practical reality of this approach is short lifetimes, high maintenance, and having to relearn an application’s data structure whenever something goes wrong. How do we deal with this? As it turns out, collectively, we have tried a number of options. The first approach is some variation of “If it ain’t broke, don’t fix it!” Here an organization deliberately stops upgrading software even when new versions are released. The immediate Chapter 1 23 benefit is stability of information exchanges. Personnel can get on with whatever their business is without having to continually relearn old skills. Instead of having to spend all of their time fixing mapping problems, the IT group can do more value-added work. Done right, this can work well for quite a while. However, the longer an organization resists change the more work processes will become entrenched—tied ever stronger to old software and aging engineers. Change becomes unthinkable because of the knock-on effects that will ripple through the organization. Eventually support from the software vendors stops and the pool of trained users shrinks. The trick here is to know when to tightly adhere to this principle and when to let go. “You’ve got to know when to hold ’em, know when to fold ’em…”6 A second approach to managing a large confederation of applications linked with custom point-to-point maps is to live with the maintenance and try to make it as organized and efficient as possible. Under this approach, as applications in the confederation evolve and the maps break someone will be there to fix the important ones. We might even argue that this is a good thing because it is a natural way to cull applications that are no longer useful. There may be some truth to that, but we must also acknowledge that the pace of innovation and change is speeding up. Business success, in some measure, goes to those organizations that make good decisions quickly. Good decisions require good information, and good information choked by inefficient information transfer is bad information. As with the first approach, we have developed good ways to handle maintenance. There is an entire business segment devoted entirely to what we call middleware to make creating and maintaining custom point-to-point maps easier and more reliable. Still, there will come a time when maintenance is a large cost. There is another solution, but it is a bit more work up front. Instead of making individual pointto-point maps, an organization could make a dictionary of terms for all of the applications it uses. Under this approach, we study all of the software the organization uses and decide on the meaning of every term. Then when we link two applications we do not map them directly together but map each to the standard dictionary, as indicated in Figure 1.10. 6. A line from The Gambler, a song by popular Country and Western singer Kenny Rogers. Chapter 1 24 Engineering Application Line Table Map to Common Dictionary Map from Common Dictionary tag_no dia len temp press ifc tag_no szTag dia dDia len dLen temp dTemp press dPress ifc dateIFC szTag id dDia dia dLen cl dTemp temp dPress press dateIFC ifc id dia cl temp press ifc Line Table Construction Application Fig 1.10 A common dictionary of database definitions. In this example, someone has created a common dictionary containing the meaning of the attributes of the line list. Using the dictionary, the developer of the engineering application has determined the appropriate definitions that apply to the columns in his database. Without having to change the engineering application in any way, he builds an export function that extracts the appropriate data values and labels them with names from the common dictionary instead of what the engineering application called them. Engineering App Dictionary ID Description Units tag_no szTag Tag Number of a plant object dia dDia Nominal Diameter per ASME/ANSI B36.10 len dLen Pressure Class per ASME/ANSI B16.10 temp dTemp Normal Operating Temperature degF press dPress Normal Operating Pressure psig ifc dateIFC Issued for Construction Date yyyymmdd Similarly, the developer of the construction application uses the same dictionary and determines the appropriate definitions that apply to the columns in her database. Construction App Dictionary ID Description id szTag Tag Number of a plant object dia dDia Nominal Diameter per ASME/ANSI B36.10 cl dLen Pressure Class per ASME/ANSI B16.10 Units Chapter 1 25 temp dTemp Normal Operating Temperature degF press dPress Normal Operating Pressure psig ifc dateIFC Issued for Construction Date yyyymmdd Again, without having to change the construction application in any way she builds an import function that expects to receive attributes with names from the common dictionary and maps them to the attribute names in the construction application. There are several advantages to using a common dictionary that are important to note. • The developer of the engineering application does not have to know anything at all about the construction application. Indeed, he does not even have to know it exists. • The developer of the construction application does not have to know anything at all about the engineering application—beyond, of course, knowing that it is the source of certain input data. • Both of the developers are now free to modify their respective applications in any manner, including massive changes to their data structure—as long as the output and input are mapped to the same common dictionary. • Both developers know the precise meaning of the input/output from each other’s application because they know the precise meaning of the dictionary term they are mapped to. We call this “semantic precision.” Of course, there is a cost to developing a common dictionary. Someone has to herd the cats together to decide on the meaning of each term and what to call them. This is not trivial. Many people, each with different interests and agendas, have to look at the applications in question—possibly having to make fine distinctions between very similar attributes, as well as build in some allowance for growth. In short, there is a great deal of initial work in making a common dictionary but in a large organization it is recouped whenever a new application joins the confederation. For instance, when a procurement officer needs information from the engineering application to use in a procurement application instead of examining the content of the engineering application’s database all a software developer has to do is consult the dictionary and determine the values he wants to use. Procurement App Dictionary ID Description tag szTag Tag Number of a plant object dia dDia Nominal Diameter per ASME/ANSI B36.10 len dLen Pressure Class per ASME/ANSI B16.10 ifc dateIFC Issued for Construction Date Units yyyymmdd The fact that the engineering application is exposing other data attributes is not of interest. The procurement application (Figure 1.11) will simply choose the four data items it needs and use them. Creating the common dictionary made the first mapping exercise take a bit longer, but the benefits are clear. • The second (and third, and fourth) mapping exercise is almost free. When the developer of the procurement application wanted to import line list information from the engineering Chapter 1 26 application, he simply used the appropriate definitions from the common dictionary. The developer of the engineering application might not have even known this was happening. • If the common dictionary is maintained, it is robust. If a new application in the confederation needs another type of pressure, for instance, the keepers of the dictionary can simply add it to the standard. The new applications will use the new definition, but none of the existing maps between existing applications will break. Map from Common Dictionary szTag tag dDia dia dLen len tag dia len dateIFC ifc code price ifc Line Table Procurement Application Fig 1.11 A reuse scenario. Again, there is a cost. As an organization grows in size and adds more members to its confederation of applications, managing the common dictionary will require significant time. In a large organization, this can be a full-time job. Regardless of the size of the organization that creates the common dictionary, eventually its sphere of interest will intersect the sphere of interest of another organization. For example, if two organizations—each having its own common dictionary—decide to merge, which of the two dictionaries do they keep? Or do they make a third to map between them? Likewise, if two such organizations want to collaborate on a joint venture they will have to harmonize their dictionaries. Today, there is no practical alternative to each organization maintaining its own dictionary—duplicating the efforts of all of its peers. It would be nice if there were a way for everyone to combine efforts so that all organizations use the same dictionary, at least for nonproprietary information. As it turns out, this is exactly what ISO 15926 offers with its specification for a reference data library (RDL). The following is a high-level view of how it will work. Public Extensible Dictionary Figure 1.12 shows the same engineering application and construction application where somebody decides that they need to include a new attribute, Widget Color, in the line list. (Widget Color may have been in the engineering application all along, or it could have been added to support a particular project for which Widget Color was important.) Note that the developer of the engineering application has decided to use the attribute name wcolor to describe the color of the widgets, whereas the developer of the construction application wants to use the attribute name color. This is okay because they both intend to use a database map for the information exchange and therefore do not need to use the same attribute name. Chapter 1 27 Engineering Application Line Table Map to Common Dictionary Map from Common Dictionary tag_no dia len temp press ifc tag_no szTag dia dDia len dLen temp dTemp press dPress ifc dateIFC wcolor oColor1 New Definition szTag id dDia dia dLen cl dTemp temp dPress press dateIFC ifc oColor1 color N e w D e fin itio n id dia cl temp press ifc wcolor Industry Reference Data color Line Table Construction Application Fig 1.12 Public extensible dictionary. What each developer does, independent of the other, is consult the public dictionary to see whether or not there is an attribute (or “class,” as it is properly known) that describes the color of widgets. In the previous example, there is such an attribute: ColorW. Each developer—again, independent of the other—creates their map. The engineering application maps wcolor to ColorW, and the construction application maps ColorW to color. Note that neither developer had any constraints on what they named the attribute internally, nor had to consult the other to configure their own half of the database map. Indeed, the two developers could have been on different continents—each speaking a different language. Thus, we are now back to the first benefit of ISO 15926; that it has a publicly available data dictionary.7 The simple fact that the data dictionary exists means that an organization does not have to develop one of its own. The ISO 15926 dictionary can be used for any purpose without royalty payment. However, what if in the previous example the public dictionary did not have a term for the color of widgets? Even if the dictionary were 100-percent up to date when it was first published, new technology would undoubtedly require new terminology. So, another requirement for the ISO 15926 data dictionary is that it be able to handle change. To handle change, the developers of ISO 15926 envision a library that will allow anyone (perhaps with the requirement of a bit of training first) to add new terms. A pilot project, called the RDS/WIP (Reference Data Service/Work in Progress), is in operation—sufficient to handle research needs. One of the current development efforts of ISO 15926 is to create an industrialscale reference data library. With a fully functioning, publically available RDS we will enable organic growth. Each organi- 7. Truth in advertising: ISO 15926 does not intend to have a complete data dictionary for all of “life, the universe, and everything.” It contains an “initial” dictionary and specifies how to extend it. Chapter 1 28 zation will use it to further their own private interests, but the result will be an expanded RDS for everyone to use. An Example Project Everyone can see the value of having all participants in a project able to exchange information with one another directly from database to database. On a typical fast-tracked project, however, there is no time for participants to map their software applications to each other’s databases once the project starts. On the other hand, there is no incentive to do so before contracts are awarded either. About the only situation in which the partners in a data exchange are known well enough ahead of time, and in which the volume of information justifies a custom data mapping project, is between an EPC contractor designing a capital asset and the owner. Some owners realize that the handover of asset information is a bottleneck to efficient startup and are increasingly specifying information standards and transfer protocols in advance as a condition of contract award. But one only needs to look at recent examples of this type of thing to verify that such an exercise is expensive and time consuming. However, if all participants in plant design, construction, and operation map their databases and configure their software to comply with an industry standard the time constraint is removed. They no longer have to design and implement database mapping within the window of opportunity of one project. • Vendors are able to bid on more projects. • EPC contractors are able to entertain more bidders. • Owners can mandate more stringent standards for information turnover without limiting the field of participants. Many Project Participants All project participants benefit from mapping their software applications to be able to talk to one another. However, the sheer number of pairs of participants makes doing so prohibitive. The owner has it the worst because one way or another it will have to pay for all mapping exercises on its project. Even if all information an owner receives is funneled through one EPC, all participants will embed the costs of database mapping in their prices. Figure 1.13 is a diagram showing what might be a moderately large project. There are two engineers, three construction contractors, and one owner. The owner has two preferred suppliers it wants everyone to deal with. There are five suppliers between the two engineers, and five suppliers between the three construction contractors. Chapter 1 29 V 01 44 V 02 43 45 V 03 28 27 19 EPC 2 26 11 V 04 12 25 10 24 8 Const 1 20 22 13 9 V 05 29 14 21 V 10 23 15 2 17 EPC 1 18 3 34 31 33 38 16 V 11 4 1 30 32 37 35 Const 2 V 20 V 21 36 V 22 39 V 30 5 42 41 7 6 Const 3 40 V 31 Owner Fig 1.13 One owner, two engineers, and three constructors. Overall, there are 18 players with 45 different connections between them. Remember that each colored line in Figure 1.13 represents a separate mapping project involving hundreds or, in the case of the EPC contractor-to-owner transfer, thousands of data-point maps. Of course, no one would ever do this. Even if the owner were willing to pay for all custom database maps (either explicitly or built into the prices), the speed with which the maps would have to be created would be prohibitive. Once an owner signs an agreement to begin detailed engineering, there is no time to spare. The requirement that all bidders agree to a database mapping exercise before submitting bids delays a project significantly. Wouldn’t it be nice if the whole world would agree on a common manner of describing plant objects and a common set of definitions? Then we could just tell each other “Have your machine talk to my machine.” If all participants map their databases to a common standard, everyone only has to map their applications once—ever. We Need a Babel Fish To reuse a metaphor, we need a Babel fish (Figure 1.14) to translate information from one company to another. Fortunately, ISO 15926 acts very much like a Babel fish. • Each company creates its own ISO 15926 interface and makes it available to its business partners. • Each company maps its own database (or at any rate, those portions of its database it wishes to publish to the project participants) to the ISO 15926 standard, and opens it to its partners. • Each company creates an application that can access the interfaces of its business partners. Chapter 1 30 F F Const 3 EPC 1 Fig 1.14 Everyone needs a Babel fish. After that, the only question you have to ask of a supplier (or engineer, or construction contractor) is “What is the URL of your interface?” Figure 1.15 shows how project data exchanges might be configured using ISO 15926. The small yellow circles are ISO 15926 interfaces that can range from e-mailing a neutral file to a specially programmed interface on the Internet, which we will talk about in Chapter 3. The individual data exchanges have been replaced by an “ISO 15926 cloud” to show that within the cloud information can go anywhere. We have added two new entities, called “data brokers,” which project participants can hire to perform ISO 15926 data exchange—in much the same way some organizations today hire an outside organization to manage their Internet web pages. V 01 EPC 2 V 02 V 10 V 11 Const 1 Data Broker 2 V 03 ISO 15926 V 04 Data Broker 1 Const 2 V 20 V 21 V 22 V 30 EPC 1 Const 3 V 05 V 31 Owner Fig 1.15 The same project using ISO 15926 interfaces. How ISO 15926 Makes Life Easier in the Near Term ISO 15926 standardizes the representation of project information across organizational boundaries. This means that information can move faster and more reliably with fewer transcription errors. Organizations are able to keep their proprietary systems while exchanging information with business partners in a manner their business partners can understand. Because Chapter 1 31 information has a standard representation, some exchanges can be automated. The following examples show how ISO 15926 works in realistic project situations. Project Design To mitigate the time delay between receiving project documents from an EPC contractor and getting the relevant information entered into its document control systems and operations and its maintenance systems, an owner may wish to specify in advance the precise nature of the data handover. Unless the EPC contractor has worked for the owner previously, there will be a delay while the EPC configures its engineering systems to comply with data handover requests—and the costs of doing so will be borne by the owner either explicitly or embedded into the fees. When ISO 15926 is mature, this will all change. As soon as the owner determines that the EPC contractor can hand over the project data following the ISO 15926 protocols, no one will have to give data handover any more thought. Project participants will be able to simply get on with design. In addition to not having to spend time negotiating handover up front, any information exchanges during the project design (Figure 1.16)—such as between EPC contractors and vendors, between the EPC contractors themselves, and to the owner—will be easier. EPC 2 V 10 V 11 ISO 15926 EPC 1 Owner Fig 1.16 Information exchanges during design. Procurement On a large capital project, an equipment supplier bidding on the project will receive a great deal of information with the initial inquiry—which can consist of many books of specifications and many partially filled-in data sheets. All bidders will have to read everything carefully and make enough enquiries to verify that all parties agree on the terminology. The successful bid- Chapter 1 32 der will have to fill in the remaining parts of the data sheets and return them along with many books full of installation and maintenance instructions. ISO 15926 makes procurement (Figure 1.17) more efficient in two ways. • Because ISO 15926 standardizes the description of plant objects, data sheets are more reliable—which means that there is much less need to verify terminology. • There is a potential for automating repetitive tasks because the meaning of equipment attributes is accurately conveyed with the attributes themselves. V 01 EPC 2 V 02 V 10 V 11 V 03 ISO 15926 V 04 Data Broker 1 EPC 1 V 05 Fig 1.17 Information exchanges during procurement. Construction Construction contractors are starting to use sophisticated construction planning systems that rival engineering systems in size and complexity. Currently, importing information from an engineering system is a potential bottleneck. With ISO 15926 (Figure 1.18), this is no longer an issue. • Construction planning can start earlier, and can be kept up to date more easily. • There is the potential to integrate construction planning with engineering. Chapter 1 33 Const 1 EPC 2 V 20 Data Broker 2 ISO 15926 V 21 V 22 Const 2 V 30 EPC 1 Const 3 V 31 Fig 1.18 Information exchanges during construction. Handover At the end of a large capital project, there might be many tens of thousands of documents for the EPC contractor to turn over to the owner (Figure 1.19). The EPC contractor in turn will have received many of these documents from a number of suppliers and subcontractors, each in the form most convenient for the authoring organization. Usually, to be of any use to the owner every document has to be read by a real person, entered into the owner’s document control system, and manually linked to the plant object in the owner’s facility operations and maintenance software. This can take many months and cost millions of dollars. Obviously, owners want to reduce the cost and have the information ready to use for startup. Const 1 EPC 2 ISO 15926 EPC 1 Const 2 Const 3 Owner Fig 1.19 Information exchanges during handover. Chapter 1 34 Some owners these days make turnover in a specific form a project requirement. This moves the effort to comply with specific requirements forward in time but does not make the job itself any easier. Using ISO 15926, the information can be rendered in a standard form—eliminating most of the issues we have today. • Information exchange is easier because the exchange format is built into the computer systems of all parties. • Handover information is cross-referenced directly against assets, eliminating most of the manual work of relating information to specific plant objects. • The time a project is most vulnerable is during commissioning. Because the data is in a standard form, it is much more easily made available for startup. Plant Operations and Maintenance When information is turned over to the owner already indexed to the relevant equipment, it is easier for maintenance personnel to locate. When information is easier to locate, it is easier to keep it up to date during plant modifications (Figure 1.20). • Hands-on maintenance time is maximized because less time has to be spent finding information. • Worker safety is improved because it is easier to make sure critical information, such as equipment hazards and hazardous substances, is in the proper place. • It is easier to manage information over time as computer hardware and software changes. Currently, documents in an old system may be very difficult to find and use. With ISO 15926, however, all information can be stored in a standard form so that loading information into a new system is much easier. Chapter 1 35 V 01 EPC 2 V 02 V 10 V 11 Const 1 Data Broker 2 V 03 ISO 15926 V 04 Data Broker 1 Const 2 V 20 V 21 V 22 V 30 EPC 1 Const 3 V 05 V 31 Owner Fig 1.20 Information exchanges during operations and maintenance. How ISO 15926 Makes Life Easier in the Long Term The preceding examples highlight existing information exchanges and show how they are made easier using ISO 15926. In the three examples following, we are speculating based on what will be possible when ISO 15926 becomes widely adopted. Using Specialized Designs Figure 1.21 shows a project previously designed by one of the EPC contractors, as well as a specialist the EPC contractor works with. ISO 15926 will make it much easier to reuse previous work and to integrate the work of specialists. Currently, we try to do the smart thing and reuse work we have previously done. But there are practical barriers that often get in our way. For instance, a design may have been done with a different engineering design system or perhaps with an older version of a design system than that currently used. We salvage as much as possible, but often large parts must be redone. Chapter 1 36 Previous project information stored as ISO 15926 Specialist EPC 2 Communication to Specialist supplier using ISO 15926 ISO 15926 Fig 1.21 Integration of information with an engineering subcontractor. In addition, it will be easier to integrate information from the designers of major systems. On a large capital project, it is typical that the design of major systems be given entirely to organizations that specialize in those systems. For instance, after an EPC contractor determines the performance characteristics of a power boiler the design and fabrication are typically given to a company that specializes in power boilers. The power boiler may have a great many instruments and other devices that have to be integrated into the owner’s plant maintenance and operating systems. Currently, the best way to handle this is similar to the overall data handover issue we have just discussed. Basically, either the EPC contractor or the eventual owner will have to receive all of the documentation about the equipment and manually enter it. As with other information exchanges, use of ISO 15926 will make these situations much easier to deal with. • Designs from an older project will be more easily reused on a new project. • Designs from joint venture partners will be more easily integrated. • Design from a specialized designer will be more easily used on all projects that use that particular design. • Licensed process design will be sold more easily to EPC contractors. Application Development When software is developed (Figure 1.22), the developer must deal with the manner in which information is to be acquired (data in), how it is to be stored, and how it is to be published (data out). ISO 15926 will make all of these issues much easier to deal with. Chapter 1 37 • ISO 15926 is a single standard to support. The questions of how input is received and how output is to be made disappear. • ISO 15926 already exists. A new data model for data storage does not have to be developed from scratch. ISO 15926 Software Developer Fig 1.22 Information exchanges for software development. Integration of RFID information RFID technology is getting less expensive over time and is showing up in new applications every day. It is easy to imagine a future in which almost all objects have an RFID tag. It is a small leap to imagine a manufacturer imbedding an “ISO 15926 identifier” in the tag that will automatically link to all of the information about the component that had been publicized with ISO 15926 protocols. This could include: • Material certificates and other information from the manufacturer • Inspection certificates • The purchasing information from the vendor that supplied it • The construction planning information from the constructor that installed it • A sound recording of the pump made during commissioning • The specifications from the engineer that specified it • Warehouse information for spare parts along with the GPS location of them • Links to operations and maintenance data This is different from reading a product serial number from a nameplate and then manually searching on the manufacturer’s web site for product information. The ISO 15926 identifier will link directly to the product information without the user having to know the manufacturer’s name or be able to read a nameplate. This represents true integration of RFID information (Figure 1.23). Chapter 1 38 V 03 ISO 15926 Const 2 EPC 1 Owner Fig 1.23 Integration of RFID information. Chapter 1 39 CHAPTER 2: HISTORY OF ISO 15926 The ability to exchange digital information between computer programs probably became an issue as soon as the second computer program was written. Software developers create their applications independently and make individual pragmatic decisions on how to represent data. As a result, users of the software can typically only open a data file by using the authoring application—not a competitor’s application. In early computing, information exchange between computer programs could only be done the hard way: by reading the output of one application and manually rekeying the appropriate parts into another. Over time, as software vendors responded to user’s needs, we have come to expect to be able to move information from one system to another without having to completely rekey it. But we still need to know a great deal about the computer systems and work processes involved if we want the meaning, or semantics, of our information to be preserved throughout the exchange. Today, at the beginning of the second decade of the twenty-first century we are on the verge of making the vision of open exchange of project information a reality. Although there have been many business drivers pushing this, it has only become possible via the hard work of many people and the convergence of the following four areas of study. • How we use the Internet to find information. • How we know and understand things. • Open means of storing and exchanging data. • The evolution of product information standards. Figure 2.1 shows how these areas of study relate to ISO 15926. Chapter 2 40 How we use the internet to find information The Semantic Web How we know and understand things Open ways to store and exchange data Plant Information Exchange Markup Languages Ontologies Evolution of product information standards Ontology Languages ISO 15926 Fig. 2.1 ISO 15926 enablers. How We Use the Internet to Find Information The amount of information available on the Internet is truly staggering. Unfortunately, the unstructured format and lack of validation of the majority of this information makes it difficult to use with confidence. Worse, much of what is potentially useful is stored in unpredictable forms and in unpredictable locations. Therefore, because the information is not presented in a uniform manner understanding whether a given piece of information is worthwhile or not usually requires a human being to sift through it page by page. This is not what Tim Berners-Lee had in mind when he invented the World Wide Web in 1990. He envisioned much more than what we use today, which he has called version 1.0. He envisioned a web environment in which people could ask their personal digital assistants questions such as “Is there a medical doctor near here who specializes in geriatrics and has an open appointment before this coming Friday noon?”—and then go for coffee while the assistant locates a doctor and makes an appointment. He called this the Semantic Web. Currently, the World Wide Web is built to link documents primarily for human consumption. Computers can process web pages for layout and visual format, but they have no way to process the semantics—to know what the pages mean. Thus, if you wanted to find a doctor in the pervious example you might be able to use the World Wide Web to get a list of doctors, their specialties, and maps with which to judge the distance, but you would still have to call each Chapter 2 41 doctor’s office individually to find out if he or she is taking new patients and if there is a suitable open appointment. Using existing sources of information, you might get lucky and get an appointment with the first call, but it could easily take much longer. The Semantic Web is all about describing things in a manner that computers can understand, so that you can ask questions like the one in this example and let a digital assistant do the leg work. Using Semantic Web technology, data can be shared and reused across application, enterprise, and community boundaries. ISO 15926 has borrowed two technologies from the Semantic Web, OWL (Web Ontology Language) and RDF (Resource Description Framework). OWL is a language for creating ontologies, a concept we will discuss next. RDF is a way of storing information in declarations of truth using specific vocabularies, or ontologies, in a manner that makes the meaning machine readable. How We Know and Understand Things When we go beyond custom-built methods of exchanging information between two particular computer applications—when we try to design a way for any two computer applications to exchange information without having to know anything at all about each other—we confront the question of how we represent knowledge. This is not just sophistry. When two people are having a conversation, they will naturally ask questions if they do not understand something or if they receive an unexpected response. But when software applications exchange information there is no opportunity to ask questions. The developers of these applications will make certain assumptions about the world, and therefore key terminology in the applications may differ. The solution is to embed the necessary context (that is, the understanding that humans bring) into the data being exchanged. For this, we need to understand how we know things. The study of how we know things in philosophy and mathematics is called ontology. We do not need an in-depth understanding of what ontology is in order to understand ISO 15926, but a brief, personal, example will be helpful. I, your humble author, ride a bicycle to work most days. (Among other things, it lets me indulge in the luxury of eating the fine Ukrainian food my wife cooks for me!) The distance to work makes a nice workout but is beyond walking if the bicycle were to break down. Therefore, I have developed what you might call an Ontology of Things That Will Carry a Bicycle. Now, in Western Canada—which to most Europeans is but a few years out of the horse age— the pickup truck is king. In Western Canada, all “real men” have pickups. As you can see from Figure 2.2, there is ample room in a pickup truck to carry a bicycle. So, it is not difficult to imagine that if my bicycle broke down on the way to work, I would try to think of everyone who owned a pickup truck that might have driven it to work that day. Fig. 2.2 Pickup truck. Chapter 2 42 Suppose one such friend is Bill, who owes me a big favor. But when I talk to Bill he tells me he can’t help me. He tells me he is going camping that weekend, and to make a fast getaway he has already loaded his camper. How do I know this will be a problem? It is because I know that when you load a camper onto a pickup truck there is no room for a bicycle, as you can see in Figure 2.3.1 Fig. 2.3 Pickup truck with a camper loaded. But hold on! My father used to own a camper for his own pickup truck (he being a “real man” and all), and I have been inside it many times. On most campers there is a door at the back, and just inside the door is enough space for a bicycle. Alas, Bill tells me, he has already filled the available space with his other camping gear—leaving no room. So, with that conversation I start planning how to get home on public transit. Being a “real man” myself, I own a pickup truck and will have to drive it back to work to pick up my bicycle. But by coincidence a new engineer, who just emigrated from the Czech Republic, walks by and overhears my dilemma. He tells me that when he moved to Canada he brought with him his Felicia Fun. I can’t imagine what a Felicia Fun is, but judging by the expectant smile on his face I suspect it might be relevant so I ask him about it. Being new to Canada, he does not know how to describe it so he says it is like the F150 his friend has—but a bit smaller. Hooray! The Czech Republic has “real men” too! I immediately accept his kind offer to drive me and my bicycle home after work. (Oh, and I owe him a really big favor. Perhaps I will invite him in for Ukrainian food!) How did I know that a Felicia Fun would carry my bicycle without ever having heard of it before? It is because of my Ontology of Things That Will Carry a Bicycle. In this ontology is the general class “pickup truck,” and one instance of the class “pickup truck” is the F150—which is very common in North America. When my Czech friend said that his Felicia Fun is just like an F150 but a bit smaller he was essentially adding another instance of the class “pickup truck” to my ontology—and because I knew that all pickup trucks can carry a bicycle I instantly drew the inference that a Felicia Fun can also do so. Figure 2.4 shows the relationship of things in my ontology. 1. For readers outside North America, a “camper” is like a holiday trailer without wheels. Instead of being towed behind a vehicle, it is loaded onto the back of a pickup truck. Chapter 2 43 Ontology of Things That Will Carry a Bicycle F150 Pickup Truck Felicia Fun Pickup Truck with a Camper that is not full of stuff Fig. 2.4 Ontology of things that will carry a bicycle. Similarly, when we use a machine-readable ontology to organize and store information we make it possible for machines to make inferences. If two applications use the same ontology to store information it will be much easier for them to exchange information, and to preserve the meaning of the information during the exchange. There has been a great deal of study on the subject of ontologies in an effort to implement the vision of the Semantic Web. A number of tools have been developed to make it easier to create and share ontologies. One of the more important of these is OWL, which is being incorporated into Part 8 of ISO 15926. Open Ways to Store and Exchange Data For machines to be able to understand what data means, we need to see how to store data in a way that retains its meaning without being dependent on proprietary software or short-lived hardware. Since the invention of writing at about 3000 B.C., people have used all manner of what we now call hard copy to record information. The Library of Alexandria, which burned down in 48 B.C., according to one account,2 is an example of the best technology for managing information in hard-copy form and of a major limitation of doing so. With the advent of computer-managed storage in the mid twentieth century, information managers have had to grapple with two problems. • Survival of information beyond the lifetime of proprietary hardware and software • Moving a large amount of information between proprietary systems A typical example of these types of questions is a help desk inquiry from the mid 1980s. 2 Look up “Library of Alexandra” in Wikipedia. Chapter 2 44 “I have data I want to keep for decades. Should I invest in a good card reader or should I transfer my data to these far more efficient but newfangled floppy disks?” Unfortunately, the best answer to this type of question has always been rather labor intensive. That is, the only reliable way to keep digital information for decades is to upgrade our storage media every few years to whatever is the latest and greatest at the time. For personal use, in the 1980s it would have been 5-1/2-inch floppy disks. By the 1990s, we would have had to copy our archive to 3-1/2-inch floppies. Then, sometime around 2000 transfer them again to CDs—and a bit later to DVDs, and more recently to BDs. Now, at the beginning of the second decade in the twenty-first century, flash drives are looking like they will be readable for quite awhile. But for how long will our personal computers have USB ports? At some point we will still have to load up our thumb drives and copy the data to some new media; perhaps a 3D holographic memory block. Unfortunately, even if we go through the exercise of transferring our archive every few years how are we going to open the files 25 years from now? In the lifetime of the author (who is so old he can remember when an entire family had to make do with a single telephone), the word processor of choice has gone from WordStar, to Word Perfect, to Microsoft Word—which comes in two “flavors,”” one for PCs running Windows and one for Macintosh computers. The nice thing about Windows is that it does not just crash; it displays a little dialog box and lets you press OK first. Q. How do you make a Macintosh go faster? A. Drop it from a higher window! Working with Word 2002, now, as this is being written, we can open the following word processor file formats. • Word 2.0 • Word 5.1 for Mac • Word 6.0 (95) • Word Perfect 5.0 • Works 2000 Where is my beloved WordStar? In addition to copies of all my data files, do I have to keep copies of all my old authoring software? And even if I do, what will I run it on? Do I also have to keep a working model of each vintage of personal computer? What if they break down? So now, if I actually want to be able to retrieve my personal archives for decades (perhaps I am thinking that after I become a famous author a publisher will give me a million dollars to write my memoirs) I will have to open each of my archived files every couple of years and somehow transfer the content to whatever the new authoring software is. This will remove the problem of having to keep old hardware and software around, but will introduce two new problems. First, this solution will create an upper limit on how much information I can keep around. Be- Chapter 2 45 cause it will take a certain amount of time to upgrade my archive each cycle, I will have less and less time each round to create new information. Eventually, I will just finish one upgrade when I will have to start over with new technology. Second, who’s to say there will always be a clear and easy upgrade path from one authoring software to the next? For example, what if I have a large number of files authored with obscure CAD software? What if none of the current set of dominant CAD developers wrote the appropriate conversions into their offerings? Well, Figure 2.5 shows another option.3 Fig. 2.5 Long-term information storage using the Internet. If the problem of moving information between proprietary systems is daunting on a personal level, try to imagine what it is like for organizations that create large bodies of documentation. For instance, every model of commercial aircraft you see today requires millions of pages of documentation that have to be revised and published on a regular basis. The combined documentation libraries of the aircraft industry are immense, yet every few years the dominant hardware and software changes. In the government and law we have a similar situation in that large bodies of text must be readable for decades. Because of this, the text must not be stored in a proprietary format that may go out of fashion in a few years. This brings us to the topic of markup languages and neutral files. Markup Languages The forebear of modern markup languages was developed in the late 1960s by a lawyer named Charles Goldfarb. He had just joined IBM to get some high tech experience and was assigned to a project to figure out how to merge case law research results together into one document, compose it, and print it. At the time, there was no single system that would perform all three of these functions. Thus, when text was to be printed it had to be transferred from one proprietary system to another—all without loosing its fidelity, or meaning. With a team of two others, he developed what he called Generalized Markup Language (GML). GML was a set of macros that described the logical structure of the document, for instance, to declare some text to be a heading and other text to be a body paragraph. The use of GML 3. This is taken from the online forum Slashdot.org Chapter 2 46 markup tags freed document creators from having to deal with the appearance of the documents so that they could concentrate on content. Since the development of GML, markup languages have become widely used in enabling computers to handle large bodies of text properly without human intervention. When encoded, or marked up with tags, the content of a body of text is separated from the format (appearance) of the text. This is an important concept in ISO 15926, wherein the goal is to embed enough context into the content that we do not need to see the format of the information to know what it means. So, for instance, we would not need to see an entire data sheet to know what a single data value means. Figure 2.6 shows the descendents of GML. GML is no longer used, but the other languages are. SGML (Standard Generalized Markup Language) is used for managing very large bodies of textual information. HTML (Hypertext Markup Language) is the language of the World Wide Web, which all web browsers can read. XML (Extensible Markup Language) has become a workhorse transport language for embedding meaning into all manner of information exchange. GML SGML HTML XML Fig. 2.6 Development of XML. If you wish to continue in your studies of ISO 15926 beyond this introduction, an understanding of XML will be helpful. You will probably not write very much XML code yourself, but you will run into quite a bit of it. It will be helpful to be able to recognize what you are looking at. There are a number of good learning resources on the Internet. Resource Description Framework (RDF) and Web Ontology Language (OWL) are two technologies developed for the Semantic Web that were adopted by the developers of ISO 15926. Figure 2.7 shows their relationship to XML. Information in ISO 15926 is structured in an ontology, starting with basic concepts all the way down to individual objects in a particular project. OWL and RDF are well suited to this type of structure. OWL and RDF are the basis of ISO 15926 Parts 8 and 9, which we discuss in Chapter 3. Chapter 2 47 XML Validated by Compares to RDF XML Schema Compares to Validated by OWL Fig. 2.7 XML, RDF, and OWL ISO 10303 Part 21 and EXPRESS As you learn more about ISO 15926, you may hear about “EXPRESS” and “Part 21 files” as a method of exchanging information. ISO 10303, a standard for information exchange, is the direct forebear of ISO 15926—as we will discuss in the next section of this chapter. To manage the data models for the various parts of ISO 10303, the developers needed a standard modeling language to specify how the data models were to be created and used. They called their language EXPRESS and created ISO 10303-21, which standardizes the way to use EXPRESS for storing and exchanging product information. In recent years, EXPRESS has been replaced with OWL by some developers of ISO 15926—but is still regarded as an efficient modeling format. Part 21 files are used for data exchange in airplane, automobile, and building manufacturing and construction. Evolution of Product Information Standards When we study the evolution of the exchange of product information, we can start pretty well at any arbitrary point in history. We humans are nothing if not ingenious, always looking for ways of improving work processes. Laziness had been given a bad rap. Laziness is the reason we live indoors and eat three meals a day instead of living under a tree and chasing wild animals for our supper. Laziness gives us the incentive to invent things so we don’t have to work as hard. What we should avoid is slothfulness.4 No matter how far back one looks to find an example of a product standard, there is always an earlier example. For instance, the National Institute of Science and Technology—in its publication STEP, The Grand Experience—starts before the Industrial Revolution with a description of what we would today call a machinist, carefully measuring the prototype of a part while making a duplicate. Then, at the beginning of the nineteenth century the idea of an engineering drawing was developed—which enabled workers to follow an objective standard, which in turn lead to the idea of interchangeable parts. The Wikipedia entry for computer numerical control (CNC) describes the first attempts to 4. The author’s English instructor at technical school in 1972. Chapter 2 48 minimize the variability of parts by reducing a drawing of a part to a series of (x,y,z) tool path movements and storing it on punched tape. The vision of ISO 15926 is that full life-cycle product information—which includes information about every object in a processing plant, manufacturing assembly line, airplane, ship, and highway interchange—can be stored in a neutral form that any computer system can read and write to. We are close enough to achieving this vision that we can see it, but the progress toward this goal has always been at the margins— with one development leading to the next. For our history here, of the evolution of product information standards, we will start with the methods developed to exchange electronic drawings produced with computer-aided drafting (CAD) software. Shortly after the advent of CAD, a number of such standards sprang up around the world—and although we could use almost any of them as an example we will use the Initial Graphics Exchange Specification (IGES). From there, we will move to the Standard for the Exchange of Product Information (STEP)— which preserved the meaning of the drawing along with the graphical image. Finally, we will come to ISO 15926—which uses a single generic data model instead of the many specific-purpose data models used in the STEP world (with the addition of the dimension of time). Along the way, we will introduce a number of organizations that have played key roles. The Initial Graphics Exchange Specification The first systems we would call CAD were written in the early 1960s in universities, with commercial systems appearing a few years later. The early commercial CAD systems were developed by large organizations, such as automotive and aerospace companies to assist their engineers in complex calculations, and by computer manufacturers as tools to increase hardware sales. By the 1970s, there were a number of competitors—all with proprietary file formats, with the resulting inability to share data across system boundaries. To take just the defense industry, at the time there were more than 3,500 third-party vendors in the U.S. Department of Defense (DoD) supply chain. Members of manufacturing supply chains each have unique requirements depending on their role, and the software they use differs accordingly. At that time, even within a single organization different manufacturing processes used different software that was not designed to communicate with the others. This meant that when drawings were sent along the supply chain information from them had to be reentered at every step. The frustration over closed systems came to a head during a meeting of the Society of Manufacturing Engineers in the fall of 1979. A large user of CAD software challenged the CAD vendors in attendance to develop a neutral exchange mechanism. From there, the historical accounts start to resemble the old folk tale of “stone soup.” At first, two vendors agreed to open certain portions of their database format—then two others. By the end of the conference, all of the players had agreed to contribute something—and what would eventually become the IGES was born. (And it probably did not hurt that the U.S. Navy was about to award some large contracts and no one wanted to appear to be unresponsive.) Figure 2.8 shows the timeline of IGES. With the support of the National Bureau of Standards— now known as the National Institute of Science and Technology (NIST)—the DoD, and the user community, the first version of IGES was completed and published in 1980. Within a few years, IGES became the de facto standard of information exchange within some segments of the manufacturing community. Chapter 2 49 U.S. DoD ISO TC184 SC4 IGES 1980 CALS 1990 IGES 5.3 2000 2010 Fig. 2.8 History of IGES. In 1985, the DoD launched what is now the Continuous Acquisition and Life-cycle Support (CALS)5 initiative to accelerate the use of digital information in the management of defense system technical data. IGES was one of the first standards it supported. By 1988, all manufacturing information for weapons systems for the DoD had to be turned over in electronic form in the IGES format. This meant that any vendor of engineering or manufacturing software that wanted to market its products anywhere along the military supply chain had to make sure its products could read and write to an IGES neutral file. With the use of IGES, it no longer mattered whether or not all the members of the DoD’s supply chain used the same software; as long as their software supported IGES, they could ex- 5. The scope of CALS has changed over time and the meaning of the acronym has changed with it. In the beginning, it was Computer-Aided Logistic Support, then Computer Aided Acquisition and Logistic Support, and finally Continuous Acquisition and Life-Cycle Support to indicate that its scope was the entire life cycle of a product. Chapter 2 50 change drawings with each other and with the DoD. This eliminated rekeying information from paper drawings, with the resulting reduction of human error. As well, if the drawings were stored as IGES neutral files rather than in their native format they could be retrieved years later with different software. This is not to say that all users of IGES were enthusiastic supporters. Although there is genuine value in exchanging the graphical elements of a drawing in a way that does not depend on proprietary formats, in a manufacturing environment there is often more to a “drawing” than just the graphical elements. Interest and support from the manufacturing community shifted to what became known as STEP. The last official version of IGES is 5.3, published in 1996. An update had been planned, version 6.0, but work stopped within a couple of years. Nevertheless, IGES remained an American National Standard until late 2006—and many modern CAD systems still support it. Whereas the driver for IGES was the ability to exchange the graphical information about a product (i.e., the drawing), the driver for STEP was the ability to exchange the complete product model, unambiguously, independently of any computer system. A product model is the complete definition of a product, with all of its properties. For instance, if you were an engineer designing a machine part you would start with what the part was supposed to do; would make decisions on material, the production process (including whether or not to cast the part or machine it from stock), its surface finish and heat treating; and along the way would make a drawing showing the dimensional properties. If it were important to the part, you might also include packaging and shipping instructions. In short, the product model is all of the information about the product for its entire life cycle. By some reports, IGES did a reasonably good job of transferring images from drawings—but all other information was lost. For example, a circular object on a drawing was faithfully communicated by IGES as a circle—but thereafter, all it would ever be was just a circle. In the original CAD system, the object that appeared as a circle may in fact have been a through-hole, a blind hole, a spot face, or a simple circular mark on the face of a part. In the original system, the circular object may have in fact been able to drive production systems—such as a machine that would drill the hole (or whatever it was) in the physical part. But once the drawing of that part had been converted to the IGES format all of the knowledge of what the circular feature really represented was lost. [Of course, the drawing would still have the original notes describing the circular feature (text was transferred faithfully as well) but a human would have to read the notes and reenter the information into the new system after the drawing was transferred.] What the industry needed was a neutral format that captured a richer information payload, reliably and without loss of design intent. The Origin of STEP The story of STEP starts around 1984, with the first meeting of a committee of the International Organization for Standardization (ISO) and what is now NIST. ISO was founded in 1947 and is based in Geneva, Switzerland. It is composed of some 160 member bodies who are agents of their respective countries, where the ISO standards often become law. Every nation on Earth is eligible for membership, and most have joined. The range of ISO standards covers almost every human endeavor. For instance, ISO-1 (Standard Reference Temperature for Geometrical Product Specification and Verification) defines the temperature (which happens to be 20º C) at which materials must be when taking mea- Chapter 2 51 surements that will be used for comparison in different geographical locations. Individual standards are written and managed by representatives of the industry affected by the standard, not by ISO staff. The role of ISO is more to make sure that individual standards are developed with as wide and fair a representation as possible. The development of ISO standards is organized by a hierarchy of committees and workgroups. The responsibility for standards related to the exchange of product information falls to a group with the cryptic moniker TC184/SC4, which stands for Technical Committee 184, Subcommittee 4. A partial hierarchy is shown in Figure 2.9. Work Group 3, Team 25, is responsible for ISO 15926. The various parts of ISO 10303, the official name of STEP, fall under a range of SC4 workgroups and teams. International Organization for Standardization (ISO) Technical Committee 184 Subcommittee 4 Work Group 3 Team 25 ISO 10303 Automation Systems & Integration Industrial Data Product Models Oil, Gas, Process, Power ISO 15926 Fig. 2.9 The relationship of ISO 10303 and ISO 15926 within ISO. The National Institute of Standards and Technology (NIST) started more than a hundred years ago, in 1901. Its name describes what NIST is all about: to promote U.S. industrial competitiveness by advancing measurement science, standards, and technology. IGES, STEP, and ISO 15926 are all within its mandate—and NIST has had an influential role over the years in organizing their development. By the time of NIST’s first meeting with TC184/SC4 in 1984, there was interest in the United States for developing a new standard for product data that was similar in operation to IGES but that would not lose any product information. This new standard was to be called the Product Data Exchange Specification (PDES). It is important to note that this was not to be an enhancement of IGES but a complete redevelopment of the approach to information exchange. The marked departure from IGES was so important that the organization responsible for IGES changed its name from the IGES Organization to the IGES/PDES Organization (IPO), as shown in Figure 2.10. Chapter 2 52 ISO TC184 SC4 NIST Sponsoring organization IGES 1980 LEGEND Consortium IGES Standard IGES/PDES (IPO) PDES STEP 1990 ISO 10303 2000 2010 Fig. 2.10 The transition from IGES to STEP. Internationally, events were lining up in support of a new standard as well. Other CAD exchange standards besides IGES had been created by this time, most notably in the French and German manufacturing industry (with more or less the same limitations as IGES). In 1988, the IGES/PDES Organization submitted PDES to the international community—which eventually adopted it as the basis for STEP and published it as ISO 10303.6 At the very beginning, the participants realized that the complexity of product data demanded robust data modeling. This was a significant development. The data model for a computer system tells us what the data means. It is like the set of blueprints for a building. Many experienced carpenters can design a simple building in their heads and build it without any drawings, but a large and complex building requires a comprehensive set. The blueprints are first used for recording and communicating the design between all interested parties. Later, the builder uses them to organize work schedules and to purchase materials. During construction, the carpenters can use the blueprints to work independently of each 6. We will use “STEP” and “ISO 10303” interchangeably. Chapter 2 53 other—knowing their work will coordinate in the end to the finished building that was envisaged by the architect. And if the design is particularly good the blueprints can be used elsewhere so that others do not have to redesign the same thing from scratch. In this metaphor, where the blueprints are like the data model the building is like the finished computer system—and what the users will do with the eventual building is like a process model. The process model drives the data model. The data model helps us visualize data structures before we build the system. Data model diagrams drawn on a few pages can represent a very large system that would be difficult to visualize by looking at many pages of computer code. The initial intent with STEP was to develop a single data model for a product that would become the master record containing everything that everyone who used the product would ever want to know. But by the time the standard was issued the real world proved to be too complex for one standard. STEP was therefore divided into several parts. Figure 2.11 shows the STEP standards that are of interest to the development of ISO 15926. Each of them is shown at about the time it was first issued as an international standard, but they all went through many years of development. For an example of the time involved, we have shown in greater detail the development timeline of AP-221—arguably the most important of the STEP standards from the point of view of ISO 15926. Today, STEP is in wide use in the aerospace, automotive, electrical, and electronic industries. Each industry, and segment within each industry, has its own exchange standard—called an application protocol (AP). Each industry that adopts STEP is encouraged to create its own AP. Today, there are many hundreds of parts to ISO 10303. Some were developed by the process industry, although none of these are in use today. The most interesting to the history of ISO 15926 are outlined in the following. • Part 11. Description methods: The EXPRESS language reference manual EXPRESS is well suited to data modeling for product data. • Part 21. Implementation methods: Clear text encoding of the exchange structure This part describes how to encode information using EXPRESS. If your interest in ISO 15926 goes “under the hood,” you will hear about “EXPRESS Part 21” files—which represent one method of transferring information between two users. • Part 42. Geometric and topological representation This part addresses how to represent the physical shape of a product. • AP-203. Configuration controlled 3D designs of mechanical parts and assemblies • AP-214. Core data for automotive mechanical design processes AP-203 and AP-214 are widely used in the manufacturing industry. AP-214 is a superset of AP-203. • AP-221. Functional data and their schematic representation for process plant This AP is primarily for schematic drawings such as process and instrumentation diagrams (P&IDs). Discussion during the development of this part led to the initiation of ISO 15926. • AP 227. Plant spatial configuration This AP includes classes for the physical representation of piping, HVAC, cable tray, and mechanical systems. • AP 239. Product lifecycle support Chapter 2 54 This AP is a mechanism for maintaining the information needed to support complex products and systems over their complete life cycle from conceptual design to decommissioning. In future diagrams we will show ISO TC184 SC4 AP-221 appearing, as if by magic, LEGEND around 1997. This hides a great deal of detail: Sponsoring organization 1980 International standard 1980 • AP-221 started development in the early 1990s Part of a standard PDES • Published as Committee Draft in 1997 • Formally published as an International Standard in 2007 STEP 1990 1990 • In between a number of consortia worked on it, as we shall see in the next few pages Part 11 • From the beginning it separated ISO 10303 Part 21 reference data from the data STEPlib Part 42 model, something not done with AP-203 the other STEP APs to that time AP-214 2000 2000 • For the reference library it used an existing library known as AP-227 STEPlib, adding to it and pub- AP-239 lishing it as part of the standard, Annex M. AP-221 2010 When you read this diagram, and Annex M others like it throughout this chapter, 2010 remember that although they appear at a particular time, all of the standards were developed over a similarly long timeline. Fig. 2.11 History of STEP. Today, the use of STEP in the manufacturing industry is routine—but it took a few notable successes to make it so. The following are three examples. • Boeing, Pratt & Whitney, Rolls-Royce, and GE Aircraft Engines used STEP for the 777 and 767-400 aircraft. • General Motors uses STEP to coordinate designs among its various design centers when they do not use a common CAD system. • The European Space Agency evaluated the use of AP-203 and AP-214 to transfer information between prime contractors and suppliers to the Agency’s programs. At the conclusion of the study, the two APs were proven to be mature enough to be used as a standard for CAD exchange in the European space industry. Chapter 2 55 The Process Industries STEP Consortia With the perceived success of STEP in the manufacturing industry, the process industry started to pay attention in the early 1990s. This development is entwined with a number of organizations and consortia that sprang up around the world in the couple of decades after the formal adoption of IGES. Figure 2.12 shows the more prominent process industry consortia on an approximate timeline. ISO TC184 SC4 LEGEND European Union Sponsoring organization Consortium which no longer exists 1980 Consortium which still exists ESPRIT PDES 1980 International standard Part of a standard STEP Japan STEP 1990 USPI-NL ProcessBase ISO 10303 PISTEP POSC Caesar Assoc’n 1990 PISTEP Activity Model EPISTLE STEPlib Part 42 PIPPIN PlantSTEP PIEBASE PIEBASE Activity Model 2000 AP-227 AP-221 2000 Annex M 2010 2010 Fig. 2.12 History of the process industries STEP consortia. It may appear that there is a 10-year gap between the first proposals for STEP and the formation of the first consortium. In reality, the period was very busy with many organizations devoted to the development of information exchange standards in general, and STEP and ISO 15926 in particular. Some of these organizations left very little trace of themselves on the World Wide Web, so their history is inaccessible to this researcher—and some were simply absorbed into the larger consortia. The consortia and standards are placed as close as possible to their actual times, but some license has been taken where dating information is ambiguous and where the overlapping of shapes would present a legibility problem. If the arrows between the consortia give the appearance of a complicated membership, the reality is even more so. We could almost have put two-headed arrows between every one of the consortia, and between each of these to each standard. In a book, we are limited to a serial description of events and 2D drawings, but the development of STEP—and the thinking that lead to ISO 15926—was all happening at the same time. Many individual companies were members of more than one consortium, and some consortia Chapter 2 56 were themselves made up of other consortia. The arrows between consortia and the international standards and parts roughly show the main sponsors—but, again, the reality is more complicated. Many of the people involved worked on behalf of more than one consortium and over time contributed to many parts of the international standards.7 Complicating the proper representation of responsibility is human memory—the people that were part of this while it was happening do not all remember things in the same way. What we will concentrate on, then, is what happened rather than who did it. We will start by introducing the main players. Development work was initially divided between the schematic representation of a process facility and its physical representation. AP-221 was initiated in Europe to develop the schematic (2D) representation, whereas AP-227 was initiated by an American consortium to develop the physical (3D) representation. ESPRIT A major theme of the many treaties leading to the formal creation of the European Union (EU) in 1993 was to develop a single market for its member states. It is not surprising, then, that the CEC—the executive body of the EU—was, and is, interested in efforts to standardize the flow of information between its manufacturers in order to make them more competitive. Perceiving that the competitive position of Europe’s information technology industry was loosing ground to those of the United States and Japan, the CEC established the European Strategic Programme for Research in Information Technologies (ESPRIT) in 1983. The purpose of ESPRIT was to fund research and technology development programs in cooperation with industry. By the early 1990s, a number of European organizations had made significant contributions to the development of STEP. The CEC sponsored its continued development with two ESPRIT projects: ProcessBASE and PIPPIN. ProcessBase ProcessBase started at the end of 1992 and ran for three years. The initial objective of ProcessBase was to develop a neutral format to be used for the exchange and management of process data used in all phases of a process plant—from design, through to construction, and eventually to operation. The approach used was to develop a data model for a process plant using the information modeling language EXPRESS, along with software to process the EXPRESS code. This approach introduced a high degree of automation to data exchange. The data model became part of AP-221. The culmination of ProcessBase was two pilot projects that demonstrated the exchange of plant information over the World Wide Web: one a chemical processing facility and the other a power station. According to the final report, this demonstration proved that bulk information exchange was practical and that information exchange based on a standard neutral form delivered the expected business benefits of reduced costs, higher information reliability by reducing opportunities for human error, and shorter cycle times. 7. We are not saying that this happened, but from some reports it is not difficult to imagine two fellows meeting at yet another conference, fumbling in their pockets for the first minute or two, trying to remember which business card they were representing that week. Chapter 2 57 PIPPIN The Pilot Implementation of Process Plant Information Warehouse (PIPPIN) started at the beginning of 1996 and ran for two years. The purpose of PIPPIN was not so much to develop AP221 but to build a data warehouse using AP-221 for the functional life-cycle data of a process plant and to use it in a real project. In this they were successful, as we shall explore in material following. PlantSTEP PlantSTEP, established about 1994, was a collaborative effort between NIST and a consortium of EPC contractors, owners, and suppliers for the purpose of developing and supporting standards based on ISO 10303. Starting with just a few organizations, the consortium eventually included most high-end CAD systems, many EPC contractors, many manufacturers, and many owner-operators. The intention was that all parties involved in a large capital project could use their own tools and work methods but still be able to share appropriate information seamlessly. Where the focus of ProcessBase was the schematic design of a plant (AP-221), the focus of PlantSTEP was the physical design (AP-227). Considerable effort was made by both organizations to ensure that AP-221 was compatible with AP-227. Japanese STEP Organizations The main mover of STEP and ISO 15926 activities in Japan is the Engineering Advancement Association of Japan (ENAA). ENAA played in influential role in developing ISO 10303 AP227 with its participation in PIEBASE. In the early 1990s, the capital projects industry in Japan was having the same issues with information exchange everyone else in the world was having; namely, that the perceived lack of standards for information exchange was restricting commerce. Rather than developing its own standard, ENAA decided to follow international standards and was an early proponent of AP-221 and AP-227. Later, Japan funded the development of the second edition of AP-227. In the mid 1990s, Japanese industry and government was motivated by the U.S. and European CALS initiatives and did not want to be left behind. The Japanese commissioned a domestic pilot project, called the Nippon CALS Research Partnership (NCALS)—which ran for three years, with similar goals to the U.S. CALS initiative. ENAA became more involved with STEP with their own project, Plant CALS—representing EPC contractors, equipment vendors, and owner-operators. This project enabled the participants to better understand the various ISO 10303 application protocols and proposals on operation and maintenance APs and to experiment with SGML as a transport language. During this period, several Japanese organizations participated in the PIEBASE consortium—until PIEBASE disbanded in 2003. By the late 1990s, the interest of the Japanese consortia was turning from the individual application protocols of ISO 10303 to a data warehouse and e-commerce solution. Some notable work was done with STEPlib and the POSC/Caesar Association (PCA) reference data library (RDL) to provide storage and a search mechanism for engineering asset information, which was later expanded to include operations and maintenance information. Chapter 2 58 The Japan Electric Measuring Instruments Manufacturers’ Association (JEMIMA) contributed to standardization by making practical use of ISO 13584 and is responsible for maintaining its reference dictionary, Part 501. At about the same time, the Japanese construction industry— with an endorsement from the Japanese government—used STEP AP-201, Explicit Draughting, as a base of 2D drawing exchange and as the archive standard for Japanese government projects. This standard uses conformance classes to inspect and validate the content of drawings and has become mandatory for all large Japanese government projects since its introduction. Currently, ENAA has been a strong supporter of ISO TC 184/SC4 standardization and participates in T25 standardization activities as a liaison organization to the Japan Nuclear Cycle Development Institute (JNC). ENAA is also a member of SC4 RDA Maintenance Team and Validation Team and actively supports enhancing the quality of the Part 4 RDL. European Process Industries STEP Technical Liaison Executive EPISTLE was formed in 1993 to be a forum for all of the international projects and organizations working toward standards-based exchange of engineering data using STEP. Figure 2.13 shows EPISTLE as consisting of USPI-NL, the Process Industries STEP Consortium (PISTEP), and what is now the PCA. In the early years of EPISTLE, the membership consisted of a number of companies directly involved in plant design, equipment manufacture, and operations— but over time the individual companies withdrew, leaving only the three consortia as members. Petrotechnical Open Software Corporation 1990 PISTEP Caesar Offshore Project UPSI-NL POSC/ Caesar Project 1995 Energistics POSC Caesar Assoc’n UPSI-NL 2000 EPISTLE Fig. 2.13 EPISTLE member consortia. It is important to remember that the individual consortia did not lose their identity while a member of EPISTLE. Therefore, many publications were made under one of the individual Chapter 2 59 names. As a consortium, EPISTLE directly supported AP-221, published some general work on the methodology of developing data models, and managed the EPISTLE Core Model—which we will describe in the next section when we describe the development of ISO 15926. USPI-NL In 1997, an informal group of plant owners and EPC contractors operating for about four years under the name SPI-NL created the formal association USPI-NL (Uitgebreid Samenwerkingsverband ProcesIndustrie-Nederland) as the Dutch Process and Power Industry Association for the development and implementation of industry standards for plant life-cycle data. The new group included equipment suppliers, and was gradually joined by software vendors, consultancies, and universities. The mission of USPI-NL is “To develop, maintain and promote the use of international information standards and best practices for product and plant life cycle information”—with the aim of achieving the quality of information required for the delivery of products and installations that are safe, reliable, and environmentally friendly and to achieve a shorter project delivery time, lower costs, and support for innovation. USPI-NL’s s vision is that “Companies in the process industries shall be able to share and/or exchange electronically the information needed to design, build, operate and maintain process and power plants using internationally accepted standards.” Currently, USPI-NL is active in five roles: networking, setting direction, providing implementation templates and frames, acting as a knowledge center, and developing and maintaining international standards. USPI-NL supports ISO 15926 (with emphasis on Part 4 and its maintenance), Part 11, ISO 10303-221, and several others—including ISO 13584. It actively encourages Dutch industry to adopt international standards for electronic data exchange and storage. USPI-NL has taken a lead in road-mapping and maturity assessments of industry, assisting individual companies to assess where they are in the roadmap, how they compare to the others, and how the industry as a whole progresses through the roadmap. USPI-NL has a wide network of associations and companies it cooperates with on development and adoption of standards. It maintains special cooperation with PROLIST, Fiatech, and ENAA. Process Industries STEP Consortium PISTEP was founded in 1992 to increase the competitiveness of the UK process industry by improving engineering information management. Its founders saw that the current paperbased information-handling methods used during design, construction, operation, and eventual decommissioning was hampered by locking information in documents where it was difficult to find. PISTEP endorsed the use of international standards to store information so that it would be accessible across organizational and system boundaries and not be locked into proprietary systems. PISTEP promoted STEP—and when it was introduced, ISO 15926 as well. A major contribution in the mid 1990s was the Process Plant Engineering Activity Model, shown in Figure 2.14. This was intended as an overview of the main activities and data flows required for the design and operation of a process plant. The first page (shown in Figure 2.14) is a summary, with each process (represented by a rectangle) developed in more detail within the document. It was intended as a guide for those developing STEP APs, but it remains Chapter 2 60 useful today as a starting point for anyone seeking to model such activity. In 2000, PISTEP merged with the PCA—with PISTEP becoming the UK chapter. Fig. 2.14 PISTEP activity model. POSC Caesar Association The PCA was founded in 1997 to promote the development of openly available standards for the integration and interoperability of data, software, and related matters for e-engineering and e-commerce. PCA works in close collaboration with other standardization organizations in Europe, the United States, and Japan. PCA is the originator of ISO 15926 and is committed to its maintenance and enhancement. The Petrotechnical Open Software Corporation (POSC) is a nonprofit, vendor-neutral corporation based in Houston, Texas, that was founded in 1990 by five major oil companies with the aim of establishing open software standards to be used for sharing information through the asset life cycle. By 1998, it had grown to 130 members, including large and small suppliers, government agencies, universities and research centers, other standards organizations, and oil companies. This consortium works to share information among its members and to promote useful software modeling, data, and application integration standards. At the 2006 Standards Chapter 2 61 Summit & Reception in Houston on 8 November 2006, POSC renamed itself Energistics. The Caesar Offshore Project was founded in 1993 by seven organizations active in the North Sea as a research and development project under the name of Caesar Offshore Program. The purpose of the project was to develop a product model for life-cycle information. The focus was on standardizing the technical data definitions for facilities and equipment associated with onshore and offshore oil and gas production facilities. In 1994, the Caesar Offshore Project was reorganized and defined as a project of POSC and changed its name to the POSC/Caesar Project, and then more recently to the POSC Caesar Association (PCA). The technical work of PCA was more and more related to the ISO STEP standard and influenced by similar work in European standardization organizations such as PISTEP in the UK and USPI-NL in the Netherlands through the EPISTLE consortium. PCA collaborates with many standards organizations around the world, including Energistics and Fiatech. Process Industry Executive for Achieving Business Advantage Using Standards for Data Exchange By the mid 1990s, there was such significant effort being expended on the development of STEP that there was real risk of duplication of effort. To coordinate the work globally, the Process Industry Executive for Achieving Business Advantage Using Standards for Data Exchange (PIEBASE) was chartered in the United States in the fall of 1996. The membership was essentially the members of EPISTLE, PlantSTEP, and POSC—with representatives of the Japanese STEP community and NIST. The intent was to develop a common vision and a coordinated roadmap for the development and use of international standards for information exchange, including many of the STEP APs and a number of other standards. Although PIEBASE did not author any standards directly, it did valuable work defining boundaries for APs (so that the APs did not overlap) and reviewed overall priorities to increase the return on investment being made by the participants. A significant publication was the PIEBASE Activity Model, which was a reworking of the PISTEP Activity Model. An activity model is a diagram showing the relationship of a set of inputs, outputs, and activities that constitute a process, and it is an important input to a data model. The PIEBASE Activity Model had the viewpoint of a process plant owner-operator that wanted to produce a product by building a process plant. The purpose of the activity model was to describe the activities related to the generation and use of information in the creation and operation of the process plant and to provide a context for more detailed activity models for individual APs. The PIEBASE Activity Model is contained in both AP-221 and AP-227. PIEBASE wrapped up in 2002. STEP Issues STEP does well in manufacturing environments, but some deficiencies were exposed when it was used as the basis for storing life-cycle information for process plants. First, it sees information exchange as a problem to be solved by a series of exchanges—each with its own AP. It has been estimated that a typical process plant would require several hundred APs. Aside from the obvious problem of simply keeping track of them all, a major issue is that “things” that make up a modern process plant do not naturally come to us with their preferred AP stamped on their bottoms. Chapter 2 62 APs by their nature overlap with each other and it is therefore not always obvious which AP should be used. For instance, a particular control valve is part of a plant’s process design and thus the valve should show up on a P&ID, which implies that we should use AP-221. But the physical valve will be a product of some manufacturer that will want to use AP-203/214 during design and manufacturing. The valve will also show up in an engineer’s 3D model, which implies AP-227. Now we could come up with a scheme that uses AP-221 in some situations and AP-203/214 or AP-227 in others, but the best solution is a single standard that can be used to represent the valve in all of its aspects. Second, maintaining information about plant objects requires the ability to handle change over time. This is possible with STEP, but it is cumbersome because one would have to keep updating the data model. During the pilot projects for the various STEP APs, which we have described previously, a detailed data model worked well—but one characteristic of pilot projects is their relatively short duration. Over a longer term, maintaining such a detailed data model in the face of the amount of change over the lifetime of a typical facility proved to be a large effort. The third reason is more technical. In the typical STEP work process, the manner of creating an AP involved navigating a number of levels of modeling. The process started with defining the activities and information flows the AP is intended to support. (This is the principal reason the PISTEP and PIEBASE Activity Models, discussed previously, were created.) Then it defines the requirements using a more or less generic data model called the Application Requirements Model (ARM),8 refines it further to a more detailed data model, and then maps the requirements to a set of standard components that are interpreted in the AP. Are you confused yet? It worked good in theory, but in practice it was difficult to understand and proved to be quite cumbersome. Development of ISO 15926 Perhaps the best way to describe the development of ISO 15926 is to describe the development of AP-221 in a bit more detail. Figure 2.15 shows some of what we have described before, with a new standard—the EPISTLE Core Model (ECM)—leading to ISO 15926 Part 2. In reality, AP-221 is closely entwined with the ECM. 8. Remember this term. We will refer to it shortly. Chapter 2 63 ISO TC184 SC4 European Union ISO TC184 SC4 LEGEND Sponsoring organization Consortium which no longer exists 1980 ESPRIT 1980 Consortium which still exists International standard Part of a standard Japan STEP 1990 USPI-NL ISO 10303 ProcessBase PISTEP Activity Model EPISTLE PIPPIN STEPlib PIEBASE 1990 POSC Caesar Assoc’n PISTEP RDL EPISTLE Core Model B C/D E Snapshot Series 2000 ISO 15926 2000 PIEBASE Activity Model Part 2 Part 4 AP-221 Related to Annex M 2010 2010 Fig. 2.15 Development of AP-221 and ISO 15926. If you follow the relationships shown in the diagram, you will see that all of the consortia shown had a hand in developing some aspect of AP-221. We have previously described how the EU, through its ESPRIT program, launched ProcessBase—which initiated AP-221. When ProcessBase wound down in 1995, members of EPISTLE continued its development. PIEBASE, the executive body coordinating STEP activity, extended the PISTEP Activity Model into the PIEBASE Activity Model—which was used in AP-221. The Japanese STEP consortia were involved through their participation in PIEBASE. When PIPPEN started in 1996, its purpose was to use AP-221 in a real project. It accomplished this through working closely with EPISTLE on the development of what came to be called the EPISTLE Core Model, and by using a version of the ECM in one of the Snapshot projects. The EPISTLE Core Model and the Snapshot Series An unfortunate casualty of legibility in Figure 2.15 is the way the ECM is shown as being separate from AP-221. In fact, the first version of AP-221 (published in 1997) was identical to the ECM. The ECM, which covered the information requirement for the life of the entire process plant, was a key deliverable of EPISTLE. It was called the ECM because it did not depend on any one system or use proprietary functions. Because of EPISTLE’s close association with the developers of AP-221, the ECM borrowed a significant part of its data model from AP-221. Chapter 2 64 EPISTLE’s idea was that if you break data down into its smallest pieces you have the best opportunity for reuse. (In technical language, it was highly normalized.) Thus, instead of the database tables having many columns they had only a few columns but were used in combination with each other. We have said that EPISTLE developed a reference library, which it called STEPlib, and which was used as the basis for Annex M of AP-221. At some point in the development of STEPlib, PCA decided to reserve its own work for just its own members—and for awhile STEPlib and the PCA RDL forked. Eventually, however, the two organizations combined them back into one and the merged RDL eventually became Part 4. A significant part of the evolution of ISO 15926 is the Snapshot series. As the ECM, STEPlib, and the PCA RDL evolved, they were used on real world-scale projects over a period of a few years. As each of the projects was launched, PCA took the latest thought on data models and the latest version of its RDL and merged them into a snapshot, starting at A and running to E. The lessons learned from each project were fed back for improvements in the next. The diagram implies a direct connection that may not have actually existed. However, because many of the same people worked in many capacities there would have been some crossover. There were some key differences between the Snapshot series and the ECM. The ECM was highly normalized and used multiple inheritance. Due to the perceived inability of the thencurrent technology to support this, the data models of the Snapshots were less normalized and did not support multiple inheritance. What the creators of ISO 15926 did (in the jargon of professional data modelers) was to use the ARM of AP-221. Thus, the data model was much more generic—with much of the intelligence pushed into the RDL. The ISO 15926 data model incorporates 201 entity types and reuses them many times to represent information about plant objects. Think of it this way: instead of defining the “perfect” data sheet for an instrument, ISO 15926 uses a generic pattern for each attribute of the instrument. The sum of the combined attributes is the representation of the entire instrument. When information is exchanged using the ISO 15926 protocols, the receiving system looks for patterns. This allows you to do what might be called “just-in-time modeling.” You model what you know now, and model later what you discover later. The model itself can evolve and recover from earlier errors. In this way, computer systems that use data sheets that appear to humans to be wildly different can still communicate without being told explicitly about each other. Because of the marked differences from the approach of the other STEP APs, the general consensus was that the best path forward was to create a new standard instead of work the new approach back into STEP. By 2000, the EPISTLE Core Model had been formally approved as Part 2—whereas the combined STEPlib and PCA RDL became Part 4. Part 3 (for geometry), first published about the same time, is based on STEP Part 42. We will describe the most significant parts of ISO 15926 in Chapter 3. Figure 2.16 brings in the remaining parts of ISO 15926 and shows a new player, Fiatech. Chapter 2 65 ISO TC184 SC4 European Union ISO TC184 SC4 LEGEND University of Texas Sponsoring organization Consortium which no longer exists 1980 ESPRIT 1980 Consortium which still exists CII International standard Part of a standard PCA/Fiatech project Japan STEP 1990 USPI-NL ISO 10303 ProcessBase PISTEP PISTEP Activity Model EPISTLE Part 42 PIPPIN STEPlib PIEBASE RDL 1990 POSC Caesar Assoc’n EPISTLE Core Model B C/D E Snapshot Series 2000 Related to 2000 Part 4 Initial Part 7 Part 7 Annex M Part 8 Part 9 2010 Fiatech Part 3 Basis For AP-221 ISO 15926 Part 2 PIEBASE Activity Model IDS/ADI Part 10 2010 Part 11 Fig. 2.16 Development of ISO 15926. Fiatech This brings us to the twenty-first century, to an organization called Fiatech. Its purpose is to increase the productivity in the capital projects industry by introducing technology to engineering and construction work processes. In this definition, capital projects industry includes all manner of capital construction—from roads and sewers, commercial buildings and shopping centers, ships, and pipelines to manufacturing and industrial plants. Fiatech was founded in 2000 as a collaborative effort between NIST and the Construction Industry Institute (CII), a research center in the College of Engineering at the University of Texas at Austin. At the time, CII had more than 100 members representing owner-operators, contractors, and suppliers from the engineering and construction industry—as well as more than 30 universities. The mission of the CII is to publish information on best practices in the U.S. construction industry. Today, Fiatech’s membership roster includes many of the largest owner-operators, EPC contractors, construction contractors, equipment manufacturers, software developers, and universities. One of the most compelling reasons for any of these organizations to join Fiatech is that because Fiatech is part of a university they can collaborate without breaking antitrust legislation. Outside Fiatech they are competitors and must guard what they say to each other. But on a Fiatech project top experts in a field can work together across organizational boundaries. For a portion of the development costs, all members can participate in all results. Chapter 2 66 For planning, Fiatech developed what it calls its Capital Projects Technology Roadmap—with nine elements depicting a completely integrated structure to accelerate the deployment of emerging technologies to drive productivity in the capital projects industry. Of these elements, ISO 15926 is part of number nine, Lifecycle Data Management & Information Integration. Until that time, ISO 15926 had been seen as applying predominantly to processing plants. With Fiatech’s endorsement, it was seen to apply also to all capital projects. 2005 Process Industry Data Integration Workshop In 2005, a workshop was held at the Wilmington, Delaware, offices of a large owner-operator—with many of the key players in plant data standardization in attendance. A strategic agreement was reached, later known as the Wilmington Agreement. The participants agreed that all would cooperate in achieving a single common RDL for the industry, which was ISO 15926-4, and work toward formal approval by ISO. Agreement was also reached regarding the implementation of a working reference data service (RDS) to support a project. IDS-ADI Project In 2009, both PCA and FIATCH (by coincidence) launched projects to promote ISO 15926. PCA called its project Intelligent Data Sets (IDS), whereas Fiatech called its project Accelerating the Deployment of ISO 15926 (ADI). Both came together under combined leadership with the purpose of demonstrating the capabilities of using the full specification of ISO 15926. Under their joint oversight, many subprojects have been sponsored to develop and demonstrate the use of various parts of ISO 15926. Summary This has been a long chapter. We started with CAD file exchange, described a number of STEP APs, finally ending up with the Part 4 RDL—which is used in industry as a common set of definitions for plant objects—and the Part 2 data model, which will be used by a new generation of information exchange that embeds meaning into data to make information exchange easier and more reliable. Some of the research we have described has led to dead ends, and many of the STEP APs are no longer used. This does not mean they were a waste of time. Sometimes the things that do not work teach us as much as things that do work. If we knew what we were doing, it wouldn’t be called research, would it? —Albert Einstein However, our intention is not to make you into an amateur historian but to understand the progression of ideas regarding information exchange in industry. We summarize them with the graph in Figure 2.17. In the late 1970s, soon after the advent of CAD, we as an industry grew tired of having to redo drawings in order to move them from one system to another. We thought that if we could only have a way to import the drawings directly from one system to another all of our data exchange problems would be solved. We got IGES, and other CAD exchange standards, but quickly found that although this solves some problems there is often more to drawings than, well, drawings. Chapter 2 67 If only we could exchange CAD drawings. 1980 If only we all used the same data model. 1990 If only we all used the same Reference Data Library 2000 If only we could embed meaning into our information exchanges 2010 Fig. 2.17 Progression of information exchange ideas. Any who have worked in an engineering environment will know that there is more than one CAD application in common use, and that on a large project all business partners do not always use the same one. Sometimes, all an engineer needs is some way to open in his system a drawing authored in another system. If that is all that is truly needed, a simple translation tools is all that should be used. Although IGES is no longer used, the market demand has been met with several commercial tools. But in a manufacturing environment sometimes what appears as a drawing is a collection of machine tool instructions for making a part, with the drawing more or less simply the human interface. In these situations, converting the “drawing” using a CAD translator tool would loose almost all of the value. By the mid 1980s, we thought that if only we all used the same data model all of our data exchange problems would be solved. We got STEP, with its many and varied APs. But as with IGES we found that although STEP solved a certain set of problems in a process plant environment a fixed data model is too cumbersome to keep up with a degree of change measured in decades. Our research led to the idea of separating the monolithic data model into a more generic data model consisting of smaller reusable pieces combined with an external RDL. By the mid 1990s, we thought that if only we all used the same definitions of terms (that is, the same RDL) all of our data exchange problems would be solved. We got a common RDL in the form of ISO 15926-4, and (as with STEP and IGES) we found that a common RDL does indeed solve some problems. Sometimes all you need to do is translate some data authored in one system into a form that Chapter 2 68 can be read by another. A translator based on the common dictionary of Part 4 is what you need. As well, there are some situations in which all you need is a common naming convention—anything will do. You can make your own, but if one already exists in the form of Part 4 you can save yourself some significant work. This is all well and good if you have the luxury of time in which to perform brute force data mapping in order to be able to exchange some data. But the pace of business is increasing and we would like to be able to exchange information without having to know a great deal about the systems our business partners use. For this we will need to embed meaning into data, so that when your business partners receive your information their systems will simply be able to figure it out. Since the mid 2000s, the research has been focused on just how to embed such meaning into a data exchange. The principle means of doing this is Part 7 templates. The next obvious question is: “How does it all work?” That is the subject of the next chapter. Chapter 2 69 CHAPTER 3: HOW DOES ISO 15926 WORK? The full title of ISO 15926 implies a very ambitious goal, encompassing information about every plant object from conception, engineering, construction, and operations. Industrial Automation Systems and Integration: Integration of Life-cycle Data for Process Plants, Including Oil and Gas Production Facilities Integration is a word you will hear many times in discussions of ISO 15926, along with the word interoperability. Most of us will have heard both words many times and will naturally think we know what they mean. They are related, of course, and are often used (incorrectly) as synonyms. Both terms include subcategories of related concepts, and there is disagreement even among experts as to the terms applied to these concepts. Part of the problem is that when you dig below the surface of the commonly used examples you will not find a sharp point where one word stops and the other starts, but a continuum with a fair degree of overlap. Fortunately, this is only an introduction to ISO 15926; we can understand the basic concepts of ISO 15926 without having a precise definition of either term. We will therefore approach the question with a few examples and then describe what ISO 15926 actually intends to do. Integration Versus Interoperability If you search for the definitions of integration and interoperability as applied to the field of computer science, you will find that the definitions cover a wide range but with a clustering of ideas around each term. There is wide agreement that integration means that two different applications are made to work together seamlessly. Along with this idea, there is a strong implication that the integration of the two applications is done specifically with that end in mind, and with the specific identity of the two applications in mind. Thus, we can think of integrating application A with application B by having their respective developers collaborate to “make them work together” (whatever that might entail). Interoperability is usually associated with two applications being able to “work together” (whatever that means) by virtue of each, independently, following an outside standard. In the end they may be able to work together just as well as two integrated applications, but because the “working together” was achieved by each implementing an outside standard we call it “interoperability.” (A bonus is that both applications can also work with every other similarly enabled application.) Let’s look at a simple example. Example 1: Headsets and Handsets Figure 3.1 shows two manufacturers—ACME (a maker of telephone headsets) and EMCA (a maker of telephone handsets)—that wish to make their products work together. If the two companies were to collaborate to design a physical plug that connects the handsets to the headsets, it would be an example of integration (that is, they use a custom-designed physical Chapter 3 70 plug to integrate their products). But if they independently used the Bluetooth standard (with the intention that their products would work with all other Bluetooth-enabled devices as well), we would call it interoperability. Fig. 3.1 Integration versus interoperability. A common mistake is to assume that the Bluetooth part of this example is what defines interoperability. Not so. We can imagine the reverse situation, in which the telephone industry has standardized on a certain physical plug that allows any headset to work with any handset. In this case, we would call all of the handsets and headsets with this particular plug interoperable. Within this situation, we could further imagine two (ACME and EMCA) of the many manufacturers getting together to connect their respective products using a new wireless technology. If they achieved this wireless “working together” by designing their own wireless protocol, it would be considered integration. However, remember the title of ISO 15926: Industrial Automation Systems and Integration: Integration of… If we were to apply this definition of integration, it would imply that if we wanted to “integrate” two applications using the ISO 15926 protocols we would have to do some custom work to each application specifically to make them work together. But the vision of ISO 15926 is that two applications can work together by virtue of each, independently, implementing the ISO 15926 standard. This sounds more like our definition of interoperability. Have we contradicted ourselves? We can start to develop an answer to the dilemma by subdividing the definition of integration into application integration (achieving integration by writing custom application code) and data integration (achieving integration by somehow transforming the data). To see how this works, let’s look at another example. Chapter 3 71 Example 2: Integration and Interoperability with Engineering Design Systems Figure 3.2 shows two engineering design systems offered by two competing software vendors, ACME and EMCA. Each suite of software offers intelligent, three dimensional (3D) modeling for several engineering disciplines—including piping, equipment, raceway, and structural steel and concrete. Each suite also offers intelligent process and instrumentation diagrams (P&IDs), electrical and instrumentation schematics, and data sheets. Within each vendor’s suite it is easy to imagine that the applications would be integrated, but each application in a suite is probably not interoperable with the respective application of the other suite. Raceway Raceway Equipment Piping Structural ACME Engineering System Equipment P&ID Data Sheets Schematics Piping Structural EMCA Engineering System Data Sheets Schematics Integrated P&ID Integrated Interoperable Fig. 3.2 Integration and interoperability in engineering design systems. Within a suite, we would call the applications integrated if information created by one application is available to the applications of another discipline. For instance, a typical integration is for the 3D piping application to be able to request line numbers from the P&ID application. This is an example of integration at the data level, and it can be achieved in a number of ways. The first is point-to-point mapping, whereby (in this example) the piping application is taught how to query the P&ID database and bring back something it can make sense of. Figure 3.3 shows integration between applications by point-to-point mapping. Each application pulls in data from another application, modifies it with an adapter (what we have previously called a data map), and then imports it into its own database. This may be practical for a small number of applications developed by one organization and marketed as a suite, but if the vision is that any application can talk to any other application using the ISO 15926 protocols this is not what we are after. Chapter 3 72 A Procurement A A Construction A A A Adapter A A Engineering Operations Fig. 3.3 Point-to-point data integration. Figure 3.4 shows a second example of integration that solves the point-to-point mapping issues by converting all data flows to a common, neutral, format and storing them in a data warehouse (or an enterprise service bus or asset hub) specially designed to accommodate the information from all of the individual applications. These applications can now exchange information indirectly using the data warehouse for intermediate storage. Engineering Procurement Construction Operations Adapter A A A A Service Bus or Asset Hub (Data Warehouse) Fig. 3.4 Integration using a data warehouse. Each application looks at the data warehouse though the “glasses” of its adapter. By the time information reaches the application, it is structured to look just like that application’s own data structure. When an application publishes information, it publishes it in its own structure to its own adapter—and the adapter changes it to the structure of the data warehouse. Each application thinks that it is the only thing in the world. This is a perfectly good solution provided that you want a data warehouse. There are many very good reasons for wanting a data warehouse, but if your goal is simply to be able to exchange information among a number of applications you don’t actually need the data warehouse! Figure 3.5 shows what this would look like without the data warehouse. Chapter 3 73 Engineering Procurement Construction Operations ISO 15926 Adapter Information Exchange to the ISO 15926 Standard Fig. 3.5 Integration without a data warehouse. In this arrangement, all of the applications can exchange information with each other using messages structured in a common neutral format. By communicating using the ISO 15926 protocols, new applications can be added to the federation without having to modify any of them. Thus, this is what the word integration means in the title of ISO 15926. By using an ISO 15926 adapter, we achieve integration at the data level without having to modify the applications in any way.1 But because the applications each, independently, follow an external standard they are interoperable. The Parts of ISO 15926 Are Like the Parts of Human Speech The way ISO 15926 achieves the interoperability we have just described is similar to the way people communicate. Within a human language, there is a common dictionary of terms, a common way of structuring sentences, and a common way of putting sentences together. Figure 3.6 shows how ISO 15926 is divided into a number of parts, each of which is similar to an aspect of natural language. 1. Truth in advertising: You have to modify each application to be able to exchange information with the adapter, but you only have to do this once. Chapter 3 74 Fig. 3.6 The parts of ISO 15926. Part 2: Data Model Part 2 is like the basic rules of grammar in a natural language. When we grow up we naturally learn to speak the language of our parents. This happens so naturally we don’t find out how complicated communication actually is until we try to learn a different language. Easiest to learn is the new lexicon. For instance, if you are a native English speaker and wanted to learn French you could open an English-French dictionary and find out that the French word for couch is canapé and the French word for grass is herbe. More difficult are the rules of grammar and the exceptions to the rules. For instance, if your wife (also a native English speaker) wanted to tell you in French to “Get your lazy behind off the couch and cut the grass!” she probably shouldn’t just try a word-by-word translation. If she did, she would run into two problems. First, what does the word behind mean? In English, in this context it is a slang word that means the part of your body in contact with the couch when you are sitting on it. But in English behind can also mean “toward the rear,” “slow,” and in some cases “hidden.” Simply looking up the word behind in a dictionary may not get you the correct French word to convey the same meaning. Second, the order in which we form words into sentences in English is often different from the correct order in French. In the field of linguistics, this is called word order typology—and it can get tricky. For instance, word order in English is what linguists call SVO (or subject-verbobject), which means that most of the time native English speakers will say something like “I went home.” In other languages, the most common way of saying the same thing might be “I home went” (SOV), “Went I home” (VSO), “Went home I” (VOS), “Home I went” (OSV), or “Home went I” (OVS). The problem with rules about word order typology is the phrase “most of the time.” If Chapter 3 75 you, dear reader, are a native English speaker you have probably heard several of these combinations in conversation. “I went home” is by far the most common word order, but many of the others are used occasionally to convey subtly different meanings. As it turns out, French is also SVO most of the time—but has a number of exceptions. Thus, if your wife got out her English-French dictionary and translated “Get your lazy behind off the couch and cut the grass!” word for word she would get: Obtenir votre paresseux derrière off l’ canapé et tondre l’ herbe! But if she were to enter the entire English sentence into popular translation software, she might get: Obtenez votre derrière paresseux sur le divan et couper l’herbe! This last translation is interesting because a word-for-word translation back to English comes out: Get your behind lazy on the sofa and cut the grass. But a friend who grew up in Quebec, Canada, speaking French from birth says that his wife would be a little less formal. Lève ton derrière du fauteuil et file faire le gazon. (We won’t write the English translation; suffice it to say that it is a playful insult to the posterior of someone lazy enough to lie around reading the newspaper when there is work to be done!) So now we have an example where using a dictionary to translate something into another language does not yield something that sounds natural in the other language. The same sort of thing is true with databases developed for different applications. When we humans speak to each other there is usually at least a little bit of shared background so that we don’t have to explain every term from first principles. And if we make a mistake understanding the other person, the mistake will usually become apparent during the conversation and one or the other will ask a question or two. But machines don’t have a shared background and have no way of suspecting something has been misunderstood. For instance, to reuse an example above, “I went home” is in English and thus the correct word order typology is SVO. It would be (relatively) easy to program a computer to change this to “Went I home” (VSO) if it were translating it to Hawaiian. But if the programmer made a mistake and used the rules for VOS, as in Fijian or Malagasy, it would come out “Went home I.” Here, the computer—still thinking you were trying to translate the sentence to Hawaiian— would assume that the subject was “home” (instead of “I”) and the object was “I” (instead of “home”) and would simply write the incorrect values to the database. Just as there can be ambiguity in human speech, there can be ambiguity in data models. For instance, Figure 3.7 is a snippet of a database that contains some information about a person named Joe Bloggs. Here we see a number of skills and languages, but we can’t tell what it really means just by looking at it. Taken literally, this database means that Joe can groom dogs Chapter 3 76 in French, cook in German, type in Greek, and can do nothing at all in Spanish. Name Skill Language Bloggs, Joe groom dogs French Bloggs, Joe cook German Bloggs, Joe type Greek Bloggs, Joe Spanish Fig. 3.7 The skills and languages of Joe Bloggs, Part I. “Baloney!” you say. “Common sense says that Bloggs can groom dogs, cook, type, and in addition has some proficiency with French, German, Greek, and Spanish—with the blank simply being a place to record the next skill he learns!” But note that we jumped to the second interpretation because we know that entities that have attributes with the names NAME, SKILL, and LANGUAGE are likely human—and being human ourselves we know that the skills listed do not generally depend on language. But machines will not jump to that conclusion without being explicitly told to do so. To a machine, the first interpretation is the most logical. If you have trouble with this, consider Figure 3.8 (information in an identical database). Name Skill Language Jabberwok burble Whiffling Fig. 3.8 The skills and languages of the Jabberwok. No one knows precisely what the Jabberwok is so we do not jump to a conclusion of what burble or Whiffling means.2 If we said “The Jabberwok can burble in Whiffling,” it would sound plausible. If the database of Joe Bloggs’ skills and languages was intended only for one application, the developer could easily overcome any ambiguity in the database by artful code. But if this application ever had to exchange information with another application, the ambiguity would have to be removed. In this case, it is quite simple. If the correct interpretation is that skills and languages are unrelated, as our “common sense” tells us, the data should be structured as shown in Figure 3.9. Name Skill Name Language Bloggs, Joe groom dogs Bloggs, Joe French Bloggs, Joe cook Bloggs, Joe German Bloggs, Joe type Bloggs, Joe Greek Bloggs, Joe Spanish Fig. 3.9 The skills and languages of Joe Bloggs, Part II. So this is what we meant when we said earlier that when we use ISO 15926 protocols to exchange information we embed the context of the data into the data itself. We structure the 2. Except, perhaps, those who follow the work of Terry Gilliam. Chapter 3 77 data so that the “obvious” explanation is the correct one. Conceptual Data Model Fortunately, even if there is some inherent ambiguity in your database structure there are ways to make it appear to outside applications that you have removed the ambiguity without having to actually modify the database. To describe how this works, we need to take a detour and examine two new terms: application data model and conceptual data model. Every application has a view of the world that is derived from its business purpose. Each application follows certain business rules, which can change over time. All of this is encapsulated in an application data model, which can be unique to each application. If we want to exchange information between these applications, we need to develop a data model that supports all of the application data models. We call this a conceptual data model. Figure 3.10 shows how a conceptual data model connects many different application data models. Application Data Models Engineering Procurement Construction Conceptual Data Model Operations External Level Conceptual Level Fig. 3.10 Conceptual data model. At the external level, we have many different models (or views)—each with a limited scope relative to the conceptual data model. Each will have needs different from any other, and each will be optimized for a particular purpose. The scope of these external models may overlap, and they may or may not be compatible with one another. For instance, an engineering application built on the engineering data model and a procurement application built on the procurement data model may perceive instrument tag numbers differently. A procurement application may deal with a tag number as a single string of text (say, 34-PI325) and store it as such in one database field, including the dashes. On the other hand, an engineering application may manage the tag number by its parts and store each in a separate field without dashes. Humans may be able to look at the two database structures (or schemas) and understand the relationship, but a computer program would not be able to do so on its own. The conceptual model is neutral to the external models and must be able to support all of them. It should be independent of business rules or things that change. The conceptual model is what integrates information from the different external models. For instance, the conceptual model would have to be able to deal with all of the different ways of dealing with tag numbers. Most likely, the conceptual data model would have a place for each individual part of a tag number. In turn, the tag number in each application data model would be assembled Chapter 3 78 by pulling in the individual parts as required and concatenating them together in the correct order (with any necessary spaces and dashes) to produce the format the application required. It is important to note here that the conceptual data model simply says how the data should be structured; it does not imply any particular method of storage or exchange. It can be implemented as an actual data warehouse or as information exchange files, as shown in Figure 3.11, or in any other way technology allows. Engineering Procurement Construction Operations Application Data Models A A A A Conceptual Data Model Service Bus or Asset Hub (Data Warehouse) Engineering Procurement Construction Operations Application Data Models Conceptual Data Model Information Exchange to the ISO 15926 Standard Fig. 3.11 Conceptual data model. Thus (going back to Joe Bloggs), the database structure shown in Figure 3.7 could work as an application data model provided the software developer knew how to handle the apparent inconsistencies. But the special knowledge of how to handle the information would be lost if another application received the data in that form. The solution is to transform the information to a conceptual data model that has a more easily understood data structure. There are two ways to do this. One is to rewrite the application so that it uses the desired data structure. This is fine if you actually want to rewrite the application for other reasons. But if all you want to do is exchange information with another application an easier way is to use an adapter to transform the output. Part 2 of ISO 15926 describes just such a conceptual data model that can be used for the representation of information about objects used in capital projects. This representation is specified by a generic conceptual data model designed to be used in conjunction with reference data, which we will discuss next. The model can support all disciplines and life-cycle stages, and can support information about functional requirements, physical solutions, types of objects, and individual objects and activities. Chapter 3 79 To review, Part 2 defines the rules and constraints (more or less the grammar used throughout) on how we use ISO 15926. It is an ontology with basic axioms such as thing, class, and individual at the top level. At a lower lever, it includes subtypes of these axiomatic concepts— such as physical object and connections. Part 2 is very specialized and requires a fair amount of work to understand. Fortunately, most organizations will only have to deal with templates, described in Part 7, which are sort of like preconfigured definitions that point to objects in Part 2. If you ever run across a copy of Part 2, we recommend that you read Section 4 because it gives examples of how the entity types can be used to express certain information. Part 4: Initial Reference Data Part 4 is like the definitions of terms in a natural language; similar to a dictionary or thesaurus. Earlier we said that when we use ISO 15926 to exchange information part of the meaning of the data comes from its structure and part comes from reference data. Reference data is essentially definitions of terms that represent information common to industry. This means that if the meaning of your data is inherent in its structure, and if the definitions of objects come from a common dictionary, computer systems will be able to infer meaning without requiring human interpretation. Building a Wooden Swing: The Importance of Reference Data in Simplifying Communication Effective communication requires that all parties share a common set of definitions. In computer science, this is called a reference data library (RDL; that is, a library of reference data).3 Whether we know it or not, we use an RDL every time we talk to others. For instance, if you and a co-worker did not share the same RDL conversation around the coffee machine on Monday morning would be very complicated: You: “So, what did you do on the weekend?” Co-worker: “You know how they use a round circular metal thing with sharp points to slice trees up into long pieces? Well, I took some of those sliced-up trees and some braided nylon that comes in long coils and built a thing for the small little offspring of my wife and I so I could push them in a manner that they would go back and forth in a circular arc that starts near the ground but gets up to three meters off the ground before they start coming back down.” Wouldn’t it be much easier if your co-worker could just say “I built my kids a wooden swing”? But this would be impossible if you and your co-worker did not share the same definition of kids, wooden, and swing, not to mention built and my. (Remember this dialog. We will refer back to it several times later in this chapter.) 3. Another name for “Reference Data Library” is “Class Library”. We don’t use that name for the core classes of ISO 15926 Part 4 because it contains more than just classes of objects. Chapter 3 80 Where it gets complicated is differentiating between variations of a term. Basic terminology is easy. For instance, pressure and temperature are easy to tell apart. But in an industrial environment we seldom use the words pressure and temperature by themselves. Pressure can mean the pressure a vessel is operating at right now; its assumed design pressure; its minimum hydrostatic test pressure (which may be greater than its operating pressure to compensate for a lower testing temperature); the maximum pressure it is allowed to keep working at for an assumed lifespan of, say, 30 years; or the pressure at which a pressure-relieving device will open. To tell the terms apart, we must add adjectives to make longer and longer strings of words. But even if we come to agreement on the meaning of all of the adjectives, there is still opportunity for ambiguity. For instance, what is the difference between “maximum allowable working pressure” and “pressure-maximum-allowable-working”? For human beings, the answer is probably “Nothing; they are the same.” But when computer systems have to communicate without human intervention the two are not the same. (And we haven’t even talked about exchanging information when all of the people involved do not speak the same language.) Uniform Resource Identifier To get around this potential for ambiguity, the ISO 15926 RDL assigns a unique identifier to each definition so that the identifier can be used like a serial number for the definition. For example, if you were to look up temperature in the RDS/WIP maintained by the POSC Caesar Association (PCA), you would find something like the entry shown in Figure 3.12. All of the attributes in the left-hand column are familiar except the first. The unique identifier is called a uniform resource identifier (URI). A URI is sort of like a web page address that returns a definition instead of a web page. Every term, or class, in the ISO 15926 RDL has its own URI. Instead of having to describe the attribute, and get all of the adjectives in the correct order, an engineer only has to refer to the URI. RDS/WIP URI http://rdl.rdlfacade.org/data#R41192093771 Label TEMPERATURE Description The degree or intensity of heat or cold as measured on a thermometric scale, and a measure of whether two systems are relatively hot or cold with respect to one another. Two systems brought into contact will, after sufficient time, be in thermal equilibrium and will have the same temperature. Entity Type http://dm.rdlfacade.org/data#SinglePropertyDimension Fig. 3.12 RDS/WIP entry. For example, let’s say that two applications wanted to exchange information about temperature. Let’s also say that the first application stored the value in a column labeled “TEMP,” whereas the second application stored the same value in a column labeled “Temperature 24.” The following tables show what the entries in the database maps would look like. The export map from the first application would look as follows. Column Name RDS/WIP URI TEMP http://rdl.rdlfacade.org/data#R41192093771 Chapter 3 81 The import map into the second application would look as follows. Column Name RDS/WIP URI Temperature 24 http://rdl.rdlfacade.org/data#R41192093771 The first application is saying, “Here comes some values for the attribute R41192093771.” The second application runs down its map and says, “Great. R41192093771 translates to Temperature 24 over here.” As part of the data exchange, each application will resolve the URI as a quality control measure to make sure it is a valid term. These two examples show the importance of using reference data to reduce the volume of information that has to be exchanged, and to eliminate the opportunity for error. In the first example, of building a wooden swing, when we could use commonly understood definitions (which are, in essence, reference data) we reduced a 91-word explanation to 7 words. In the second example, being able to refer to the serial number of a particular attribute we totally removed the possibility of mistaking the attribute for a similar one. Reference Individuals In addition to the core classes, the core RDL contains what we call “reference individuals.” Normally, individual objects are contained in project databases where something like 37-P101A has only a local meaning. But some objects are important enough that we want to differentiate them from all other objects in the world. For instance, an RDL devoted to geography might have a generic class called “politically bounded object”—which would be used to describe an entire country, the provinces within the country, and the cities within each province. In addition to that general class, it may also contain individuals (or instances) of all of the existing countries in the world, their provinces, and their cities. Such instances might include England (a country), London (an important city in England), and perhaps Winston Churchill (one of England’s prime ministers). Users could make reference to these individuals in order to remove ambiguity.4 A Data Exchange Can Use Many RDLs One way to use an RDL is to copy the definitions of everything you will need to use into a single RDL. But this would result in the unnecessary duplication of information created by others. ISO 15926 allows the use of a federation of data libraries. Internally, this lets an organization keep separate RDLs—one for proprietary information and one that is published. And once you have made the leap from a single RDL, why stop at two? It is easy to conceive of a situation in which different authorities around the world will become respected repositories of certain categories of information. There is no reason they could not publish their information in the form of an ISO 15926–compliant RDL. Figure 3.13 shows a number of worldwide standards associations. Each of them publishes 4. For instance, if you mentioned “The city of London” in certain parts of eastern Canada, people would assume you mean London, Ontario instead of London, England. Chapter 3 82 many engineering standards for various parts of the capital projects industry within its jurisdiction. For instance, China has its own national standards that are called Guobiao (GB), which literally means national standard. GB standards must be used for all facilities built in China. Facilities in Australia, on the other hand, must use a combination of Australian standards (AS) and ASME standards. Currently, if an organization wanted to use all of these standards as reference data it would have no choice but to copy them into its own RDL. Fig. 3.13 Many standards. But the vision of ISO 15926 is that all of the organizations will put their standards into their own RDL and make them publicly available for other organizations to use as reference data. What we end up with is something like Figure 3.14. Here several organizations that are collaborating on a project have made their reference data available to the other members. An owner, some suppliers, some EPC contractors, and some standards organizations have all published information in an ISO 15926 RDL. Any member of the consortium can use information from these RDLs, along with the core classes published in Part 4. Chapter 3 83 Fig. 3.14 Cloud of federated RDLs. To review, when two people use the same terminology (i.e., use the same dictionary) and use the terminology in the same way (i.e., use the same rules of grammar) they can communicate freely. The same is true for machines. When two computer applications use the same terminology (i.e., use the same RDL) and structure their data in the same way (i.e., use the same data model), they can communicate freely as well. The core of ISO 15926, then, is the data model (Part 2) and the RDL (Part 4). These two parts define how the data is to be understood; the rest of the parts define how it is delivered. Part 4 defines the initial set of reference data (or core RDL) for use with ISO 15926 Part 2. The core RDL contains the core classes and reference individuals, which are arranged in an ontology of subtypes (or specializations) of the classes in Part 2. Examples are concepts such as “pump,” “reducer,” and “heat exchanger.” International standards bodies can create subtypes of the core classes to create the types of objects within the scope of their standards. Currently there are almost 20,000 classes in Part 4. The library is extensible, and thus the number is expected to grow considerably—but there is no intention that it will ever contain all of the terminology for the entire capital projects industry. Part 7: Template Methodology Part 7 templates can be referred to as units of information. They are like phrases in a natural language phrasebook that show a new user how to construct meaningful sentences in an unfamiliar language. Part 7, like Part 2, is written independently of computer languages. Its language is first-order logic, which is math. We have said earlier that the vision of ISO 15926 is that “your computer can talk to my computer and we don’t have to know anything about each other’s systems in advance.” If you are like the author, this sounds magical. The magic comes from using Part 2 (the data model) and Part 4 (the dictionary) together to translate data exchanges into the common language of ISO Chapter 3 84 15926 so that it can be translated by your business partner at the other end. We have said this before, but it bears repeating: if two people know what the words of a particular language mean, and if they know the correct grammar for that language, they can communicate seamlessly. Similarly, when two applications have a common dictionary and use a common data model they can communicate seamlessly. Thus, Part 4 (the dictionary) and Part 2 (the data model) are the two most important parts of ISO 15926. The problem is that modeling information at the level of Part 2 is generic and highly specialized. Although Part 2 enables considerable flexibility in what can be modeled, it is very complex. If everyone using ISO 15926 had to understand Part 2, it would be as if you needed to take a course in aeronautical engineering in order to fly on an airplane. But by using part 7 templates engineers can make their information models compliant without having to master Part 2. Part 7 specifies templates that are expressions of predefined units of semantics, allowing the use of a Part 2 data model in a convenient way. Using Part 7 is similar to using a phrase book for another language when you are travelling instead of using a language dictionary to convert each word in your native language to a foreign language. How ISO 15926 Templates Work When engineers start a project, one of the first things they do is create templates of each type of data sheet they expect to use to ensure a common look and feel across the project. For instance, all data sheets for level transmitters will look the same and will have the same type of values in the same place. The reason a common look and feel is important is because human beings are the ones reading the data sheets. Humans rely on visual clues to determine meaning, and we get used to looking for certain (essentially x,y) coordinates for particular values. For instance, “Maximum Allowable Working Pressure” for a particular type of vessel might be something like “1/3 of the way up the page near the right-hand side.” But in the world of ISO 15926 machines are the ones reading the data sheets. To a machine, the physical layout of the printed sheet means nothing. The machine is more interested in the precise definition of the terms. And in this, Part 7 templates function for machines in the same way templates of paper data sheets function for humans. Templates for paper data sheets display information in visual patterns; Part 7 templates use information patterns. The biggest difference between the type of template people are used to and Part 7 templates is that Part 7 templates are for much smaller pieces of information. If you wanted to create a typical equipment data sheet using Part 7 templates in order to transfer all of the information about a piece of equipment, you would use many Part 7 templates and in fact would use many of the Part 7 templates many times. In this way, part 7 templates can be compared to the chemical elements. Looking at the periodic table, you can see that there are 118 elements listed. Yet with just 118 elements the entire universe is made. Every once in awhile we find a new element, but we don’t expect to find a great many more. For an example of an information model, let’s take a look at a married couple (Bill and Joan) taking their dog (Willy) out for a walk after getting home from work. Figure 3.15 shows one way to diagram this.5 5. The “Walking the Dog” diagram is courtesy of Hans Teijgeler. Chapter 3 85 class_of_person ‘woman’ class_of_person ‘man’ class_of_organism classifier classifier classification classifier classification physical_object role_1 classification classified classified classified physical_object other_relation role_2 side_1 ‘dog’ physical_object Indirect_ connection side_2 classifier classification classified other_class_ of_relation ‘marriage’ Fig. 3.15 Walking the dog. This shows that there is the relationship of “marriage” between two physical objects that are classified as persons and identified as “man” and “woman.” There is an indirect relationship between the “man” and a physical object classified as an organism and identified as “dog.” If you have never seen this type of notation before it will probably look very complex. But the reality is that the diagram is not nearly complex enough to capture everything an information modeler would need to know. For example: • Details about the “walking” activity. * The period in time in which they did the walking. * The location in which they did the walking. • Is the marriage happy? • Are they holding hands? • What is the means of the indirect connection? A leash? * What is the composition of the leash? • What is the breed of dog? Friends of Bill and Joan will be able to answer all of these questions because they know the context. But our reason for modeling the information is that Bill and Joan’s friends are not the intended audience, machines are. We want computer programs to be able to understand the information as well as humans do. For instance, what is the time of day? Friends will know that Bill and Joan are lab technicians who work the afternoon shift at the local hospital. If they just got home from work, it is prob- Chapter 3 86 ably 1:00 in the morning. Walking their dog Willy at that time of day might normally be dangerous given the part of town they live in, but Willy is a large Rottweiler. Friends would understand this implicitly, but if you want a machine to understand it you have to embed the entire context into the information. Similarly, when modeling plant information there are a great many people who work with plant objects all day, every day—including engineers, buyers, suppliers, installation contractors, maintenance technicians, operators, and others. These people all have a great deal of knowledge about plant objects because they know the context. However, very few of them will have the motivation or patience to learn information modeling. Part 7 templates allow users to implement Part 2 without having to know Part 2. What a Template Looks Like Understanding the inner workings of a Part 7 template is far beyond the level of any book with “Introduction” in its title.6 However, a quick peek will help to explain how templates can both be simple enough for untrained people to use while being rigorous enough to satisfy the information modeling needs of Part 2. Temperature Range Example Suppose you have chosen a particular model of temperature transmitter for the project you are working on. You will be using it many times, and thus would like to create a class so that it is easier to use. Manufacturer Model No. 3051CG Ambient Temperature –40 to 85 deg C. If you were a data modeler, it would naturally occur to you that your data model would need the terms shown in Figure 3.16. Just as naturally, this would lead to a data model that would look something like that shown in Figure 3.17. Class of Individual Class of Class of Relationship Single Property Dimension Scale Express Real Express Real 3051CG Ambient Temperature Temperature Celcius “-40" “85” Fig. 3.16 Terms in a data model of a temperature range. 6. On the other hand, if you have a strong aversion to labor-intensive, error-prone work and want to remove it from engineering work processes every time you encounter it, learning to model information at the level of Part 2 will lead you to a very fascinating and rewarding career. Chapter 3 87 Property Quantification Upper Bound Of Property Range Classified Temperature 85 °C Input Property Ambient Temperature Classified Classified CO CO Relationship Result CO Individual Classifier Property Space Temperature Range -40°C – 85°C CO Indirect Property ”85 " Pattern Property Range Temperature Celsius Single Property Dimension CO Identification Scale Classifier Classifier ExpressReal Classifier Classifier CO Possessor Represented Classifier Classified 3051 CG 85 Arithmetic Number Classifier ”-40 " Pattern ExpressReal Classified Classified Temperature -40°C Input Property Lower Bound Of Property Range Represented Classified Result -40 Arithmetic Number Property Quantification Fig. 3.17 Data model of ambient temperature range. But none of this would naturally occur to an average engineer. Fortunately, most people will never have to learn how to do this because we can use a generic template for a property range instead. Figure 3.18 shows a template that says “Something” has “Property Type” with “Property Range” of “Base Property Type” defined by “Input 1” and “Input 2” with “UoM.” Hmm…It doesn’t look much simpler. Property Quantification Upper Bound Of Property Range Classified Property Type Input Property Classified Result Classified Pattern Classifier Property Space Property Range UoM Base Property Type Classifier CO Indirect Property CO Identification Classifier Classifier Pattern Classified Classified Lower Bound Of Property Range Property Input 1 Classifier Classifier Class Represented Classifier Classified CO Processor Arithmetic Number Represented Classified Input Input 2 Result Arithmetic Number Property Quantification Fig. 3.18 ISO 15926 property range template. Chapter 3 88 But what if we told you that your only requirement is to fill in the blue boxes with the appropriate information, and that you can treat the other boxes and diamonds and connecters as so much modern art? But it gets simpler yet. If all you have to do is fill in the colored boxes and can ignore the rest, why even show you the rest? Why not just put the boxes into tabular form, as in a spreadsheet? Why not, indeed. Figure 3.19 shows what users will actually have to deal with. Simple, eh? Select RDL Class or Project Data Template ID ABC Select from standard/ customized list of RDL Instance Class Property Type 3051CG Ambient Temperature Select from standard/ customized list of RDL Instance Property Range (Created by the system) Base Property Type UoM Input 1 Input 2 Temperature C -40 85 Fig. 3.19 Property range template inputs. A template is basically a regular pattern of information, like rows and columns in a spreadsheet. The column headers in the spreadsheet are the “roles” of the template. Each row of the spreadsheet is a template instance. Each cell in the row is a value of a role (the column heading). The combination of the role names and role types is called the template signature. A template definition is the generic spreadsheet itself. It defines the name of the template and the roles, and what types of information are valid in those roles. To review, in slightly more technical language Part 7 defines an implementation-independent template methodology built on the Part 2 conceptual model. Part 7 provides template specification requiring template signatures listing roles and types of each argument, and provides an initial set of about 200 templates. These cover a wide range of concepts and are enough to get started. When more are needed, Part 7 also contains instructions for creating new templates. Development of Part 7 was started several years ago. It has since been split into what is now Parts 7 through 10. Part 8: Implementation Methods for the Integration of Distributed Systems – Web Ontology Language (OWL) Representation Part 8 is a method of representing information. In a natural language, it is like paper, a computer file, or even a stone tablet—a particular way of presenting information. It is possible to implement ISO 15926 with spreadsheets, text files, word processor documents, or custom code. However, there is no standard specification for doing it this way. This means that if two companies decided to exchange ISO 15926 information using spreadsheets but developed them independently the first versions of the spreadsheets would probably not be compatible. Some discussion and agreement of terms would still be necessary. Using a natural language metaphor, it would be like two people—one a Cockney from the East Chapter 3 89 End of London, England, and the other a French Canadian—trying to communicate for the first time. Even though both people speak English, the two dialects are sufficiently different that it might take awhile for them to understand each other. Using Part 8 relieves an organization from having to develop an implementation method from scratch, and makes it easier for business partners to implement complementary systems. As a metaphor, we can compare the use of Part 8 to writing a memo and somehow delivering it to a friend. In order to compose the letter in English you would have had to use your knowledge of English words and grammar, which are equivalent to Part 4 and Part 2. Having the message formed in your mind is like Part 7. Putting the message on paper is like Part 8. Sending it in an envelope through postal services is like Part 9. Part 8 is a specification for the way to use two tools developed for the Semantic Web, Web Ontology Language (OWL), and Resource Description Framework (RDF) to implement ISO 15926. Web Ontology Language We have said earlier that Part 2 and Part 4 are arranged in an ontology, with Part 2 consisting of basic concepts and Part 4 consisting of subtypes (or specializations) of these basic concepts. User-created RDLs will consist of subtypes, or specializations, of Part 4 classes. The Web Ontology Language (OWL) was developed as a way of defining and managing ontologies. A number of software packages have been developed to process OWL knowledge directly. OWL was attractive to the developers of ISO 15926 because many of the concepts of Part 2 correspond to specific concepts within OWL (for example, class and relation), which makes the semantics of ISO 15926 easier to implement. When you organize and describe plant objects with an ontology that follows the rules of OWL, machines can follow the rules and make inferences without human intervention. For instance, you could declare that a centrifugal pump has at least one impeller. Therefore, if a pump does not have an impeller it cannot be a centrifugal pump. Resource Description Framework A resource description framework (RDF) is a way of making statements about things, which it calls resources, in the form of subject-predicate-object expressions (known as triples). For instance, in the declaration “My dog is a Tibetan Mastiff” “My dog” is the subject, “is a” is the predicate, and “Tibetan Mastiff” is the object. In addition, RDF notation has the ability to make explicit reference to published definitions through the use of URIs (discussed previously). Thus, instead of using the text string My dog the speaker could make reference to a web page containing a photograph of a particular instance of a dog7—and instead of Tibetan Mastiff a link to the American Kennel Club’s official description of Tibetan Mastiffs.8 An entire capital project can be represented in this form, which will enable machines to read the information and make correct inferences. 7. If the dog were particularly important to the breed, perhaps a male dog that had sired several show winners, it might be included in that breed’s RDL of reference individuals. 8. In this example, the American Kennel Club’s descriptions for the various breeds of dogs are a form of reference data library, with the address of each description being the URI. Chapter 3 90 Part 9: Implementation Methods for the Integration of Distributed Systems – Facade Implementation Part 9 is like exposing messages in an on-line forum. It is open to the Internet, but only certain users have the right to read certain messages. A facade is an outward-facing view of something. In relation to ISO 15926, you can think of this as a user interface for machines built in front of the information you have published with Part 8. Essentially, you can post the information you wish to publish into your facade, setting up appropriate security so that only your business partners can query for information—and so that they can only see what you want them to see. After that, they can access whatever information they need—when they need it. Remember that the information you publish using Part 8 is in the form of an RDF, which is a series of triples collectively is known as a “triple store.” A recommended protocol for querying RDF data defined by Part 9 is SPARQL,9 another W3C standard query language similar in purpose to SQL but intended for triple stores. To extend the metaphor of writing a memo to a friend, using Part 9 is like sending the message using your national postal system. You put the message in an envelope of a certain size, write your fiend’s name and address in a particular spot in a particular manner, and put a particular amount of postage on it. There are laws regarding where the postman can put the letter for delivery and who may open the letter. You also have the option of invoking certain rules by “registering” the envelope for an extra cost, which will guarantee delivery to a real person rather than just to your friend’s mailbox. Putting It All Together Figure 3.20 shows how all of the parts of ISO 15926 work together logically.10 It is important to note that this is not a single database but a federation of many databases. In fact, each of the layers is made up of many databases—each supported by the owner of that data. In Chapter 1 we discussed the idea that information expressed in natural languages is usually not precise enough to preserve the semantics, or meaning, at the level of precision required for information exchange in capital projects. In the Chapter 2 we discussed ontologies as a way of representing information in a manner that preserves meaning. Together, all of the layers of this diagram form an ontology—with each layer being a subtype, or specialization, of the previous layer (which means that the semantics of each object is preserved). For instance, the objects at the top in Part 2 are generic concepts such as thing, class, individual, and property. The objects in Part 4, the core classes, are subtypes (or specializations) of the objects in Part 2. In turn, the user-defined classes are specializations of those in Part 4. Toward the bottom, we get more and more specific until we have individual objects in a project. 9. Pronounced “sparkle”. 10. The pyramid diagram is courtesy of David Leal. It was adapted from a diagram in his paper “Life Cycle Data for a Process Plant: An Overview”. Chapter 3 91 ISO 15926-2 ISO 15926-4 User defined classes Generic engineering concepts, such as physical object, activity, composition, connection Core classes and reference individuals, such as pump, reducer, heat exchanger, Cairo Classes defined by international standards and industry associations Company commodity classes Catalogues from suppliers and manufactuerers Special project classes 26-7 159 ISO ates mpl e Te Cor Project Data Axiomatic concepts such as thing, class, individual, and property Project Individuals such as pump 37-P-101A and pressure guage 37-PI-205 Fig. 3.20 ISO 15926 pyramid. Part 7 provides base templates of standard relationships. Together, the templates can be used like a box of Lego blocks to build the database representations of the complex objects that exist in real life. When we use the templates following the rules in Part 7, we ensure that the resulting information follows the data model in Part 2—which means that the information will in fact be precise enough to preserve the semantics and will integrate with other information developed following the ISO 15926 protocols. In presentations about ISO 15926, you will often see a little pyramid looking something like that shown in Figure 3.21. The little pyramid is intended to represent the entire pyramid shown in Figure 3.20. ISO 15926 Fig. 3.21 ISO 15926 pyramid symbol used in presentations. Chapter 3 92 Other Parts of ISO 15926 Astute readers will note that some part numbers are missing. Appendix A lists them all, with their most recent publication dates. Levels of Compliance with ISO 15926 Versus Ambiguity The purpose of ISO 15926 is to eliminate ambiguity in information so that it can be exchanged and reused easier. Our current work practices require significant manual work because ambiguity is inherent in the processes. Manual work drives up costs and introduces opportunities for error. The more we use ISO 15926, the more we drive out ambiguity and automate information exchanges to drive costs down. Figure 3.22 shows various levels of compliance with ISO 15926 in information transfer, starting at the bottom (with no compliance). The scale is really continuous, but we have shown six discreet levels. The bottom two levels do not involve ISO 15926 but are included to show the progression. Least Ambiguity Shouldn’t we add workflow and security to the semantic web? Least Compliance Greatest Ambiguity If we use the semantic web couldn’t we automate more of this? Ambiguity Scale ISO 15926 Greatest Compliance Would it help if I told you how I was using the data? Why don’t we use industry standard definitions? Wouldn’t it be better if we agreed on common definitions? Let’s just exchange raw data and figure it out together Fig. 3.22 Levels of compliance with ISO 15926. Chapter 3 93 Let’s Just Exchange Raw Data and Figure It Out Together Figure 3.23 shows two companies, ACME and EMCA, which wish to exchange some information contained in databases. At the beginning of the exchange, neither knows anything about the other’s system. • ACME selects the data to be exchanged and exports it to an intermediate file (a “data dump”)—perhaps in a comma delimited text file or spreadsheet. • ACME sends the data dump to EMCA. • EMCA cannot use the information because it does not know what all of it means. • Each company assigns an engineer to collaborate to make a custom map so that EMCA can import the data. • EMCA uses the map to import the data. Fig. 3.23 Figure out each exchange every time. Level of Collaboration Required The two engineers cannot go directly to mapping database columns together. They first have to understand quite a bit about each other’s applications. The dialog might go something like this: “What are we trying to exchange here?” “A purchase order database.” “Do you use a commercial package?” “No, we developed our own.” “Hmm. So did we.” “Okay, how do you build your line items?” “What do you mean, build? We type them in just as you see them on the purchase order.” “Oh. We link directly from the 3D model and roll up identical items on the fly.” Chapter 3 94 “Do you retain any information of the source of the item.” “No. I told you we just type the line items in as you see them.” “Okay. We’ll have to do the rollup for you.” “Do you have any way to track overs or shorts?” “Qu’set que c’est overs and shorts?” “Well, if the supplier short-ships or over-ships, how do you handle this?” “I guess the receiver writes it on his copy of the PO and someone places a phone call.” “Hmm. We record it against the PO and the computer automatically generates a message to the purchasing agent.” And so on. Only after they understand what each other has and what each other needs can they start mapping database columns against database columns. Wooden Swing Metaphor Earlier in this chapter we used a metaphor of a co-worker telling you about building a wooden swing for his kids over the weekend. We compared the explanation he would have had to give you if the two of you did not have a common understanding of basic terminology (91 words long) with a shorter version that depended on a common set of definitions (7 words). If we were to compare the swing metaphor to this case, we would have to make the long version even longer. A more accurate metaphor is that you speak English and your co-worker is just learning it, perhaps being a native of Fiji. Perhaps he has learned a few words, but still thinks in his native language. At some point, he may end up making a little drawing and doing some pantomime. Together, you are more or less making up a list of equivalent words in the context of a wooden swing. This type of thing would be acceptable for a single exchange, but what happens if further ACME and EMCA business ventures require different exchanges, or exchanges between different applications? Wouldn’t It Be Better if We Agreed on Common Definitions? In Figure 3.24, ACME and EMCA exchange information often enough that there is an opportunity to streamline the operation. They do not want to have to consult with each other with every information exchange and start from scratch, so they agree on two things: • They will spend a bit of time to develop the terminology for a neutral information format. • They will not exchange raw data dumps. The sender will translate its information payload into the neutral file format, which the receiver will be able to understand. Chapter 3 95 Fig. 3.24 Common reference data library. This is the beginning of a data dictionary, or RDL. Start by assembling the terminology. • ACME and EMCA collaborate to create the dictionary of database terms and to decide the format of the neutral file.11 • Both ACME and EMCA use the dictionary to create their own database maps. • ACME selects certain data and exports using the database map to translate it to the agreed neutral file format. • EMCA receives the intermediate file and imports it into its systems, using its database map to translate it. Level of Collaboration Required The only difference here from the previous example is that after the two organizations agree on the meaning of certain data items they will be able to use a standard neutral form for the information. They will still have to agree on the meaning of the information. As in the previous case, any new application added to the confederation will still be subject to a detailed examination. If the new application uses new terminology, new terms will have to be added to the dictionary. If the new application uses existing terminology in a slightly different way, revisions to the dictionary may be required. New users may need more explanatory material than a simple list of equivalent terms. Wooden Swing Metaphor Going back to the wooden swing metaphor, it is as if you were writing your own English-Fiji dictionary not having thought of going down to your local bookstore and purchasing one. 11. It is important to agree on 100% common definitions, otherwise there is a tendency to put proprietary information into the neutral file, and without agreement on 100% of the information we are back to the previous case. Chapter 3 96 When new terms are introduced, you will still need a discussion to discover their meaning— and may have to revise the understanding of old terms whenever you discover new concepts. This is a definite improvement, but what happens when the confederation of mapped applications grows? What happens when a third and fourth organization join the confederation? And extending the situation further, what happens when members of different confederations have to exchange information? It is certainly possible to create new maps, and maps between maps, but the labor needed to maintain this will grow exponentially. Sooner or later someone will wonder whether or not there is an industry standard for data exchange. Funny you should ask! The next example shows two organizations exchanging information using ISO 15926-4 (the RDL) using the RDS/WIP Why Don’t We Use Industry Standard Definitions? In the example shown in Figure 3.25, ACME and EMCA wish to use an industry standard dictionary to create the neutral files they use for information exchange. Being able to export information into an industry standard format will allow them to exchange information with other organizations as well. • Both ACME and EMCA collaborate and agree to use the core classes in Part 4, and to use the rules in Part 4 to extend the classes. • Both use the dictionary to create their own database maps. • ACME selects certain data and exports using the database map to translate it to the agreed neutral file format. • ACME validates its data against the core classes in the Part 4 RDL, using the RDS/WIP to make sure it is compliant before sending it out. • EMCA receives the intermediate file and validates it against the RDS/WIP to verify compliance. • EMCA imports the information using its database map to translate the information into something its systems understand. Chapter 3 97 Fig. 3.25 Use ISO 15926-4 for information exchanges. Level of Collaboration Required The major difference here is that instead of consulting a growing proprietary dictionary, all who are involved refer to the public standard. But both organizations will still need to have a discussion about how they will use the core classes and will need to agree on the meaning of new classes. (Recall our earlier discussion of your wife trying to ask you in French to get up off the couch and mow the lawn. A French-English dictionary allowed her to translate the individual words, but they didn’t come out the way a native French speaker would make the same request.) New applications added to the confederation would still be subject to a detailed discussion. And, as before, because a new application may use existing terminology in a slightly different way revisions to the terms used for a particular information exchange may be required. Also, as before, new users may need more explanatory material than a simple list of equivalent terms. Wooden Swing Metaphor To expand the metaphor to include this case, it is as if you have discovered that you can purchase an English-Fijian dictionary. Using a standard dictionary prepared by experts in both languages will make things a bit easier, but will not itself remove all of your communication problems. For instance, when new words come into the morning coffee time conversation they will probably already be in the dictionary—but the two of you will still have to verify that Chapter 3 98 you are using them correctly and will have to discover things like tonal inflection and sentence construction by trial and error. For instance, if your Fijian friend used just the dictionary to translate a few sentences into English it would probably sound funny to you (and vice versa). The two of you would be able to work through it because you know what sounds funny and can ask questions. But a computer does not have the ability, on its own, to recognize ambiguity and work through it. Would It Help If I Told You How I Was Using the Data? Using Part 4 makes it easier to create database maps because both parties no longer have to relearn data definitions each time. The parties only have to collaborate when a new term is required, or if there is a deficiency in an existing definition. But both parties still need a fairly good understanding of the scope of the exchange. For example, using the dictionary will make it easier to identify the database term for the concept “maximum allowable working pressure” However, both parties still need to know that maximum allowable working pressure is required. When we send an inquiry to an instrument vendor for, say, a pressure instrument, the traditional way is to fill in a data sheet. If there were sufficient business with one vendor, we could conceivably configure a bulk database-to-database exchange using custom mapping. If business were sufficient, this might evolve to a custom data dictionary and then to a dictionary using ISO 15926-4. But in all cases the data values that are required and are available still have to be known in advance. The more you have to know in advance about an information exchange, the more time it takes and the more “friction” there is. We would like to be able to communicate with new business partners almost on a whim. What we want to be able to do is simply dump out our data in a manner our business partners can easily figure out on their own. To meet this need, we will still use Part 4 as the data dictionary—and will still use the RDS/WIP to validate the information on each side of the exchange. In addition, we will structure our data according to Part 2. In the example shown in Figure 3.26 ACME wants to be able to send information to EMCA in a manner EMCA can simply use directly or figure out easily. Chapter 3 99 Fig. 3.26 Use templates to make modeling easier. Both ACME and EMCA need to understand how to structure the neutral exchange file using the “grammar” of Part 2 along with the definitions of Part 4. The easiest way to do this is to use the base templates in Part 7 to build the database maps with which you will export (or import) the neutral data exchange file. The basic difference between this method and the one previously described is the work going into making the database maps and the composition of the neutral file. • ACME and EMCA collaborate and agree to use Part 7 (which implies Part 2) and Part 4. • Each organization creates information models, or maps, to transform its internal data structures to something that follows the ISO 15926 protocols. They use the base templates in Part 4 and the methodology in Part 7 to create new templates when required. • ACME selects certain data and exports it using the database map, based on Part 7 templates, to translate it to the agreed neutral file format. As it assembles the neutral file, the template validates the information against the core classes in the Part 4 RDL using the RDS/WIP • ACME sends the neutral file to EMCA. • EMCA receives the neutral file and processes it in reverse. • EMCA uses its database map, based on Part 7 templates, to translate the information into something its systems understand. EMCA’s templates understand how to read the meaning Chapter 3 100 from the structure of the data in the neutral file. Level of Collaboration Required The level of collaboration required in this case is much less than in the previous case. In the previous case, ACME and EMCA had to describe to each other how they were using their data so that each could figure out the equivalent data elements in their own systems. The reason they no longer have to review this person to person is that they are describing how they use the data simply by using the base templates in Part 7. After they implement Part 7, all they will need to know from each other is the general nature of the information to be exchanged. Neither party has to know anything at all about the way each other creates and uses the information because the context of the information is contained within the information itself. The addition of Part 7 adds some initial work to fully understand the landscape of different applications and model them properly, but thereafter the actual data exchange becomes much simpler. Wooden Swing Metaphor This is where ISO 15926 starts to resemble the Babel fish we discussed at the very beginning of this book. When your Fijian co-worker tries to explain the wooden swing, he wouldn’t have to struggle with English—he could just speak his native language and you would understand it just as if he were speaking your particular dialect of English. In addition, if the two of you so desired you could delve into the species of tree used for the lumber, the chemical constituents of the nylon rope, and the grade of steel used in the fasteners. If We Use the Semantic Web, Couldn’t We Automate More of This? Once we have our data exchanges in a form where the meaning of the information is embedded into the structure of the data, Figure 3.27 shows how we can automate the process further. • ACME and EMCA collaborate and agree to use Part 4, Part 7 (which implies Part 2), and Part 8. • Each organization creates information models, or maps, to transform its internal data structures to something that follows the ISO 15926 protocols by using the base templates in Part 7. • They go a step further and use Part 8 to structure their data exchanges. • ACME selects certain data and exports it, using the database map, based on Part 7 templates, to translate it to the agreed neutral file format. As it assembles the neutral file, the template validates the data against the core classes in the Part 4 RDL using the RDS/WIP • ACME posts the information to its repository and tells EMCA how to to connect to it. • EMCA connects to ACME’s repository and selects the information it wants. • EMCA imports the information into its systems using its database map, based on Part 7 templates, to translate the information into something its systems understand. EMCA’s templates understand how to read the meaning from the structure of the data in the neutral file. Chapter 3 101 Fig. 3.27 Automating the process. Level of Collaboration Required About the only collaboration required here would be something to do with the manner of exchange. For instance, each party could post information on a particular location on the Internet so that other users could automate downloading the appropriate parts. Alternatively, they could send exchange files by e-mail. They would not have to discuss any details of the structure or meaning of the information itself because the meaning of the data would be embedded in the data itself. Wooden Swing Metaphor We need to venture into science fiction here to extend the wooden swing metaphor to include this case. It would be as if your Fijian co-worker automated the morning coffee machine talk by programming his cell phone with automatic answers to questions such as “How are you?,” “What did you do this weekend?,” and “What did you think of the football game on Saturday?” Chapter 3 102 In turn, you program your cell phone to ask these questions in case the two of you get busy and miss each other. Your cell phone asks the questions and picks up the answers as you walk by his cubical so that you can read them at your leisure. (And by the way, your friend writes his answers in Fijian but your cell phone translates them to perfect English while they are being downloaded!) Shouldn’t We Add Query, Workflow, and Security to the Semantic Web? Part 9 contains tools developed for the semantic web to implement query, workflow, and security. Figure 3.28 shows how it might work. • Both ACME and EMCA use the base templates in Part 7 (which implies Part 2) and Part 4. • They go a step further and use Part 8 to structure their data exchanges. • ACME builds a facade using Part 9 to add security and allow automation. • ACME selects certain data and exports it to its facade, as in the previous example. • EMCA connects to ACME’s facade and selects whatever data it wants. • EMCA imports the information into its systems, as in the previous example. Part 9 will also make it possible for EMCA to perform more interactive queries. For instance, if ACME maintained historical information (say, of the design of a particular piece of equipment at each point in time), EMCA would be able to structure a query that would pull back not only the current state of the design but the way each piece had changed over time. Chapter 3 103 Fig. 3.28 Automating the process with security Level of Collaboration Required The only question at this stage will be something like “What’s the URL of your ISO 15926 interface and what credentials do I need to get in?” As in the previous case, they would not have to discuss any details of the structure or meaning of the information itself because the meaning of the data would be embedded in the data itself. Wooden Swing Metaphor To the previous example we add the idea that your Fijian friend could program his cell phone to adjust the information his cell phone gives out based on the cell phone doing the asking. So, when the boss walks by the boss’s cell phone would get the message “I worked overtime on the Johnson file all day Saturday.” But when you walk by your cell phone would get the message about building the wooden swing. And as in the previous example, your friend writes in Fijian and your cell phone gives you the messages in English. Chapter 3 104 CHAPTER 4: CURRENT EVENTS IN THE WORLD OF ISO 15926 In this chapter we discuss current efforts to continue the development of ISO 15926, and to use in production the parts that are ready. To explain how ISO 15926 will work, at the beginning of this book we used the fictional babel fish—which, after placing it in your ear, somehow converts everything you hear into your own language. Another way of saying this is: “My computer can talk to your computer and we do not have to know anything about each other’s systems beforehand.” To any with a background in computer systems, this is a tall order indeed. If we express the issue this way, it sounds like we are talking about artificial intelligence. Well, artificial intelligence would be nice but we’ve been working on it for decades with little apparent progress. However, we do not need actual human intelligence in a machine. For instance, you can fly— but not like birds. In that respect, you are both more free and less free than birds. Birds cannot fly across the Atlantic Ocean in eight hours, but you can. On the other hand, you can’t jump off a tall building and fly across the road to visit your friends,1 but birds can. Similarly, we do not need actual human intelligence for information exchange. In any given exchange, the scope we are talking about is limited. The computer programs that need to exchange information already deal with the same real-world objects, just in a slightly different way. All we need is to find a means of representing the attributes of these objects in a way all computer programs can recognize. (When we express it this way, it almost sounds easy!) When we review the history behind ISO 15926, it becomes even more believable. Soon after CAD software came into common use, we wished we could transfer dimensional information between different CAD software and got exchange standards such as the Initial Graphics Exchange Specification (IGES). Then we wondered why we couldn’t also transfer information such as surface finish and materials between various product design software and got the Standard for the Exchange of Product Information (STEP). In hindsight, knowing the success STEP has had in the manufacturing industry this seems almost inevitable. But when we tried to use STEP to manage the life cycle of a process plant, we ran into the issue of having to maintain a very complex data model for the 30- or 40-year life of the facility. That did not work too well in practice, so we traded in the detailed data model for a simpler, generic data model (Part 2) combined with a reference data library (RDL, Part 4) and got ISO 15926. The theoretical development has continued with the addition of templates (Part 7), which makes it easier to use the data model—and the addition of semantic web tools (Parts 8 and 9) to more easily preserve the meaning of information during exchange. Development and use of ISO 15926 is increasing every year. Current efforts can be divided into three groups. • The use of ISO 15926 mandated by owner-operators for production systems. This is probably the most important development because it often takes influential owners to make a standard a requirement before the standard is accepted widely. We have two instances 1. Except, of course, if your name is Keanu Reeves. Chapter 4 105 in which this is happening: in production oil facilities on the Norwegian continental shelf (NCS) and in a pilot project to develop best practices for information handover preceding the construction of a world-scale bitumen upgrader and oil refinery. • The development of key infrastructure in the form of a practical reference data service (RDS), and software enabling the use of semantic web tools for information exchange. • The continued development of standards for geometry, instrumentation, and the harmonization between ISO 15926 and other industry standards. Much of this development is sponsored by Fiatech and the POSC Caesar Association (PCA). Recently, MIMOSA and the OpenO&M Initiative (which we introduce later in this chapter) have also sponsored projects. Using ISO 15926 in Production Exploration Production Information Management Association The Exploration Production Information Management Association (EPIM) is a nonprofit association of companies operating on the NCS in some part of the oil production supply chain. The objective of EPIM is to develop the best IT solutions for exchanging information and to promote these solutions to its users. EPIM is a distinct Norwegian operation, but because most global oil producers have some work going on in the NCS the EPIM has an international membership. EPIM jointly sponsors a number of projects with POSC Caesar, including IOHN, EqHub, and the EPIM Reporting Forum. Integrated Operations in the High North The term integrated operations in the oil and gas exploration and production industry refers to new work processes that combine information between the various disciplines within a large oil company. For operators in very remote sites, such as the high north, there are heavy demands on communication. Instrumentation and efficient transfer of real-time data between the fields and centralized operations is critical to profitable development. The Integrated Operations in the High North (IOHN) project is a collaboration of about two dozen organizations along the oil exploration and production supply chain. The project involves designing, implementing, and testing a digital platform—based on open standards—to capture and transfer real-time data from drilling and production platforms to management offices. There are three layers: a physical layer, a business layer, and a technology layer. ISO 15926 is part of the technology layer, enabling an easier exchange and integration of information across disciplines and business domains. Equipment Hub Equipment Hub (EqHub) is a repository for technical information for all of the oil and gas operators and suppliers working on the NCS. EPIM, which owns and manages EqHub, estimates that the cost of providing necessary documentation ranges from 5 to 20 percent of the total installed cost—depending on the type of equipment. The repository will hold and share certified information on standard equipment, which can represent 80 percent of document vol- Chapter 4 106 umes on an asset. POSC Caesar will make the existing ISO 15926 RDL available to EqHub, and extend it to cover the EqHub scope. EPIM Reporting Forum The Reporting Forum standardizes the daily and monthly reports drillers and producers have to provide to their regulatory authorities. To date, the daily drilling report, daily production report, and the monthly production report have been standardized. Conceptually, this sounds pretty simple—but these three reports represent the entire value of a drilling or production asset. Smoothly operating, transparent reporting mechanisms build mutual trust among business partners and regulators. These reports are structured according to ISO 15926, follow the PCA oil and gas ontology, and use the PCA RDS. MIMOSA and the OpenO&M Initiative More money is spent in the operations and maintenance (O&M) phase of a facility, between startup and demolition, than on the original design and construction. Capital assets—including facilities, plants, and complex platforms—use increasingly complex automation systems requiring many event-oriented data and information flows in order to support the intended O&M activities. Large numbers of O&M systems need to interoperate with one another, not only at startup but over the life of the facility—even as individual systems are upgraded or replaced. As the complexity of the environment has increased, traditional systems integration approaches have become increasingly difficult, expensive, and risky to implement and sustain. Standards organizations in the O&M community (ISA, MIMOSA, OAGi, OPC Foundation, and WBF-B2MML) have taken on the task of solving this problem in a collaborative manner via the OpenO&M Initiative. MIMOSA, a not-for-profit trade association dedicated to developing and encouraging the adoption of open information standards for O&M, has also helped lead efforts to engage with Fiatech and PCA in order to solve the problem in the most sustainable way. The solution is the system of systems (SoS) paradigm, in which individual systems are designed to interoperate as a single composite system on a sustainable basis over the life of the facility. MIMOSA and the OpenO&M Initiative are applying the SoS paradigm to complex plants, platforms, and facilities. Their methodology relies on the use of shared services and information models driven by OpenO&M use cases, which are defined and prioritized by owneroperators on behalf of industry groups such as oil and gas, chemical, and aerospace. Collectively, these shared services and information models form a foundational Information Service Bus Architecture upon which owner-operator–specific functions can be built. Using this paradigm, an owner-operator no longer must design, develop, and implement a large custom integration solution for the core of O&M interoperability. Instead, it can require that its suppliers leverage the industry developed solution. Systems suppliers must support the required services interfaces, whereas engineering procurement and construction (EPC) engineers must hand over site-specific engineering and construction information in approved machine-readable formats. They must all agree to share industry developed RDLs. O&M Background In 1989, the Purdue Reference Model for Computer Integrated Manufacturing established a Chapter 4 107 five-level hierarchal model of a manufacturing facility—named 0 through 4—starting from the bottom and moving to the top. The description and details of the content of the layers have evolved, but the basic principles have remained in active use in all major subsequent standardization activities addressing the automation of manufacturing as well as other processoriented industries. • For the sake of simplicity, layers 0, 1, and 2 can be somewhat aggregated to include the physical assets under management and the true real-time systems providing automation, control, and monitoring. • Layer 3 is often described as manufacturing operations management or even more generally (and to cover environments other than manufacturing) as operations management. It is event driven and thus operates with a time lag behind the real-time systems in the layers below, but it tends to be timelier than transactional environments in the business-management–oriented layer above. It is concerned with detailed operations planning and scheduling, order fulfillment, and maintenance planning and scheduling. This space is somewhat chaotic, with many players attempting to gain control over it. • Layer 4 is often described as business management and typically contains the enterprise resource planning (ERP), supply chain management (SCM), and customer relationship management (CRM) systems used for overall management. Integration within this layer is normally somewhat out of the box, as the systems tend to be supplied by one of the two or three suppliers of major systems. A key goal of MIMOSA and other OpenO&M Initiative participants is to standardize layer 3 as the key connectivity layer in the SoS paradigm. Thus, any compliant automation and/or operations management system will work with any compliant ERP system—and multiple compliant systems from multiple vendors can be used in the same system. Switching cost is also dramatically reduced, should that become necessary, although owner-operators will normally retain compliant systems that are performing their jobs in a proper manner. Because many large owner-operators are collaborating in this effort, their wishes carry some weight. Bringing Engineering and O&M Together Although design, engineering, and construction systems were largely considered out of scope in the Purdue model, there is a critical need to include them in a sustainable SoS solution. Typically, a large amount of manual takeoff is done using PDFs and spreadsheets as key parts of the system implementation and integration activity. This adds significant expense, time, and risk to the startup—and impacts full life-cycle sustainability because the lack of proper synchronization between O&M systems and engineering systems after handover results in ambiguity about plant, platform, and facility configuration. As the OpenO&M Initiative has worked to define the basis for an interoperable execution environment for an SoS, Fiatech and PCA have worked to define the basis for a shared reference information environment based on ISO 15926 (shown in Figure 4.1). There is a tremendous opportunity to help instantiate the sustainable interoperability environment for O&M systems through a well targeted digital handover to the owner-operator. Whereas a capital projects team is potentially concerned with an almost unlimited scope of detailed information that may need to be modeled and exchanged, the information needed to be handed over to provision the interoperable O&M environment is comparatively limited. By focusing on this defined set of information, there is a large near-term opportunity to save time and money while improving risk management over the entire life of the facility. Chapter 4 108 Fig. 4.1 O&M reference information environment. In 2008, Fiatech, PCA, MIMOSA, and OpenO&M initiated a collaboration to harmonize the MIMOSA and OpenO&M standards with ISO 15926. This will establish a shared reference information environment supporting the interoperable execution environment for O&M systems. EPC contractors will then be able to provide the required provisioning information using ISO 15926 protocols. The Australia-based Cooperative Research Centre for Infrastructure and Engineering Asset Management (CIEAM) is also a key part of the collaboration through their support of standards workshops for this portfolio of standards in conjunction with the World Congress on Engineering Asset Management. Joint Special Interest Groups MIMOSA and PCA have established two joint special interest groups (SIGs) in collaboration with Fiatech. • O&M SIG: This group is focused on leveraging the combination of ISO 15926, MIMOSA, and OpenO&M standards to enable sustainable interoperability of O&M systems. • IT Architecture SIG: This group will focus on the IT-architecture–specific elements of standardization needed to support the use of the combined standards. The joint SIGs are performing work that will be directly applied in a large bitumen upgrader and oil refinery being built in Canada, which has specified the use of an open execution environment for O&M systems interoperability based on a number of use cases developed by OpenO&M. This is a green field project scheduled to produce a commissioned plant by early 2015. The methodology for digital handover of required O&M information will be incrementally Chapter 4 109 specified and piloted prior to actual implementation in the project. The first phase of this effort is underway and will be piloted by the end of 2011. It is framed around OpenO&M use case 1 (handover) and will focus on a debutanizer tower. A public demonstration at an industry conference by the end of 2011 will leverage the same use case to target data set (debutanizer tower) and open solutions architecture in an environment designed to encourage maximum supplier participation. Subsequent phases will be incrementally spiraled to build a virtual plant and demonstrate the full set of OpenO&M use cases. The O&M implementation methodology for this project will be derived from the open industry activities and will be incrementally piloted in phase with the open industry activity. Development of Enabling Infrastructure Joint Operational Reference Data The Joint Operational Reference Data (JORD) project is the result of a 2009 recommendation to build an industrial-strength RDS. An RDS is the user interface for working with and managing an RDL for ISO 15926. Reference data is the lifeblood of ISO 15926. It accomplishes the dual task of eliminating ambiguity in information exchanges and reducing the size of data payloads. You may recall from Chapter 3 that the idea of an RDL separate from a data model goes back to the early 1990s, during the development of the STEP application protocols (APs). The first RDLs were simple lists of attributes whereby everyone agreed to use a particular name for a particular object. In use, these terms were “hard coded” into the database structure and database maps. As long as all business partners knew each other well enough to trust that each used the terminology correctly, this worked. But the vision of ISO 15926 is that organizations be able to exchange information with anyone. What the industry needed was some way to validate terminology when exchanging information with partners that were not known as well. The industry needed a service for publicly available reference data based on Part 4. In the mid 2000s, such a service was created in the form of what we now call the RDS/WIP (Reference Data Service/Work in Progress). Anyone can use it, and with just a little training anyone can add to it. As an experiment, the RDS/WIP has been a great success—with a number of important lessons learned, not the least of which is the enthusiasm with which new terms were added by people on their own learning curve. In the RDS/WIP today, there are many terms that have been developed properly—specialized from the correct parent in good object-oriented style. Unfortunately, there are a great many more that have not been. It still takes a fair amount of instruction for new users to know how to select an appropriate class. It is from experience with the RDS/WIP that demand grew for an industrial-strength RDL, which prompted Fiatech and PCA to sponsor JORD. Figure 4.2 shows how it will work. This diagram demonstrates three things. • The RDLs will be highly distributed. We have previously described the use of each term’s uniform resource identifier (URI), which is essentially the web page for each definition. As far as the technology is concerned, there is no reason every term in a large information exchange could not point to a different RDL. Chapter 4 110 • There will be many levels of certification. The diagram shows five levels of RDL, but in practice there will be a continuum from private sandboxes closed to outsiders all the way up to ISO. • As the audience for an RDL expands, access will change from read-write (whereby definitions can be changed) to immutable (where they will never change). There is risk in making the decision whether or not to make definitions immutable. As an industry group starts to develop an RDL, there will likely be a learning curve—and if all definitions had to be immutable from the start some mistakes would inevitably be “cast in stone.” But as a given term gains acceptance it should eventually become immutable so that organizations can count on it. The smaller communities are in a better position to manage this risk. Industry participants anticipate a certain amount of cascading of terminology upward as it becomes industry standard practice. Currently, JORD is on target in developing a funding model and proposals for working facilities. Fig. 4.2 Federation of cascading reference data libraries. Chapter 4 111 iRING iRING is more or less a brand name for an approach to information exchange that uses the full specification of ISO 15926. The name is an acronym of ISO 15926 Realtime Interoperability Network Grid. It was developed with four purposes in mind. • To prove that information exchange using the full specification of ISO 15926 is possible. • To develop software interface tools using the full specification of ISO 15926 and make the toolkits available to anyone under an open-source license. • To develop best practices and make them available to those who use the tools. • To encourage software vendors to collaborate and support iRING interfaces within their product offerings. Although there still is work to do before iRING technology is ready to be used on a real project, there have been a number of very successful demonstrations. Figure 4.3 is a simplified diagram showing how it will work. As with other information exchanges, ACME uses a database map to transform its internal information into the form of Part 7 templates. The templates capture the meaning of the data along with the data values. Part 8 (OWL) is used to ensure that the ontology of ISO 15926 is preserved and then posts it to a Part 9 (Facade). PART 7 Templates PART 8 OWL ACME MAP 7 8 lid at io n Va PART 7 Templates EMCA 9 Exchange 9 Va PART 8 OWL Part 9 Facade Part 9 Facade at lid 8 7 MAP n io Fig. 4.3 iRING information flow. During this process, the terminology is validated with the appropriate RDLs—which are based on Part 4. When EMCA receives the data, it processes it in reverse—validating the terminology with the RDLs used by ACME. The software tools to configure and execute these functions are collectively called iRINGTools, which are developed and owned by the iRINGUserGroup. The tools are available to anyone for any purpose under what is known as the BSD three-clause open-source license. The group also offers a ready-made RDS called iRINGSandbox that will work with any system that uses iRING protocols. The iRINGUserGroup continues to develop iRING technology on a very aggressive schedule. In addition, Fiatech has sponsored two projects to continue the development of iRINGTools. • ISO 15926 project information flow will define typical data flows in different types of capital projects, test and validate currently available ISO 15926 tools, and assist companies in Chapter 4 112 adopting ISO 15926 by highlighting implementation methodologies and achievable savings. • The iRINGTools Interfacing Project (IIP) will deliver a set of iRING tools data layer modules that will provide native ISO 15926 and iRING interfaces to commercial engineering software. This means that iRING capability can be added to commercial software without having to modify the software. Practical Guide for ISO 15926 Modeling To exchange information using ISO 15926, it has to be modeled according to the data model in Part 2. To make it easier for industry professionals to use Part 2, the concept of templates— which are like building blocks for information—was developed as Part 7. Although physically similar to mapping two databases together with a spreadsheet (basically, you just enter the name of the template instead of an attribute name) modeling with Part 7 is sufficiently different that it is becoming a barrier to the wide acceptance of ISO 15926. (We provide an introductory example in Appendix C.) A working group is being formed to remedy this with a guide for modeling using Part 7. The deliverable will be a manual describing how to use templates, in language that industry professionals can understand. A good part of this will overlap with JORD, in that using an RDL is part of using templates. Continued Development of Standards Geometry Special Interest Group The Fiatech Geometry SIG is developing Part 3: Reference Data for Geometry and Topology. Part 3 will describe a neutral way of representing graphics. It will be a dictionary that all 3D design systems can map to in order to exchange graphical information. Part 3 is to geometry as Part 4 is to data. Today, many capital projects use some type of engineering design system in order to make use of the inherent 3D visualization of those systems, their ability to check for interferences, and their ability to produce automated drawings with fabrication details and bills of material. Increasingly nowadays, the information about project objects is being linked directly to the image of that object so that the 3D model essentially becomes the index to all project information. It is important, then, that the “graphics” of the project be exchanged with ISO 15926 as easily as the “data”, and that the graphics be linked to the data. Currently, there are a number of ways to exchange graphic images between commercial CAD software—but they typically only exchange the graphics. Any data attached to the graphics, or even indexes to the data, are lost. If an owner wants to take delivery of an intelligent 3D model, about the only way nowadays is to require all of the EPC contractors to use one particular system. The vision of ISO 15926 is to be able to exchange graphical information between 3D engineering systems as easily as any other project data. Every 3D engineering system uses a different means of representing physical objects. For instance, there are many ways of storing the dimensions of a piping tee. One system may record the location of one of the connection points and the face-to-face dimensions between it and Chapter 4 113 the others, whereas another may record the location of the center and the distance from there to each of the branch connection points. Part 3 describes a single method of describing the geometry of a part to which all systems can unambiguously map. When Part 3 is complete, we will be able to use Part 7 templates to include geometry information for project objects. This means that all of the information about an object is in one place. Part 3 is based on ISO 10303-42 (Integrated Generic Resource: Geometric and Topological Representation) and ISO 10303-104 (Integrated Application Resource: Finite Element Analysis). Instrument Special Interest Group Fiatech’s Instrument SIG is working to standardize the manner in which instruments are described. One of the issues in exchanging information by converting everything into a neutral form is making sure that all participants make the same choice of class to map a given attribute to. In many of the prior demonstrations of ISO 15926, the goal was to develop the technology and thus the participants were “coached” on how to choose the correct classes. But the vision of ISO 15926 is that project participants need not collaborate at that level prior to exchanging information. To get an idea of the magnitude of potential differences in classification, the Instrument SIG divided itself into two teams—with each team classifying a number of instruments on its own. Comparing the differences in the results showed a close match, but not 100 percent. Part of the reason different organizations may describe project objects differently is that each organization involved in a project needs different types of information. For instance, instrument manufacturers need complete product data for each subcomponent of something the rest of us call an “instrument.” They need to track the part number of each piece and all of the information required to manufacture it. In contrast, EPC contractors need only a very small fraction of that entire lot of information—typically only overall dimensions and properties, and functional specifications. Owner-operators need the functional specifications too, but are most interested in maintenance and repair information. One proposal from the instrument SIG that would make this situation easier to manage is that there would be different Part 7 templates for different participants. Manufacturers will use very detailed templates, whereas EPC contractors and owner-operators will use simpler ones. Quite a bit has been done, but there is still much to do. This SIG will work closely with the joint MIMOSA and OpenO&M initiatives, described previously. Engineered Equipment Life Cycle Application Tools The purpose of the Engineered Equipment Life Cycle Application Tools (EELCAT) project is to develop specifications for the exchange of information about engineered equipment through its full life cycle, from initial design to disposal. One of the deliverables of this project was to explore the feasibility of harmonizing three standards: ISO 15926, the AEX XML schema, and the Hydraulic Institute’s EDE 50.7 standard. The study concludes that such harmonization is indeed possible, and demonstrates how it can be done. Chapter 4 114 Automating Equipment Information Exchange The Automating Equipment Information Exchange (AEX) project was sponsored by Fiatech and has delivered and demonstrated an XML schema to automate information exchanges related to the design, fabrication, and use of engineered equipment. The driver of AEX reads like the original driver for ISO 15926: Equipment is designed by different groups, each with specialized software that does not talk to each other; labor-intensive rekeying of data into each system; additional cost; and risk of human error. The AEX schema can be used as an intermediate neutral form for information exchange all along the equipment supply chain. HI-EDE 50.7 As an aid to organizations wishing to exchange information on pumps, the Hydraulic Institute has published what it calls Standard HI 50.7-2010 for Electronic Data Exchange for Pumping Equipment, or HI-EDE 50.7. This is a dictionary of data fields and cross-listed definitions between well-known standards such as API 610 and ANSE/ASME B73. In addition to the national standards, the AEX schema has been harmonized with HI-EDE 50.7. Although the standard does not mandate the use of the AEX schema, it makes reference to the appropriate term with each data definition. Chapter 4 115 CHAPTER 5: GETTING STARTED WITH ISO 15926 A detailed set of steps for implementing ISO 15926 is beyond the scope of this introduction. In this chapter, we provide you with a roadmap—but it will not be like a route map from your travel agent showing the shortest route from your house to the beach. Instead, it will be more like a map of the entire countryside—with a number of interesting intermediate points at which you can stop. For instance, if you lived in London, England, and wanted to go the beach at Cannes, at the South of France, an easy way would be to take the Eurostar to the Gare de Nord train station in Paris, transfer to the Gare de Lyon, and then take the train à grande vitesse (TGV) to Cannes. You would have the pleasure of watching the countryside go by at 300 km/hr in airconditioned comfort. On the other hand, if you were to channel Rowan Atkinson and take a side road (Figure 5.1) you would have a much more interesting journey. Fig. 5.1 A more interesting route to the beach. Similarly, implementing ISO 15926 will probably fork into two main paths. The first path will be for organizations that largely use unmodified commercial off-the-shelf (COS) software. The second path will be for organizations that maintain proprietary software that supports their own work processes. Most organizations will follow the first path. Right now there are work teams developing Chapter 5 116 proof-of-concept software that will more or less snap onto the side of a commercial application.1 Such software will use the application’s programming interface to extract standardized data sets (e.g., line list or equipment list) from its database, transform the data into the neutral format of ISO 15926, and make it available to selected business partners. Organizations with custom work processes or custom software will also be able to follow the second path. A good metaphor is to compare building a web page 15 years ago to what it takes today. Back then you would have had to learn how to write HTML code by hand. Today, there are many Internet service providers who for just a few dollars a month will let you use tools with which you can create a web page by dragging and dropping gadgets onto templates. On the other hand, if you need something more sophisticated there is no end to what you can do. Key Preparation When ISO 15926 is mature and examples of its use are common knowledge, implementing the standard will be just like executing any other project: figure out where you are, figure out where you want to be, make a plan to bridge the gap, and then do it. The problem now, at the beginning of the second decade of the twenty-first century, is knowing what is possible. The idea that “your computer can talk to my computer without either of us having to know anything about each other’s system” sounds magical. Where do you start with magic? Join Fiatech or the POSC Caesar Association If you want to get to know some people who can help you out, one very good way is to join Fiatech or the POSC Caesar Association (PCA), participate in a special interest group or two, and get involved in a project. You will meet the people who are developing ISO 15926 and learn firsthand from others who have implemented parts of the standard. This will give you an excellent understanding of what is possible, the resources required, and a realistic time frame for your own project. Have Realistic Expectations ISO 15926 is not a silver bullet that will instantly solve all of your data problems. In the introduction we said that the reason we need ISO 15926 is “so that we can exchange and reuse complex plant and project information easier and cheaper.” But if your information has been generated with a fuzzy work process so that when you exchange information with a business partner neither of you really knows what you have, moving it faster is not what you need. Understand Your Data The most important thing your organization can do to implement ISO 15926 is to thoroughly understand your data. There are three aspects to this. • Understand what your data means. Every organization makes assumptions; things it believes “everyone just knows.” Find out what these are before attempting to exchange infor1. In Chapter 4, we introduced the iRINGTools Interfacing Project (IIP). Chapter 5 117 mation with someone and thus find out too late that “everyone just doesn’t know.” • Understand your data in terms of reference data. When we map databases together in the traditional way, we expect to take the time to understand what every data value means. Once we have done that, in the database mappings we can assign any random text string as a label. But as we move forward to a time when everyone expects to be able to exchange information with anyone we will rely heavily on our partners having classified their data properly, and the only way to ensure this is for everyone to relate their data to publicly accessible reference data libraries (RDLs) and validate the terms during the information exchange. • Understand that representing your data in a way in which the context, or meaning, is part of the payload is fundamentally different. This is embodied in learning to use Part 7 templates. At the higher levels of compliance, ISO 15926 uses a fundamentally different approach to exchanging information. In the past, when we have linked two applications we have always tried to know as much as we can about both of them. But with ISO 15926 linking applications by exploiting special knowledge is actually a liability. For most of us, this is something we have to unlearn. We have also viewed machine-to-machine communication as a technology problem, but we ran into the wall of not knowing how to handle the information. ISO 15926 focuses on modeling the information to preserve its meaning as it is being exchanged. Implementing ISO 15926 In Chapter 3, we introduced the idea of different levels of compliance. We will demonstrate this in our example, starting with a dictionary-level information exchange, moving to using Part 7 templates, and finally to using the semantic web tools of Parts 8 and 9. Let us assume that you want to automate the creation of a purchase order by selecting items on a process and instrumentation diagram (P&ID). In many engineering systems today, engineers enter a great deal of information that is stored in a database during design. To create a purchase order, they extract a report of the information, attach the appropriate data sheets, perhaps fill out a requisition form, and then send the package to a procurement officer. When the procurement department receives the requisition, much of it has to be rekeyed— which takes time and introduces opportunities for error. What we wish to do is eliminate all rekeying by exporting information directly from the P&ID and create the appropriate records in the purchasing database. The procurement officer will still assemble the purchase order in the usual manner, only now without having to read an engineer’s handwriting. Dictionary-level Information Exchange The good news is that implementing ISO 15926 at the lowest level, dictionary compliance, is not that difficult. In fact, if your organization has attempted to move information between two computer applications you probably already know the basics and probably already have most of the necessary skill sets represented in your present staff. In the initial stages, the implementation team will need to fulfill three roles. The first will be someone who understands information flow; someone who can draw boxes and arrows on a white board. The other two roles will be the subject matter experts (SMEs) for each of the Chapter 5 118 two applications. (Note here that we did not specifically say three people, we said three roles. Depending on the size of the project, all three roles might be fulfilled by one person.) There is nothing magical about automating an information exchange using ISO 15926 compared to the traditional means we have at our disposal. The team will start by thoroughly describing the information that procurement needs from engineering in order to complete the purchase, and where engineering stores that information. They will analyze the information flow, and at the beginning will probably name the information objects using the natural language names they are familiar with from the dialog boxes and user interfaces of the software. They should give some thought as to how an output from a P&ID can be generated by simply selecting objects on-screen, and later pushing transformed data into the purchasing database. Commercial software often comes with an application programming interface (API) that will let users automate certain tasks. If an API is available for the two applications, the process will be much simpler. Otherwise, a programmer may be required in the execution phase. In the next stage, the information flows will be given more definition and the team will need to bring in someone who can dig into the databases. Where the SMEs gave information objects their natural names, now the database administrators will need to see what each database calls the same objects. The team needs to thoroughly understand the meaning of the database objects, and in particular understand implied attributes (the type that “everyone just knows.”) Another opportunity to understand what the data means is to examine the work processes on each side of the exchange. Sometimes the way information is actually used will change its meaning. About this time, the team should bring in someone with a background in information modeling—or train someone to do it. Most computer science programs include courses in entity relationship (ER) modeling, Unified Modeling Language (UML), or something similar. A production database for engineering software can be quite complex, and this type of training will be necessary in understanding it. A likely discovery is that the data structure in each of the applications is fundamentally different. We introduced this concept at the beginning of Chapter 1, and later on in Chapter 3. Each application will have been developed by a different team, and each team will have made pragmatic decisions on how its data is to be structured—depending largely on what each application does. We used the example of an engineering application breaking tag numbers (for example, 34PI-325) into their constituent parts and storing each part individually, whereas the procurement application might store tag numbers as simple text strings. Some type of transformation may be required, and if the APIs of the applications will not manage this some custom programming may be required. Note that until now we have not discussed ISO 15926. Until now, this is just like any other information exchange project. Perhaps the first opportunity to use ISO 15926 will be when deciding how to hand off information from one application to the next. There will be a temptation to look for fortunate idiosyncrasies of either or both of the applications that can be exploited. For instance, there may be some way to make one application write directly to the database of the other. This is very much against the principles of interoperability. We have often done this type of thing in the past and patted ourselves on the back for being ingenious. Perhaps it is ingenious, but it forever traps us into particular versions of particular software. Chapter 5 119 A better way is to make the exchange mechanism as generic as possible using some type of neutral file exported by the first application, perhaps transformed somehow, and then imported to the second application. Where this will become critically important is when another application is brought into the federation. If the first information exchange uses an intermediate neutral file, the next application may be able to use some of this directly instead of developing another point-to-point solution. Using the Part 4 Class Library Another opportunity to use ISO 15926 is in naming data objects in the neutral file. In the past, we have typically left this up to project participants. But to maximize the opportunity for reuse, industry standard names, which have particular meanings, should be used. This way, over time—even if people forget or if personnel changes—anyone can refer to the standard. A good choice would be to use the classes in Part 4 rather than creating what amounts to your own dictionary from scratch. You can purchase a copy of Part 4 from ISO or use the RDS/WIP (Reference Data Service/Work in Progress), a publicly available RDS. You may recall that we have said earlier that the RDS/WIP contains many terms that have not been created properly. Although using the correct term may not be critical for a one-off information exchange in which the actual meanings of the terms are well known, it will become a problem in the future if an opportunity arises to add another application to your federation. This is a good opportunity to learn to read the ontologies of the original Part 4 definitions. • At the end of Appendix C, we have included a brief introduction to using the RDS/WIP. • In Appendix D, we have included links to the iRINGUserGroup and one of their web pages on choosing the correct class. Live References to the RDS/WIP Once you have mastered the concept of using Part 4 terminology, the next logical step is to implement live references to the RDS/WIP during information exchange. At the simple level of point-to-point exchange within one organization, especially after the data has been examined for meaning, adding live references to the RDS/WIP will not immediately add a large value. Where this will start to pay off is when other applications are added to the federation, and in particular when external partners are added to the federation. Although this may sound like a large paradigm shift, it is really only an incremental change. During the mapping exercise, the biggest change is to use each term’s uniform resource identifier (URI)—which is essentially a web page for the definition—instead of using the natural language name. Then, during the exchange some new software will be required to manage connections to the RDS/WIP. A few years ago, this would have had to be written by each participant. Now, software that does this is available under an open-source license—and some software vendors have committed to supporting it. There is much more to this than has been described. For instance, at some point someone will have to look at things such as database names, database access, and network permissions. But this will have to be done, and be done in the same way, whether or not Part 4 is used. Chapter 5 120 Tools When the project is ready for execution, there are a number of commercial and open-source tools that have been developed in response to the demand. At the dictionary level of compliance, the most useful will be a mapping editor. Conceptually, a mapping editor is similar to using a spreadsheet in that the left-hand side is usually “from” and the right-hand side is “to”—but a mapping editor makes the entire process easier to manage. • The iRINGUserGroup has published a mapping editor and tools to support the use of the RDS/WIP. Template-based Information Exchange The next level of compliance to ISO 15926 is to build some meaning into the data by using Part 7 templates. The mechanics of using templates is very similar to straight dictionary mapping: you simply put the name of the template into the “to” column of your mapping editor instead of the name of the class. The reason you would want to use templates is that it will make future information exchanges easier. With dictionary-level mapping, there is still the possibility that you and your business partner do not define terminology the same way. (We have previously used the example of two people from different cultures learning the same foreign language. With different backgrounds, each will define concepts differently. When first starting to communicate, they will need a period in which they review terminology together.) When both parties use Part 7 templates, there is a much greater chance they will have a common understanding of each term. Another reason to use templates is that you will be able to include relationships between data objects. A simple example of this is being able to relate a pipeline number to a nozzle on a tank, then to the tank, to a logical location in the plant, and to a physical location in the plant. The real knowledge is in knowing which templates to use. For this, you will need your information modeler to come up to speed with Part 7 templates. Information Modeler • In Chapter 4, we mentioned the Practical Guide to ISO 15926 Modeling. This is a very good place to start. Tools • The iRINGUserGroup publishes a set of tools for using Part 7. There are some commercial offerings as well. Semantic Web The parts of ISO 15926 that use the semantic web are Part 8 (OWL) and Part 9 (Facades). As we described in Chapter 3, you would implement these if you wanted to automate some of your work flow (Part 8) and if you wanted to include some security (Part 9). The good news here is that if you have properly used Part 7 templates to represent your data you can simply add Parts 8 and 9 without redoing any of your database mappings. Some new software will be required, and a few years ago each participant would have had to write it. Chapter 5 121 Now, however, software is available under an open-source license—and some software vendors have committed to supporting it. Summary Taken step by step, implementing ISO 15926 does not have to be a huge and expensive exercise. Of course, if an organization with a great deal of proprietary software and work processes were to decide to adopt ISO 15926 on a large scale it would require a large effort—just as converting one’s entire parts library from a paper catalog to an online catalog would require a large effort. But a pilot project to validate the process and gain experience is within the ability of most organizations. Just a few years ago, each participant would have had to write the specialized software required to support ISO 15926. Now, open-source software has been published that does this—and some commercial software vendors have promised to support it. The part each organization will have to do on its own is to understand its data. The personnel to support an introductory project can likely be drawn from the existing ranks of most medium to large organizations. Dictionary-level Mapping • We have described the role of the project leader as “someone who can draw boxes and arrows on a white board.” This is, of course, a gross simplification. What we mean is someone with a logical mind who understands abstract reasoning. Obviously, the ideal candidate would be someone with both a strong computer science background and a strong discipline background. But if only one or the other is available it will be easier to teach the computer science skills that are actually necessary (for a project leader role) to a discipline specialist than to attempt to teach the discipline-specific skills to a computer science graduate. • SMEs for each application that will be brought into the federation. These people will understand how to get information into and out of the applications. • A database administrator will be able to get into the databases for each application. • An information modeler will be able to structure information to retain the meaning of the data involved. (There is a large overlap here with the role of database administrator.) • A network administrator will understand how the organization’s network works and will be able to assign rights and privileges. • The role of application programmer will be required if the API of each application in your federation will not support the full transformations that may be required. If internal resources are not available, this may have to be outsourced. Template-based Information Exchange • Part 7 information modeler: Using Part 7 is different from straight database mapping, but this can be taught to an existing team member. Semantic Web • Parts 8 and 9 implementer: Software tools to support Part 8 (OWL) and Part 9 (Facades) Chapter 5 122 have already been written and are available under an open-source license. As well, a market for such tools has evolved and these types of tools will be increasingly supported by commercial software vendors. Implementing these tools is well within the capability of the IT departments of most medium to large organizations. Join Fiatech or the POSC Caesar Association If there is any mystery remaining, it can be dispelled by meeting like-minded people, and the best way to do that is to join Fiatech or PCA and get involved. Outside of working on ISO 15926, many of us are competitors. The natural tendency, then, is to horde information. But if we can cooperate to develop ISO 15926 digital information exchange gets easier, information exchange on projects becomes easier, and as projects become easier and cheaper the owners (who in the end, pay for everything) will be able to do more. The pie gets bigger. There are a number of reasons organizations and groups of individuals join a consortium. Owner-operators Owner-operators want to improve their bottom line. Collectively, owners fund the entire operations of the other players. These include EPC contractors, equipment manufacturers, software developers, construction contractors, and universities—funded directly or via costs embedded in other fees. However, they lack the expertise to do the basic research and apply it to day-to-day operations. In a consortium, such as Fiatech or PCA, owner-operators have a forum whereby they can explain their needs and work with others on solutions. EPC Contractors EPC contractors get involved with consortiums to solve problems their competitors and clients also have. Everyone can work together, share the costs, and share the results. EPC contractors can take research from universities, prove it in the field, and then recast it in terms that others in their industry can understand. Software Developers Software developers join a consortium to assist in developing standards that make software development easier. For instance, on the subject of information exchange software developers must be responsive to their customers and provide conversion between many competing standards. But if all industry players agree on a single standard the work of software developers will suddenly get much easier. Equipment Manufacturers and Construction Contractors Equipment manufacturers and construction contractors, with tighter margins, join a consortium to participate in projects they would never be able to fund on their own. Universities Universities participate in a consortium so that they can see their research applied in realworld projects. Chapter 5 123 APPENDIX A: THE PARTS OF ISO 15926 This appendix summarizes the current state of the various parts of ISO 15926. We have treated Parts 2, 4, 7, 8, and 9 in some detail in Chapter 3. In Chapter 4, we introduced the geometry SIG that is developing Part 3. Here we introduce the other parts. ISO standards are developed with the voluntary involvement of all interests worldwide. There are three main phases. 1. The need is expressed by an industry sector and proposed as a new work item, and a technical scope is defined. 2. Consensus building where the detailed scope is negotiated between the countries involved. 3. Formal balloting and acceptance is carried out worldwide. Each standard is assigned to an appropriate technical committee/subcommittee (TC/SC). ISO 15926 falls under TC 184 (Automation Systems and Integration) SC 4 (Industrial Data). During development of a standard, it may be published in a number of states. • PWI: Preliminary work item • NP: New work item proposal • CD: Committee draft, by a committee of experts • TS: Technical specification, after a consensus within the appropriate TC/SC • DIS: Draft international standard • FDIS: Final draft international standard, ready for formal vote • PRF: Proof of a new international standard • IS: International standard ISO 15926 Formal name: Industrial automation systems and integration. Integration of life-cycle data for process plants, including oil and gas production facilities. Part 1 Formal name: Overview and Fundamental Principles Publication date: 2004 Status: IS Part 1 is the introduction to ISO 15926 and describes its purpose, which is to facilitate data integration by means of a data model that defines the meaning of information in a way that all users of the information will have the same understanding of what it means. Appendix A 124 Part 2 Formal name: Data Model Publication date: 2003 Status: IS Part 2 is the conceptual data model for representing technical information about project objects. The original mandate was life-cycle information about process plants, but this has expanded to include pretty well any type of capital project because there is so much overlap. Part 3 Formal name: Reference Data for Geometry and Topology Publication date: 2009 Status: TS Part 3 is reference data for geometry and topology for 2D and 3D shapes. It consists of an ontology of basic classes of curves and surfaces that can be used with computer-aided design (CAD), Geographic Information Systems (GIS), and earth models. This means that you will be able to use Part 7 templates to include geometry information about project objects. Part 3 is based on ISO 10303 Part 42 (Geometric and topological representation) and Part 104 (Finite element analysis). Part 4 Formal name: Initial Reference Data Publication date: 2007, with an amendment in 2010 Status: TS Part 4 defines the initial set of reference data for use with ISO 15926 and ISO 10303-221. ISO issues the reference data in the form of spreadsheets. Currently, there are almost 20,000 individual terms. Part 5 Formal name: Procedures for Registration and Maintenance of Reference Data Status: Discontinued Part 5 was the procedures for registration and maintenance of reference data. This function has been taken over by an SC4 commission for class library maintenance not only of ISO 15926 but of other ISO reference data libraries contained in databases. Appendix A 125 Part 6 Formal name: Scope and Representation for Additional Reference Data Status: NP/TS Part 6 is the methodology for the development and validation of reference data. It describes how to validate a reference data item to ensure that it is genuine. It describes the information required for a new reference data item and how to have it approved. It lists the metadata used for the provenance of the objects in an RDL. Part 7 Formal name: Implementation Methods for the Integration of Distributed Systems: Template Methodology Status: PRF/TS Part 8 Formal name: Implementation Methods for the Integration of Distributed Systems: Web Ontology Language (OWL) Implementation Status: PRF/TS Part 9 Formal name: Implementation Methods for the Integration of Distributed Systems: Facade Implementation Status: Draft Part 10 Formal name: Implementation Methods for the Integration of Distributed Systems: Abstract Test Methods Status: Draft An abstract test method is a generic description of how to test software systems to validate ISO 15926 compliance. “Abstract” means there is no coupling with a computer language or brand of test software; this must be filled in by the implementer. Part 10 only handles full compliance; it does not list the criteria for partial compliance—which is up to the business. When something like ISO 15926 is in development, the people working on it form a community and get to know each other. Within the community, there is a natural check and balance on whether one’s work conforms. But when ISO 15926 is mature users will need a way to judge whether or not their business partners’ systems are in compliance. Part 10 will fulfill the need Appendix A 126 for a way to test systems to see how well they comply. Part 11 Formal name: Simplified Industrial Usage of Reference Data, Including Gellish Implementation – Working title: Gellish – RDF Status: NP/TS Gellish is a structured subset of natural language that can be used to express knowledge in a way that is readable by both humans and machines, and is system independent. Knowledge can be represented in a way that allows machines to artificially reason and come to conclusions. Gellish is the outgrowth of the development work on ISO 10303-221 (AP221) and ISO 159262 (Part 2), both of which we introduced in Chapter 3. You may also recall that AP-221 had an Annex M, which consisted of standard instances of the types of objects in the Part 2 data model. Over the years, Annex M was extended and became known as STEPlib and eventually used as the basis of Part 4. Since then, STEPlib has become the Gellish Dictionary—containing concepts, and relationships between those concepts, arranged in a taxonomy that supports inheritance. Gellish can be used as a front end for the template methodology of Parts 7 and 8, or it can be, and has been, used on its own for information exchange on major projects. Appendix A 127 APPENDIX B: COMPLIANCE WITH ISO 15926 ISO 15926 is a standard with many parts, not all of which have to be implemented at once. As well, there are variations in the manner in which some of the parts can be implemented. Therefore, simply saying “My system has implemented ISO 15926” does not give a true picture of your system’s capabilities. The vision of ISO 15926 is that business partners can exchange information without engaging in a lengthy process of understanding the meaning of each other’s data. But implementing only one part of ISO 15926 at its simplest level does not demonstrate this capability. New users are, understandably, confused. This appendix outlines the various categories of compliance to give users a rough guide to comparing systems. The following table serves as a “thumbnail.”1 Category Compliance Checklist Dictionary identification only Semantic precision Shortcut relations Full ontology Any formatted file Representation technology Proprietary XML RDL XML Part 8 (OWL) Referencing technology URIs contained in schema External RDL File exchange Interface technology Proprietary query Part 9 (SPARQL) Community sandbox Industrial standardization Industry-wide sandbox JORD ISO Subject matter scope BIDG Explicitly identifiable elements Metadata scope Versioning Status 1. Thanks to Ian Glendinning for the material this section was adapted from. Appendix B 128 Export interface Import interface Functional capability Empty seed Lossless consolidation Reconciliation Semantic Precision ISO 15926 is all about driving out ambiguity to reduce the risk and cost of information exchange. Semantic Precision is the most technically significant of the categories outlined in the previous table. This category compares the way information is understood by an organization to what was intended by ISO 15926. This category recognizes that it is possible to use ISO 15926 terminology incorrectly. An extreme example is using the word pressure in a database mapping exercise to exchange temperature values. As long as both parties to the exchange knew what the text string pressure really meant, the exchange would work—but using terminology this way is misleading to new users and would add a great deal of uncertainty and risk to future exchanges using the same database maps. Part 7 templates have what is known as a signature, which removes the ambiguity by providing a mechanism by which a system can recognize what the template is intended to deal with. Thus, in the extreme example previously cited the user would never be presented with a “pressure” template when working with temperature. At the receiving end, the software would recognize the template as dealing with temperature and would act accordingly. There are different ways of recognizing template signatures. This category gives increasing credit as their reliability increases. • Dictionary and typing level: Templates have a name, are specialized from a parent class, and have a classification. This level of compliance simply accepts these values from the template itself. • Shortcut relations level: This level uses the shortcut relations to determine what the template does. • Full onotology level: This level uses the full ontology along with the signature to identify the template. Representation Technology This category compares the manner in which the information exchange is structured. Appendix B 129 Any File This exchange uses a neutral file, as opposed to, for example, having one application writing to the database of another. This removes one potential failure point; namely, that one or the other application changes and thus breaks the information exchange. There are many ways of encoding information. Fixed-width text file, comma-delimited text file, and spreadsheet are three. The common denominator is a proprietary structure that users have to examine. Any structure can be made to work as long as both parties understand it in advance. Proprietary XML Using XML to encode information is a step up from a plain-text file or spreadsheet. New users will still have to examine the terminology used, but the structure of an XML information exchange is easier to deduce. Reference Data Library Registered XML Schema A registered XML schema eliminates the need to examine it closely for every information exchange. Users will still have to examine it and thoroughly understand it the first time, but thereafter should not need to. This meets one of the intents of ISO 15926 is that two organizations can independently implement a given standard and thereafter exchange information even though they may never have explicitly intended to work with each other at the beginning. Part 8: OWL When the information you are exchanging is part of an overall ontology, it is easier to figure out what it means simply by examining its position in the ontology. If the ontology conforms to Part 2, exchange partners will be able to understand what your data means without having to ask about it. OWL is a tool designed for the Semantic Web that is used to exchange information stored in an ontology. If your exchange system uses OWL, ontological information will be preserved so that the meaning of the data will be preserved with the data. Referencing Technology Reference data is the lifeblood of ISO 15926. Without reference data there is no objective way to verify the definitions of terms, and where you cannot verify terms there is ambiguity that adds to the cost. In ISO 15926, the reference data is based on Part 4. This category considers the way reference data is implemented. URIs Self-contained in Schema The URI of a term is like a web site for its official definition. The name of each term is unique and has exactly one URI, and therefore the name of the term is synonymous with its URI. One way to implement Part 4 terminology is to hard code the name or URI into the schema. Appendix B 130 URIs Resolvable in Shared Reference Data Libraries Another way to use Part 4 terminology is to have a mechanism that uses a term’s URI to connect to a reference data service, like the RDS/WIP, to verify that it is a valid term. Interface Technology The interface type describes how the information is physically moved. Under the hood, all information exchanges use a file to exchange the data. The key question in this category is how the file comes into existence. File Exchange Here we mean that the sender prepares the information exchange file by some means and sends it to the exchange partner, who then opens and reads the file in some manner. Proprietary Query This category implies that the recipient of the information can directly query the information to create the exchange file instead of having the owner prepare the files. Part 9: SPARQL SPARQL is a protocol developed for the Semantic Web, designed for querying OWL databases. The use of OWL is one way to preserve the meaning of the data. Industrial Standardization This category assumes that a reference data library (RDL) is used for the information exchange and measures the manner in which it is certified. The boundaries, although somewhat arbitrary, show increasing global authority. Community Sandbox Here, a community will create its own RDL. The group manages the content on its own. Industry Level Here, the community is larger—encompassing an entire industry. At this level there will likely be some formal method of creating and vetting new terms. PCA/JORD Level This level combines the terminology of many industries. Although JORD does not exist yet, it is expected that there will be a stringent vetting procedure to ensure that the terminology Appendix B 131 conforms to the intent of ISO 15926. ISO This level is reserved for formally adopted, world-wide standards. Subject Matter Scope This category anticipates one or more industry-specific information guides to assist users in deciding the scope of the information that needs to be exchanged. This will not create any new terminology to add to an RDL, but will rather suggest which RDL items need to be addressed to accomplish some business purpose. For instance, the American Petroleum Institute (API) may recommend specific RDL terms to use when exchanging information about API pumps. When a standard scope of information is exchanged, the uncertainty (the ambiguity) is reduced. The Business Interfaces Definition Guide (BIDG) was originally the title of a particular handover guide being developed for the process industries, but the document was issued under a different name. Now the term is still used but has effectively come to be an umbrella for a number of industry-specific guides. For measuring compliance to ISO 15926, this category means that you are following one such guide—and that the guide specifies ISO 15926 terminology. Metadata Scope When data is exchanged as part of business activities, there are usually obligations and contractual responsibilities between the parties. With each information exchange, metadata (data about data) is in fact created (such as the time of exchange)—which may have to be recorded manually if there is no automatic mechanism. This category measures how the metadata is captured and recorded. Explicitly Identifiable Elements This category means that metadata is indeed exchanged (as opposed to only the data values themselves) and that the metadata explicitly identifies the data values to which it belongs. Versioning This category means that in addition to the identity of the data the vintage, or version, of the data is maintained. Status This category means that in addition to the identity and versions of the data the status of the data is maintained. Appendix B 132 Functional Capability This category measures the capability of the system to send and receive data. Export Interface This is the ability of a system to deliver information independently of data scope and interface technology. This is accomplished by publishing the data or allowing reading or querying. Import Interface This is the ability of a system to receive information independently of data scope and interface technology. This is accomplished by allowing external applications to write to the database, or some means of querying external databases. Seed Sometimes an instance is created as a placeholder, with no initial content. This category means that the placeholder can be populated with imported data without loosing anything. Lossless Consolidation This means that an existing instance that contains data can be updated with new imported information, correctly handling versions and duplicate information. Reconciliation This means that when an instance is updated with internal information any reconciliation with external identifiers is retained. Appendix B 133 APPENDIX C: EXAMPLES OF DATABASE MAPPING In this appendix we will work through some examples of mapping databases directly together, using a custom dictionary and the Reference Data Service/Work in Progress (RDS/WIP). We will also introduce you to the use of templates from Part 7. As far as an introduction to ISO 15926, this section is definitely “extra credit.” You do not have to read this to understand the concepts of ISO 15926. Chapter 1 provided a brief introduction to the topic of mapping two databases together. In Chapter 3, we talked about mapping during the discussion on the levels of compliance to ISO 15926. There we gave a number of examples of organizations exchanging information with increasing levels of implementation of ISO 15926. In this appendix, we will continue with these examples—showing how to do the mapping that is described in each example. Please bear in mind that these examples are highly simplified for the purpose of explaining concepts. In the real world, this is a complex enough subject that many professionals make a very good living just connecting databases to each other. The entire point of ISO 15926 is that anyone using the standard’s protocols can exchange information with anyone else using the protocols and not have to know anything else about the other’s system. Just as two people who do not know each other can nevertheless communicate if both use the same rules of grammar and the same dictionary, two parties in a data exchange can understand each other’s information if both use the ISO 15926 data model, Part 2 (or Part 7, which implies Part 2), and the classes, or dictionary, Part 4. We will compare each set of mapping examples to this vision for ISO 15926. Example Set 1: Let’s Just Exchange Raw Data and Figure It Out Together In this first set of examples we will look at mapping two databases together directly. Example 1.1: Map Two Databases Together, Simple Figure C.1 shows the work process of exchanging information by mapping two databases together. At the beginning of the exchange, neither party knows anything about the other’s system. • ACME selects the data to exchange and exports it to an intermediate file (a “data dump”)— perhaps in a comma-delimited text file or a spreadsheet. • ACME sends the data dump to EMCA. • EMCA cannot use the information because it does not know what all of it means. • Each company assigns an engineer to collaborate to make a custom map so that EMCA can import the data. • EMCA uses the map to import the data. Appendix C 134 Fig. C.1 Figure out each exchange every time. Roles In Chapter 5, we identified two roles required for basic dictionary-level compliance. • A subject matter expert, (SME) who thoroughly understands the business in which the two applications are involved. This person must understand the meaning of all business objects involved to ensure that the correct data is transferred. • A database administrator to create the mapping tables between each application and the Part 4 terms. The most important is the SME. This person will examine the information and decide which objects in one database are equivalent to those in the other. Our first example (Figure C.2) is very simple. We will use a portion of a valve list from one of our favorite companies, ACME (which we will pretend is an EPC contractor). ACME ID 85457 96058 48981 Dwg_No 4567-32-17 4567-32-17 4567-32-17 Rating 150 300 600 Table Name: Valve_List PID No Spec 1234-32-45 A373 1234-32-45 B893 1234-32-45 C578 Line Tag 10-32-A373-12-1438 10-32-B893-10-0378 10-32-C578-10-2155 Pipe Size 12 10 8 Valve Tag 32-1269 32-2958 32-7639 Fig. C.2 EMCA valve list table. A valve list is a list of all valves used for a particular scope of work or project. Design engineers use the valve list to ensure that each is properly specified. Procurement officers use it to make sure that all valves have been purchased. Construction contractors use it to ensure that they have received all of the valves and that all valves have all been installed. The goal here is to transfer the information from ACME to our other favorite company, EMCA (which we will assume is a construction contractor), by mapping the databases together and pushing a button to execute the exchange electronically—rather than exchanging paper documents and having a person rekey the values. Figure C.3 shows EMCA’s valve list, currently empty. Appendix C 135 EMCA Index No Dwg_No Rating Table Name: Table_32 PID Mat_Spec Line_Tag Diameter Tag Fig. C.3 EMCA valve list table. The job of the SME of both ACME and EMCA is to understand each term (i.e., the database column names) of both companies and decide which of them are equivalent. The convention is to ignore the data rows and put the column names from each database into a two-column table, with the table name on the left and the column names on the right. Figure C.4 shows what this looks like. Table Name Valve_List Valve_List Valve_List Valve_List Valve_List Valve_List Valve_List Valve_List ACME Column Name ID Piping DRG Class PID_No Spec Line_Tag Pipe Size Valve Tag Table Name Table_32 Table_32 Table_32 Table_32 Table_32 Table_32 Table_32 Table_32 EMCA Column Name Index No Dwg_No Rating PID Mat_Spec Tag Diameter Tag Fig. C.4 ACME and EMCA valve list column names. To show equivalence, the SME will arrange the column names of the respective tables into the same horizontal row. Specialized software will be able to read this list and copy the data values from the ACME table/column to the EMCA table/column. In this very simplified example, we can see that the column names are descriptive of their content—and although not identical in both tables it is easy to determine equivalence. Also note that, uncharacteristically, they are already arranged horizontally to show equivalence. The job of the SME is finished. The application experts still have to figure out how to get the information out of ACME’s database and into EMCA’s, and the database experts still have to configure the data migration. Although these jobs may be quite complex, depending on the nature of ACME’s and EMCA’s systems, they are straightforward conceptually. Example 1.2: Map Two Databases Together, Harder The previous example was trivially simple in order to convey the concept of mapping two databases together without getting bogged down with details. In this example, we will use a more realistic pair of valve tables. Figure C.5 shows what the two valve lists might really look like. Appendix C 136 Table Name VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST ACME Column Name AREA CLASS COMP ID DATASHEET DS_ID LINE TAG MANUFACTURER MATERIAL MODEL NO OP PRESS OP TEMP PID NO PIPE SIZE PIPING DRG RUN SERVICE SPEC STATUS TAG TAGTYPE VALVE TAG VALVE TEXT VALVE TYPE VCODE VINT1 VNUM VSIZE VUNIT Table Name TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 EMCA Column Name CLIENT_SYSTEM CLIENT_UNIT COM_GRP DES_RESP DIAMETER DIAMETER_UOM DWG_NO HEAT_TRACE HOLD INDEX_NO LINE_NUMBER LINE_TAG MAT_SPEC PID PID_LOCATION PID_REV PROJ_NO QUANTITY RATING SEQ_NO STARTUP SUFFIX SYSTEM TAG UNIT VALVE_LOCK_DEVICE VALVE_NUMBER VALVE_TYPE Fig. C.5 More realistic column names. In this example, the SMEs have much more work to do. Each column name has to be examined closely to verify what it means. It is not good enough to look up the name in, say, the Oxford English Dictionary; you need to see how it is actually used by the application. This is because the column names do not actually mean anything to the database software; they are just text strings. (The only thing that really matters to the software is that the names are unique.) Good database design principles recommend that descriptive names be used, but this is just for the convenience of human programmers. Even if the columns were named properly in the beginning, changes to the software over time can effectively change their meaning while leaving the original name unchanged. We will review several column names to show you how mapping is done, and then rearrange them to show which are equivalent. ACME TAG and VALVE TAG Versus EMCA TAG and RECORD_ID We cannot conclude that two columns contain equivalent information just because they use the same word for their name. For example, both ACME and EMCA use the column name TAG. Our first thought may be that this is for a component tag number, similar to an instrument tag number or equipment tag number, but we need to look further. ACME also uses the column name VALVE TAG, and EMCA also uses RECORD_ID. Given these extra names, we cannot determine their meaning simply by looking at the other names on these two lists; we must dig into the software to see how it uses the names. Appendix C 137 We may obtain some clues by looking into the database to see what types of values are stored there, but we may also have to open the software and start experimenting. For instance, in the software that uses ACME’s and EMCA’s valve tables there will be a form (or computer screen) that asks for the valve’s asset identification number. We can enter a fictitious value and then look for it in the database to see where it turns up. For this example, we will assume that ACME’s TAG and EMCA’s RECORD_ID are equivalent— recording what we might call a record index number (a unique numerical value the database software uses to identify a row in the database). We will also assume that ACME’s VALVE TAG and EMCA’s TAG are equivalent, being used to store the valve’s asset identifier. For most in-line valves this value will be empty, or null. When a valve is important enough to track individually, it will be given a tag number that is stored in this column. ACME VNUM Versus EMCA SEQ_NO VNUM and SEQ_NO are two more names we cannot make a decision about by simply looking at the other database column names. For this example, we will assume that this is an optional place to store another tracking number for a valve. Some owner-operators give all of their in-line valves that do not already have a tag number a unique identifier to help track maintenance. When ACME and EMCA work for an owner with such requirements, they use these columns to store the unique number. ACME LINE TAG Versus EMCA LINE_NUMBER and LINE_TAG As with VALVE TAG, it would be easy to jump to the conclusion that ACME’s LINE TAG and EMCA’s LINE_TAG are equivalent. But EMCA also uses the name LINE_NUMBER. All three of them could be used to store the alphanumeric text string we commonly call a line number. Here again we must see how the software processes line numbers and see which database columns it uses. For this example, we will assume that ACME uses LINE TAG to store the full line number. (In Figure C.2 we saw the example 10-32-A373-12-1438.) EMCA uses LINE_TAG for the same purpose. EMCA’s LINE_NUMBER stores the sequential numeric part of the line number (1438 in the line number given previously). ACME PIPE SIZE Versus EMCA DIAMETER and DIAMETER_UOM An important concept to understand is that an organization may have certain assumptions about the world that every employee “just knows.” A possible example of this is the EMCA column name DIAMETER_UOM. We might ask why ACME does not have something like this. Perhaps ACME gets all of its work from one area of the world and uses only one system of units in all of its projects. If this were the case, everyone would just “know” what the units were and there would be no real need to record such a value in the database. Although this is possible, an equally likely answer is that ACME does not store the units of measurement with every measurement. In this example of a valve list, ACME may feel that it is ex- Appendix C 138 tremely improbable that a valve will be measured in one system of units and the piping system to which it is attached measured in another system. The units of measurement may be attached to the pipeline the valve is part of, or it may be a project-wide setting. Like our other examples, the SMEs will have to look at the software to see how it handles units of measurement. The SMEs will similarly examine the rest of the terms and decide which are equivalent. Figure C.6 shows the tables after rearranging column names so that equivalent names are in the same horizontal row. ACME Table Name VALVE_LIST VALVE_LIST Column Name TAG PIPING DRG VALVE_LIST CLASS VALVE_LIST PID NO VALVE_LIST VALVE_LIST SPEC LINE TAG VALVE_LIST PIPE SIZE VALVE_LIST VNUM VALVE_LIST VALVE TYPE VALVE_LIST VALVE_LIST VUNIT VALVE TAG VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VSIZE AREA DATASHEET TAGTYPE VCODE STATUS MATERIAL OP PRESS OP TEMP MANUFACTURER MODEL NO VALVE TEXT VINT1 RUN SERVICE COMP ID DS_ID EMCA Table Name TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 Column Name RECORD_ID DWG_NO STARTUP RATING PID_REV DES_RESP PID HOLD HEAT_TRACE PROJ_NO LINE_NUMBER MAT_SPEC LINE_TAG QUANTITY DIAMETER DIAMETER_UOM VALVE_LOCK_DEVICE VALVE_NUMBER SEQ_NO CLIENT_SYSTEM COM_GRP SYSTEM PID_LOCATION VALVE_TYPE SUFFIX UNIT TAG CLIENT_UNIT Fig. C.6 Two valve lists showing equivalence. You will notice that many of the column names do not have a corresponding name in the other database. There could be a number of reasons for this. It could be that the nature of the business of each of the two organizations means that each pays attention to different things. An Appendix C 139 example of this is EMCA’s column name HOLD. If EMCA’s work process places holds against valves during the course of its own work, it may not need the value to come from ACME. Another example is the ACME DATASHEET column name. Because it is an EPC contractor, it needs to know exactly how each valve is manufactured and what each part is made of. This information is usually recorded on a data sheet, and recording the datasheet number with the valve record ensures that future engineers will be able to find it. For its part, EMCA may simply assume that the data sheet for a tagged valve is always named after the valve tag number and therefore there is no point in recording it. (Note that this assumption may in fact not be true. Nevertheless, if EMCA believes it and designs its databases around that belief we have to work with EMCA’s assumptions.) How the SMEs handle this “missing” information depends a great deal on what EMCA needs to use the information for. It could be that the information ACME’s valve list has in common with EMCA’s valve list is all that EMCA needs. For example, because the design of the valve is not in EMCA’s scope of work it would have no need to track the operating temperature and pressure for each valve. (It would need to know the testing pressure of the entire pipeline, but this information is probably on the line list.) On the other hand, if EMCA actually needs some of the “missing” information the SMEs will have to ask their respective application and database experts where to get it. We will discuss this in the next example. Example 1.3: Map Two Databases Together, More Complex So far the examples have assumed that equivalent database tables all have columns for equivalent values. This is almost never the case. We do not wish to turn readers into database administrators, but we will give an example by extending the previous discussion of the units of measurement. In this example, we will assume that EMCA needs the DIAMETER_UOM field to be populated for its software to work. ACME will almost certainly have a record of the units of measurement somewhere, and probably for the same reason EMCA needs it, but ACME’s software does not store it in the valve list. ACME’s SME will need to enlist the help of application and database experts to figure out how to provide EMCA with the information it needs. There are at least two options. If ACME’s SME knows absolutely for sure that 100 percent of the dimensions are, for instance, in Imperial measure, the database expert could write a small program to simply add that value to the database table as it is being exported. The problem, as you may have guessed, is with the “absolutely 100 percent” part. You can never know whether or not some specialized valve, available from only one place in the world, has a unique metric size. A more reliable way to solve the problem is to figure out where the ACME records the units of measurement, and then pull it in from there. It could be anywhere. It could be attached to the line number the valve is associated with, a single project setting, or anything in between. For this example, we will assume that the appropriate unit of measurement comes from the line number. However, for illustrative purposes we will throw in a wrinkle. We will pretend that the software does not store this in a table based on the alphanumeric text string we call a line Appendix C 140 number but in a separate table based on an arbitrary “pipe run number.”1 Figure C.7 shows what this might look like. ACME Column Name . VALVE_LIST LINE TAG . Table Name Table Name LINE TAG RUN NO RUN NO UOM TABLE_32 TABLE_32 EMCA Column Name . LINE_TAG . DIAMETER_UOM Fig. C.7 Finding units of measurement. What the database expert has to do, then, is to write what is called a query. Following the arrows from the left-hand side (Figure C.7), the query basically says: a.Given this LINE TAG, what is the RUN NO? b.Given this RUN NO, what is the UOM? c.Copy the UOM to DIAMETER_UOM of the table we export for EMCA. Comparison to the Vision of ISO 15926 These examples are like two people who speak different languages attempting to communicate by pointing at something and saying its name. After a while both of them would understand what each other’s individual words mean, but would still be a long way from fluent communication. For a one-off situation this may be acceptable, and if the two parties never had any need to communicate in the future this may even be the preferred solution. However, we must understand that all of this work of pointing to things and saying their names will have to be repeated if the two participants want to bring more applications into their federation—and if they envision including more partners. This amounts to point-to-point mapping and is very much against the vision of ISO 15926—and in fact was the reason the standard was first proposed. Example Set 2: Wouldn’t It Be Better to Agree on Common Definitions? Example 2.1: Creating a Custom Dictionary Using Arbitrary Names One problem in mapping databases directly together is that you have to go through the same process every time you connect the databases. Even if you keep the map around, after awhile people change (or they forget) and they have to go back to the applications and figure out all over again what the attributes mean. One way to manage this is to create a data dictionary. In Figure C.8, ACME and EMCA exchange information often enough that there is an opportunity to streamline the operation. They do not want to have to consult with each other with every information exchange and start from scratch, so they agree on two things. 1. Among other things, doing it this way allows users to change the line number if they have to. Appendix C 141 • They will spend a bit of time to develop a common set of terminology. • They will not exchange raw data dumps. The sender will translate its information payload into a “neutral file” using standard terminology the receiver will be able to understand. Fig. C.8 Common reference data library. This is the beginning of a data dictionary, or reference data library. Start by assembling the terminology. • ACME and EMCA collaborate to create the dictionary of database terms and to decide the format of the neutral file. • Both ACME and EMCA use the dictionary to create their own database maps. • ACME selects certain data and exports it using the database map to translate it to the agreed neutral file format. • EMCA receives the intermediate file and imports it into its systems, using its database map to translate it. Next, create a list consisting of a number of standard definitions, with a particular word associated with each definition. The associated word can be completely arbitrary. For instance, at a gathering to discuss techniques of mapping databases together one of the instructors—in a moment of inspiration—said, “If you have to translate something from one language to another using a third language as an intermediate, you can use any language you wish—Klingon— whatever!” To make a point, we will do just that. Table C.1 shows a possible dictionary of the terms both ACME and EMCA have in common. TABLE C.1 STANDARD DEFINITIONS USING KLINGON NUMBERS Name Definition wa’ Unique identifier for a row in a database. cha’ The alphanumeric text string that uniquely identifies an orthographic drawing. Appendix C 142 wej The nominal pressure rating for a piping system or component. loS The alphanumeric text string that uniquely identifies a piping and instrumentation diagram (PID). vagh The material specification for a pipeline. jav The alphanumeric test string that uniquely identifies a pipeline. Soch The nominal diameter of a pipeline. chorgh The units of measurement of the nominal diameter of a pipeline. Hut Maintenance tracking number for hand valves. wa’maH The general category to which a valve belongs (e.g., gate or globe). wa’maH wa’ The geographic or functional area of a plant; commonly called “unit.” wa’maH cha’ Asset tracking number for individually designed components; commonly called “tag number.” The second step for ACME is to create a database map (this time using the dictionary names), and then using the map export the information to a table that will be sent to EMCA. Figure C.9 shows what the table might look like. Figure C.10 shows how EMCA will receive the neutral file. EMCA will also create a database map using the dictionary names, but will instead use it to import the neutral file into its database. ACME Table Name VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST Column Name TAG PIPING DRG CLASS PID NO SPEC LINE TAG PIPE SIZE VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VNUM VALVE TYPE VUNIT VALVE TAG Klingon Database Map Column Name TAG PIPING DRG CLASS PID NO SPEC LINE TAG PIPE SIZE UOM VNUM VALVE TYPE VUNIT VALVE TAG Dictionary Name wa' cha' wej loS vagh jav Soch chorgh Hut wa'maH wa'maH wa' wa'maH cha' Neutral File Fig. C.9 Data export using a Klingon neutral file. Appendix C 143 Neutral File Klingon Database Map Dictionary Name wa' cha' wej loS vagh jav Soch chorgh Hut wa'maH wa'maH wa' wa'maH cha' Column Name RECORD_ID DWG_NO RATING PID MAT_SPEC LINE_TAG DIAMETER DIAMETER_UOM SEQ_NO VALVE_TYPE UNIT TAG EMCA Table Name TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 Column Name RECORD_ID DWG_NO RATING PID MAT_SPEC LINE_TAG DIAMETER DIAMETER_UOM SEQ_NO VALVE_TYPE UNIT TAG Fig. C.10 Data import using a Klingon neutral file. Example 2.2: Creating a Custom Dictionary Using an Ontology Of course your organization would not really use Klingon for dictionary terms; it would probably use a collection of nouns and adjectives that were themselves descriptive of the terms they describe.2 For a single, small information exchange you could probably live with ad hoc names. But as you add applications to your federation of applications you will probably run across many terms that are similar, with only subtle variations. As you add definitions to your dictionary, you may find that some of your original names overlap with the meaning of new terms and so become misleading. One strategy is to create an ontology of names that contains all of the terminology at your organization. Then, when a new application is added to the federation it can simply pull the correct term from the list. Table C.2 shows part of a possible ontology that covers the information in the ACME-EMCA exchange. TABLE C.2 PART OF AN ONTOLOGY OF TERMS asset_tracking_no component_valve_type database_row_id drawing_no drawing_no_pid drawing_no_pid_revision_no drawing_no_piping drawing_no_piping_iso 2. On the other hand, your organization would gain instant street cred if they were to do so. Star Trek gatherings are heavily overrepresented by geekish, nerdish types; just the types that make up most of your IT department. You may in fact find that your database administrator can read Klingon directly. Appendix C 144 drawing_no_piping_ortho maintenance_tracking_no piping_line_no piping_line_no_sequence piping_material_spec piping_nominal_diameter piping_uom plant_id pressure pressure_class pressure_design pressure_test production_train_id production_unit_id row_id<End TB> From here it is a simple matter to construct a database map and export the data to a neutral file, just as we did with the Klingon dictionary. Figures C.11 and C.12 show what this might look like. ACME Table Name VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST Column Name TAG PIPING DRG CLASS PID NO SPEC LINE TAG PIPE SIZE VALVE_LIST VALVE_LIST VALVE_LIST VALVE_LIST VNUM VALVE TYPE VUNIT VALVE TAG Database Map Column Name TAG PIPING DRG CLASS PID NO SPEC LINE TAG PIPE SIZE UOM VNUM VALVE TYPE VUNIT VALVE TAG Dictionary Name row_id drawing_no_piping_ortho pressure_class drawing_no_pid piping_material_spec piping_line_no piping_nominal_diameter length_uom maintenance_tracking_no component_type_valve production_unit_id asset_tracking_no Neutral File Fig. C.11 Data export using an ontological neutral file. Appendix C 145 Neutral File Database Map Dictionary Name Column Name row_id RECORD_ID drawing_no_piping_ortho DWG_NO pressure_class RATING drawing_no_pid PID piping_material_spec MAT_SPEC piping_line_no LINE_TAG piping_nominal_diameter DIAMETER length_uom DIAMETER_UOM maintenance_tracking_no SEQ_NO component_type_valve VALVE_TYPE production_unit_id UNIT asset_tracking_no TAG Table Name TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 TABLE_32 EMCA Column Name RECORD_ID DWG_NO RATING PID MAT_SPEC LINE_TAG DIAMETER DIAMETER_UOM SEQ_NO VALVE_TYPE UNIT TAG Fig. C.12 Data import using an ontological neutral file. We have used a very simplified example of an ontology. In fact, what is shown is not really an ontology but an ordered list. Users are simply agreeing to use a particular English word to mean a particular thing instead of using a Klingon number. Although certainly a big step over the Klingon-based dictionary, it would not direct a new user unambiguously to the correct term every time. A discussion with other users would still be required to be certain that some terms were being used correctly. Arranging the terms in a real ontology would help. Figure C.13 shows part of an ontology for pumps. Fig. C.13 Part of an ontology of pumps. Using this ontology of pumps (arranged the way it is) with the proper accompanying definitions, new users will be more likely to choose the correct term without having to talk to anyone. The bad news is that if you have never created an ontology your first few attempts will likely require reworking every time you add a new application to your federation. An easier way is to use an ontology that someone else has developed. In the next example, we will introduce such an ontology based on Part 4. Comparison to the Vision of ISO 15926 Building a custom dictionary means that information exchange partners will not have to learn everything all over again every time they wish to exchange information. New partners are a bit better off because they do not have to start from scratch, but they still have a fair amount of work to do. They will still have to review the dictionary to thoroughly understand the terms, Appendix C 146 and may find that some terms conflict with their own understanding. Worse, every time a new member joins the federation there is potential rework for all members. What everyone needs is an industry standard dictionary, such as Part 4, so that they only have to go through the pain once. Example Set 3: Why Don’t We Use Industry Standard Definitions? Using Part 4 definitions is the minimum standard for compliance with ISO 15926. Example 3.1: Using the RDS/WIP as a Dictionary In this example, ACME and EMCA wish to use an industry standard dictionary to create the neutral files they use for information exchange. Being able to export information into an industry standard format will allow them to exchange information with other organizations as well. The RDS/WIP is just such a dictionary. It was commissioned in the mid 2000s as an experimental reference data service anyone could use as a reference—and with a little bit of training as a source for adding new terms. • Both ACME and EMCA collaborate and agree to use the core classes in Part 4 and to use the rules in Part 4 to extend the classes. • Both use the Part 4 dictionary to create their own database maps. • ACME selects certain data and exports it using the database map to translate it to the agreed neutral file format. • EMCA imports the information using its database map to translate the information into something its systems understand. Level of Collaboration Required The major difference here is that instead of consulting a growing proprietary dictionary all who are involved refer to the public standard. However, partner organizations will still need to have a discussion about how they will use the core terminology whenever two terms mean almost the same thing. New applications added to the confederation will still be subject to a detailed discussion. In addition, a new application may use existing terminology in slightly different ways. If so, the partner organizations will have to decide whether or not to change the term or to modify the new application. But changing the meaning of a term could have a serious effect on existing applications. New Role There is a new role here. Someone has to learn how to use the RDS/WIP.3 • The RDS/WIP modeler, who will learn how to use the RDS/WIP, how to read the taxonomy of existing terms, and how to prepare new submissions. 3. Remember that the RDS/WIP is experimental. When ISO 15926 is mature, there will be a selection of RDLs to choose from. Appendix C 147 At the end of this appendix we have included a brief introduction to the RDS/WIP, but basically you just search for a term and find the closest match. If two or more terms are close in meaning, you will have to negotiate with your business partner to decide which to use. If a new term is required, you add it. Once you decide on the term, you will actually use the universal resource indicator (URI; more-or-less the term’s serial number) instead of the term itself. Table C.3 shows a possible list of terms from the RDS/WIP that might be used as for the valve list. TABLE C.3 ATTRIBUTES FROM THE RDS/WIP Label RDS/WIP URI Description IDENTIFIER http://rdl.rdlfacade.org/data#R32813486958 http://rdl.rdlfacade.orgdata#R16893283050 An identification code for a drawing or a class of drawings http://rdl.rdlfacade.orgdata#R45649298385 Classes of standard pressure ratings (i.e., the maximum allowable non-shock working gage pressure at a given temperature and a specific material) http://rdl.rdlfacade.org/data#R99486931975 A diagram showing a schematic representation of a process system, including instrumentation MATERIAL SPECIFICATION http://rdl.rdlfacade.orgdata#R54400593721 A specification that describes the applicable materials for one or more items PIPELINE OFFICIAL IDENTIFIER http://rdl.rdlfacade.org/data#R83498125174 An identification code that identifies a pipeline http://rdl.rdlfacade.orgdata#R94482935208 Intercept made by the circumference on a straight line through the center of a circle http://rdl.rdlfacade.org/data#R5817703946 A role used in ST 3401 to designate the applicable scale in which the numerical value is expressed DRAWING IDENTIFICATION CODE PRESSURE RATING CLASS P AND I DIAGRAM DIAMETER UNIT OF MEASURE Appendix C 148 http://rdl.rdlfacade.orgdata#R73944050980 A number assigned following an order of succession PR3 1.10 VALVE TYPE/ DESIGN http://rdl.rdlfacade.org/data#R49125478120 Assignment of valve type (e.g., safety, relief, or safety relief), and design (e.g., conventional, bellows, or pilot) FUNCTIONAL UNIT IDENTIFIER http://rdl.rdlfacade.org/data#R14322658775 An identification code that identifies a functional unit TAG IDENTIFICATION CODE http://rdl.rdlfacade.org/data#R99047027503 An identification code for a functional_ physical_object. SEQUENCE NUMBER Figure C.14 shows how ACME will use these terms to create a map with which to export the data to a neutral file. Figure C.15 shows how EMCA will create its own map with which to import the data from the neutral file. Note that instead of English descriptive names we are using long numerical text strings. Astute readers will notice that this is conceptually the same as using Klingon. And if that is where we stopped, the readers would be correct.4 ACME Database Map Table Name Column Name Column Name Dictionary Name VALVE_LIST TAG TAG http://rdl.rdlfacade.org/data#R32813486958 VALVE_LIST PIPING DRG PIPING DRG http://rdl.rdlfacade.org/data#R16893283050 VALVE_LIST CLASS CLASS http://rdl.rdlfacade.org/data#R45649298385 VALVE_LIST PID NO PID NO http://rdl.rdlfacade.org/data#R99486931975 VALVE_LIST SPEC SPEC http://rdl.rdlfacade.org/data#R54400593721 VALVE_LIST LINE TAG LINE TAG http://rdl.rdlfacade.org/data#R83498125174 VALVE_LIST PIPE SIZE PIPE SIZE http://rdl.rdlfacade.org/data#R94482935208 UOM http://rdl.rdlfacade.org/data#R58177039466 VALVE_LIST VNUM VNUM http://rdl.rdlfacade.org/data#R73944050980 VALVE_LIST VALVE TYPE VALVE TYPE http://rdl.rdlfacade.org/data#R49125478120 VALVE_LIST VUNIT VUNIT http://rdl.rdlfacade.org/data#R14322658775 VALVE_LIST VALVE TAG VALVE TAG http://rdl.rdlfacade.org/data#R99047027503 Neutral File Fig. C.14 Data export using RDS/WIP class names. 4. In fact, if the organizations were to hard-code the definitions they would be better served to use the “Label” from Table C.3 rather than the URI. We used the URIs in order to introduce the next section. Appendix C 149 Neutral File Database Map EMCA Dictionary Name Column Name Table Name Column Name http://rdl.rdlfacade.org/data#R32813486958 RECORD_ID TABLE_32 RECORD_ID http://rdl.rdlfacade.org/data#R16893283050 DWG_NO TABLE_32 DWG_NO http://rdl.rdlfacade.org/data#R45649298385 RATING TABLE_32 RATING http://rdl.rdlfacade.org/data#R99486931975 PID TABLE_32 PID http://rdl.rdlfacade.org/data#R54400593721 MAT_SPEC TABLE_32 MAT_SPEC http://rdl.rdlfacade.org/data#R83498125174 LINE_TAG TABLE_32 LINE_TAG http://rdl.rdlfacade.org/data#R94482935208 DIAMETER TABLE_32 DIAMETER http://rdl.rdlfacade.org/data#R58177039466 DIAMETER_UOM TABLE_32 DIAMETER_UOM http://rdl.rdlfacade.org/data#R73944050980 SEQ_NO TABLE_32 SEQ_NO http://rdl.rdlfacade.org/data#R49125478120 VALVE_TYPE TABLE_32 VALVE_TYPE http://rdl.rdlfacade.org/data#R14322658775 UNIT TABLE_32 UNIT http://rdl.rdlfacade.org/data#R99047027503 TAG TABLE_32 TAG Fig. C.15 Data import using RDS/WIP class names. Example 3.2: Using the RDS/WIP to Validate an Information Exchange In the previous example, ACME and EMCA used the RDS/WIP as a dictionary—simply using the URIs of the definitions to create their map. In this example, they wish to use the RDS/WIP for real-time validation to make sure the URIs are in fact valid terms. Figure C.16 shows how this will work. The collaboration here is the same as that of the previous example, with two new activities. • ACME validates its data against the core classes in the Part 4 RDL—using the RDS/WIP to verify that it is compliant before sending it out. • After EMCA receives the intermediate file, it validates the terminology against the RDS/ WIP to verify compliance. Appendix C 150 Fig. C.16 Use ISO 15926-4 for information exchange. New Role • ACME and EMCA will each need a programmer to write the code for checking database terms with the RDS/WIP. (A market for such tools has emerged and there are now a number of commercial and open-source tools to make this task easier.) Comparison to the Vision of ISO 15926 Readers may wonder what the difference is between hard coding terminology into a database map and validating with “live” URIs as the exchange is made. If the hard-coded term is identical to the URI in the RDS/WIP, what does it matter? The answer is that if you were to exchange information only between a few well-known applications or a few well-known business partners there would be no effective difference between hard-coded and live URIs. But the vision of ISO 15926 is that any two organizations will be able to exchange information with a minimum of preparation required. Having both the sender and receiver validate the URIs during the exchange increases each party’s confidence in the information and reduces ambiguity. Example Set 4: Would It Help if I Told You How I Was Using the data? Although using industry standard definitions (and Part 4 definitions in particular) is the minimum requirement for compliance with ISO 15926, there is still room for ambiguity. For in- Appendix C 151 stance (to reuse an example from the Introduction), just because a native Mandarin speaker and a native Cantonese speaker understand English well enough to use it as an intermediary language it does not necessarily mean they will be able to communicate seamlessly from the very beginning. Because they come from different cultures, they may find that they use some words differently—which means that when first communicating there must be a time when definitions are verified. For others to know what we mean by certain terms, without having to go through the process of verifying what we mean, we need a way to embed the context in which we use these terms into the data. In addition to using Part 4 to make sure our terminology matches the industry standard, when we also use the “syntax and grammar” of ISO 15926 (which is Part 2) we are doing just that. Using Part 7 templates makes it easier to use Part 2 correctly. Exchanging information with Part 7 templates is fundamentally different from dictionary mapping. The purpose of this example is to give you a glimpse of how different it is, without becoming a “How to” lesson. This example will show how the use of part 7 templates means that the meaning (or context) of the data is stored with the data so that we do not have to engage our business partners in discussions of what the data means before sending information. We have seen a trend toward this throughout the examples in this appendix. We started off with mapping one database directly to another, where both parties had to know a great deal about each other’s systems in order to trust that the information was being exchanged correctly. From there we went to custom dictionaries (where only the dictionary writers needed such intimate knowledge), and then to external RDLs—where we only needed knowledge of the published RDL term. In this example, we will also use the RDL terminology (and still have to thoroughly understand what they mean)—but because we will be using Part 7 templates we do not have to engage our business partner in discussions of the form of the data or even the precise content; we just send “a bunch of stuff”, and our partner will be able to figure it out. This is the essence of ISO 15926: we can exchange information with anyone without having to know anything about their internal systems beforehand. Example 4.1: Using Templates to Exchange Information Figure C.17 shows ACME and EMCA using Part 7 templates to structure their data so that its meaning will be part of the exchange. ACME and EMCA will still use Part 4 as the data dictionary, and will still use the RDS/WIP to validate the information on each side of the exchange, but in addition they will structure their data according to Part 2. The easiest way to do this is to use the base templates in Part 7 to build the database maps with which they will export (or import) the neutral data exchange file. • ACME and EMCA collaborate and agree to use Part 7 (which implies Part 2) and Part 4. • Each organization creates information models, or maps, to transform its internal data structures to something that follows the ISO 15926 protocols. They use the base templates in Part 4, and the methodology in Part 7, to create new templates when required. • ACME selects certain data and exports it using the database map, based on Part 7 templates, to translate it to the agreed neutral file format. As it is assembling the neutral file, the templates validate the information against the core classes in the Part 4 RDL (using the RDS/WIP). • ACME sends the neutral file to EMCA. Appendix C 152 • EMCA receives the neutral file and processes it in reverse. • EMCA uses its database map, based on Part 7 templates, to translate the information in the neutral file into something its systems understand. EMCA’s templates understand how to read the meaning from the structure of the data in the neutral file. Fig. C.17 Use ISO 15926-7 for information exchange. Level of Collaboration Required The level of collaboration required in this case is much less than in the previous case. After both companies implement Part 7, all they will need to know from each other is the general nature of the information to be exchanged. Neither party has to know anything at all about the way the other creates and uses the information because the context of the information is contained within the information itself. The addition of Part 7 adds some initial work to fully understand the landscape of different applications and model them properly, but thereafter the actual data exchange becomes much simpler. New Role • An information modeler will map business objects in the applications to the prepared templates in Part 7. Metaphor So that you can understand what the exchange is trying to accomplish, we will start with a Appendix C 153 metaphor about making inferences. Suppose we were to make this assertion: Barack Obama - is the - President of the United States. Most people today who read the news will have no trouble understanding what this statement means. But suppose your name is Valentine Michael Smith and you have just been repatriated to Earth after being born on Mars and raised by Martians for twenty years.5 You would not even have the concept of “government,” let alone of “president”—and the political divisions the rest of us know as countries would not occur to you naturally, so you would have no way to understand this. However, if someone were to tell you that Barak Obama is human you would be able to infer that he has hair simply because you are human too and have hair. Moreover, you would be able to deduce this well before grokking any earthling political nonsense. Similarly, in this example ACME will create some information about a pressure transmitter and send it to its business partner EMCA—which makes pneumatic tubes for sensing devices. However, EMCA does not understand “pressure transmitter.” Why? Well, perhaps they are Martians, but a more plausible explanation is that they do not care about pressure transmitters as a separate class of devices for which they make pneumatic tubes. All they care about is the pressure rating and connection type. ACME ACME’s intention is to pass information about a pressure transmitter, which has the tag number PT-101 and 3/4-inch NPT connections, to EMCA—which will fabricate some pneumatic tubing to connect to it. ACME will first identify the device using the template ClassificationOfIndividual, which has two columns (or roles, Individual and Class)—as shown in Figure C.18. ClassificationOfIndividual Individual Class xyz URI of the class Pressure Transmitter Fig. C.18 Use of the ClassificationOfIndividual template. Individual: xyz xyz is the identifier ACME uses internally for the individual it will later identify as PT-101. It uses the alias for a number of technical reasons, one of them being the potential need to change the name PT-101. If PT-101 changed to, say, PT-2101, ACME would only have to change this in one place (as we shall see) instead of many places. 5. The author, who reads far too much science fiction for his own good, believes that a metaphor involving a human being raised by Martians to be totally plausible. If you read Stranger in a Strange Land, by Robert Heinlein, the metaphor will make more sense. Appendix C 154 Class: The URI of the Class Pressure Transmitter In our previous examples, we showed how ACME could interrogate the appropriate RDL to find the URI of a given term or class. If humans were intended to read this, ACME might consider using the English description of the class. However, because this template is intended for a computer the URI is actually easier to use. Once the transmitter has been classified, the next step is to identify it using the template ClassifiedIndividual (Figure C.19)—which has three roles: Individual, Name, and ClassificaitonOfIndividual. ClassifiedIndividual Individual Name ClassificationOfIndividual xyz PT-101 URI of, say, ISA Fig. C.19 Use of the ClassifiedIndividual template. Individual: xyz xyz is ACME’s internal identifier. Name: PT-101 Here we can see how the name PT-101 could be changed by changing it in just one place. There is no other place where the text string PT-101 occurs. ClassificationOfIndividual: ISA ISA will tell other users the way the name is to be interpreted. In this case, ACME is using the convention ISA (for Instrument Society of America). But, as before, instead of using the text string ISA it uses the URI of the class ISA. The next templates (Figure C.20) will contain the properties of the transmitter. For this, ACME will use two different templates: IndirectPropertyScaleReal (which has the four roles Individual, Property, Value, and Units of Measure) and IndirectProperty (which has the three roles Individual, Property, and Value). Appendix C 155 lndirectPropertyScaleReal Individual Property Value Units of Measure xyz URI for the class Operating Pressure 100 URI for the class of kPa lndirectProperty Individual Property Value xyz URI for the class End Connection URI for the class of NPT lndirectPropertyScaleReal Individual Property Value Units of Measure xyz URI for the class Connection Size 0.75 Inches Fig. C.20 Use of the IndirectPropertyScaleReal and IndirectProperty templates. EMCA When EMCA receives the information, its system will first look through the data for the template ClassificationOfIndividual, and everything associated with it, using the identifier of the individual (xyz). EMCA could continue to use xyz, but chances are that it has a different internal numbering system and thus each individual will be given different identifiers. For the example of PT-101, we will use abc—as indicated in Figure C.21. ClassificationOfIndividual Individual Class abc URI of the class Pressure Transmitter Fig. C.21 Individual identifiers in the ClassificationOfIndividual template. Class: Pressure Transmitter EMCA’s system does not know what to do with Pressure Transmitter not because its engineers have never heard of such a device but because it has no reason to care. Rather than store all of the different types of devices to which its pneumatic tubes might potentially connect, it simply ignores the class. What EMCA’s system looks for is the templates containing the operating pressure, the thread form of the connection, and the size of the connection—as shown in Figure C.22. Now that EMCA knows the values it is looking for, its system can reform the data into whatever EMCA needs to drive its fabrication and order fulfillment system. Appendix C 156 lndirectPropertyScaleReal Individual Property Value Units of Measure abc URI for the class Operating Pressure 100 URI for the class of kPa lndirectProperty Individual Property Value abc URI for the class End Connection URI for the class of NPT lndirectPropertyScaleReal Individual Property Value Units of Measure abc URI for the class Connection Size 0.75 Inches Fig. C.22 Templates containing operating pressure, thread form of connection, and size of connection. Some Questions Q1. Couldn’t ACME just use a spreadsheet? A1. For this simplistic example, well, yes. ACME could send a spreadsheet of all instrument tag numbers, end connections, size, and thread form. An EMCA engineer would pull out the values that were needed for fabrication and reform them to feed into EMCA’s systems. Q2. So why use templates? A2a. Computers can do this type of work much more quickly and reliably, once we teach them how. This lets humans do more interesting things. A2b. This is a really simple example. The relationships can get quite a bit more complicated. Q3. How do you know which templates to use? A3. Funny you should ask. In Chapter 4, we mentioned the Practical Guide for ISO 15926 Modeling, If you want to learn more about using templates, this should be the next book you read. If you think information modeling using Part 7 templates is interesting, we can tell you that there is going to be a demand for experts who understand the things their industry segment deals with—and who also have an aptitude for abstract reasoning. If this describes you, give someone a call. We’ve got a very interesting career lined up for you! Appendix C 157 Using the RDS/WIP The RDS/WIP was commissioned in the mid 2000s as a proof-of-concept for an RDS. The classes that make up Part 4, the dictionary of ISO 15926, were used as seed material. Since its inception, many definitions have been added to the RDS/WIP. The RDS/WIP is several things. • A library of reference data for ISO 15926 • A means of publishing core ISO 15926 definitions • A platform for developing new ISO 15926 definitions • A workspace for harmonizing other standards with ISO 15926 (or ISO standards with one another) The RDS/WIP is a large triple store in the form subject-predicate-object. It uses semantic web technology (OWL, RDF, and SPARQL)—transported with the conventional web technology HTTP—to provide machine-oriented access to the stored definitions. A conventional HTML presentation is used to provide a human-oriented interface to the same system. Anyone can search the RDS/WIP and find terms, much like in a dictionary. Accredited users can add information to the RDS/WIP. We need to point out here that Part 4 intends that RDLs be federated. Therefore, it is in the best interest of the industry to ensure that each authority be responsible for its own RDL to ensure that the distributed nature of the Internet can be used. Although the RDS/WIP is the largest repository to date, this should be seen as only the first—and it is everyone’s challenge, especially equipment vendors, to ensure they have their own information on their own approved RDL. Web Address for RDS/WIP • rdl.rdlfacade.org. When you navigate to this site with a web browser, it defaults to a search screen you can use to examine the taxonomy of RDS/WIP terminology. RDL Facade Rules Table C.4 outlines RDL facade rules. TABLE C.4 RDL FACADE RULES Rule Purpose Not case sensitive Search strings are not case sensitive search-string Returns all terms containing the full search string ^search_string Returns terms beginning with the search string ^search-string$ Returns a single exact match Appendix C 158 Examples In the search box, search for the items indicated in Table C.5. TABLE C.5 SEARCH BOX ITEMS Item Purpose Temperature RDS/WIP will return all terms containing the word temperature, in groups of 20. TEMPERATURE RDS/WIP should return the same results because it is not case sensitive. ^temperature RDS/WIP will return all terms beginning with the word temperature. ^temperature$ RDS/WIP should return only one term, TEMPERATURE. After the previous search, select the term TEMPERATURE. Figure C.23 shows what you should see. Table C.6 explains the terminology. Fig. C.23 RDS/WIP entry for temperature. TABLE C.6 RDS/WIP TERMINOLOGY Item Purpose RDS/WIP URI The URI for the term TEMPERATURE. If you enter this text string into your web browser, you will be taken to this page. Label The official designator for this term. Within the database, this designator is unique. Description This helps you know whether or not you have the correct term. Entity Type Think of this as the general category to which TEMPERATURE belongs. We have just scratched the surface of how to use the RDS/WIP effectively. In Appendix D, in the section about iRING, we have included a link to a document Discovering the Right Class for ISO 15926. We will leave further discussion for another book. Appendix C 159 APPENDIX D: OTHER RESOURCES Information Modeling Resources The barriers to digital interoperability are no longer hardware and technology, but rather information modeling. To truly develop ubiquitous digital interoperability, we will need robust information models that describe project objects and the relationships between them—from their inception, through operation, to demobilization. This provides a distinct growth opportunity for engineers who understand that information about plant objects is as valuable as the objects themselves. When we have a large knowledge base, classified accurately, we will be able to exchange worthwhile information without human involvement in each transfer. Information modeling is the core of ISO 15926. Most people will not have to know anything about it, but a lucky few will get to go all the way down the rabbit hole. If you would like to become one of the lucky few, the sections following describe publications that will get you started. The Archives of Dr. Matthew West Dr. West has a long history with Shell’s Information Management department, and was a developer of parts of ISO 15926 before he retired. He has posted many of his publications on his web site: http://www.matthew-west.org.uk/publications.html There is a wealth of information here for those introducing themselves to information modeling. Dr. West was part of the development of several of the STEP parts, and later ISO 15926. If you start near the bottom, you will see the progression from one to the other. If you are new to information modeling, we suggest you start partway down—with some introductions on how to add the dimension of time to a representation of product parts. • The “IIDEAS” Architecture And Integration Methodology For Integrating Enterprises (2001) • Replaceable Parts: A Four Dimensional Analysis (2003) • Common Reference Data: The Foundation of e-Business (2003) • An Introduction to 4 Dimensionalism in Data Modeling (2007) • 4 Dimensional Data Modeling: An Ontological Approach (2008) If you would like to listen to a lecture of Dr. West about ISO 15926, he has made available a podcast and lecture notes of a presentation given on Ontolog—a community devoted to advancing the field of ontology. The lecture is titled An Introduction to ISO 15926 with Podcast (40 Mbyte) and is available from ONTOLOG (2006). Hans Teijgeler’s Modeling Site Appendix D 160 Hans Teijgeler has been influential in developing some of the parts of ISO 15926. In his retirement, he has developed a web site that explains the way data modeling is done with ISO 15926. http://www.15926.info/index.htm Hans uses a diagramming technique that you will see if you start working directly with Part 2, instead of Part 7. IOHN Modeling Guide We described the Integrated Operations in the High North (IOHN) project in Chapter 4. It is a unique collaboration between the IT, defense, and oil and gas industries. It is a proponent of ISO 15926 because ISO 15926 will enable more efficient and safer operation of remote sites. For their members, they have developed a training course on ISO 15926 and an introduction to information modeling—which they have recently released to the public. https://www.posccaesar.org/wiki/ISO15926Primer_OtherResources On this page, you will see the following three attachments. • IHON Modeling Guide – Part 1 • IHON Modeling Guide – Part 2 • IHON Modeling Guide – Lecture Notes Origin of ISO 15926 David Leal, of CAESAR Systems Limited, is one of the developers of parts of STEP and ISO 15926. In 2005, he published a paper ISO 15926 “Life Cycle Data for Process Plant”: An Overview. In it he explains why ISO 15926 was developed. He describes what he calls the “4D approach” to representing change in a way that can be represented with RDF/OWL. It has been made available free of charge. Search for the paper by its title, or go to this site: http://ogst.ifpenergiesnouvelles.fr/index.php?option=com_article&access=standard&Itemid=12 9&url=/articles/ogst/pdf/2005/04/leal_vol60n4.pdf Organizations Responsible for ISO 15926 We have described these organizations in Chapter 2. ISO TC184/SC4 http://www.tc184-sc4.org/ ISO Technical Committee 184 Subcommittee 4 – Industrial Data Appendix D 161 POSC Caesar Association https://www.posccaesar.org POSC Caesar Association (PCA) is a nonprofit global-standardization member organization that promotes the development of open specifications to be used as standards for enabling the interoperability of data and software and related matters. Fiatech http://fiatech.org/ Fiatech is an industry consortium that provides global leadership in identifying and accelerating the development, demonstration, and deployment of fully integrated and automated technologies to deliver the highest business value throughout the life cycle of all types of capital projects. USPI-NL http://www.uspi.nl/ USPI-NL is a formal association of process industry companies with the mission to develop, maintain, and promote the use of international standards and best practices for product and plant life cycle information. User Groups iRINGUserGroup http://iringug.org iRINGUserGroup is an open online community of users, companies, and organizations who use, are considering using, or are developing or deploying iRING protocols. The iRINGUserGroup is also responsible for the management, enhancement, and maintenance of iRINGTools and iRINGSandbox. We have described iRING in Chapter 4. On their home page are links to iRINGTools and iRINGSandbox. Discovering a Class in ISO 15926 This gives advice on choosing the correct class during searches in a reference data service (RDS) browser. http://iringug.org/wiki/index.php?title=Discovering_a_Class_in_ISO_15926 Appendix D 162 Other Organizations MIMOSA http://www.mimosa.org/ MIMOSA is a not-for-profit trade association dedicated to developing and encouraging the adoption of open information standards for operations and maintenance in manufacturing, fleet, and facility environments. MIMOSA’s open standards enable collaborative asset life-cycle management in both commercial and military applications. We have described MIMOSA and their two joint special interest groups with Fiatech and PCA in Chapter 4. European Committee for Standardization http://www.cen.eu/cen/AboutUs The European Committee for Standardization (CEN) is a business facilitator in Europe, removing trade barriers for European industry and consumers. Its mission is to foster the European economy in global trading and the welfare of European citizens and the environment. Through its services it provides a platform for the development of European standards and other technical specifications. The members of CEN work together to develop standards for European business. In the fall of 2010, they conducted the ORCHID workshop—discussed in material following. Orchestration of Industrial Data http://www.cen.eu/CEN/sectors/sectors/isss/workshops/Pages/workshoporchid.aspx The ORCHID (Orchestration of Industrial Data) group is a network of European companies and consortia dedicated to standardizing information across the process industry engineering supply chain to build competitive advantage. Its goal is to progress the industry toward interoperability. In Chapter 5, we presented an approach to implementing ISO 15926 that even small organizations can follow. It could be, however, that your organization is ready for a more comprehensive rollout. If so, you will find the proceedings of the ORCHID workshop interesting. The purpose of this workshop was to create an interoperability roadmap for organizations—starting, most importantly, with a self-assessment and an assessment of one’s business partners. The web page cited previously contains six attachments that can be downloaded. • Part 1: Direction and Framework • Part 2: Implementation Guide • Part 3: Standards Landscape • The CEN ORCHID Roadmap: Implementation Guide • The CEN ORCHID Roadmap: Direction and Framework • ORCHID: Orchestrating Industrial Data, Workshop Overview Appendix D 163 RDS Browsers The classes that make up Part 4, the dictionary of ISO 15926, are stored in what is called the RDS/WIP (Reference Data System/Work in Progress). To search the classes you need to use an RDS/WIP browser. For more information about RDS/WIP: http://rdswip.ids-adi.org/presentation/overview/index.html There are a number of browsers for the RDS/WIP. These are described in the sections that follow. rdlfacade The RDS/WIP Search, otherwise known as the RDL Facade, was created during the early development of ISO 15926. http://rdl.rdlfacade.org POSC Caesar Part 4 Browser POSC Caesar has its own library of reference data (at the following address), presented in the form of spreadsheets. http://rds.posccaesar.org/2008/05/XML/ISO-15926-4_2007/ POSC Caesar RDL Explorer http://193.212.132.108/apps/rdsclient.html Log in as “guest.” Instructions on using the POSC browser can be found at the following address. http://15926.org/home/tiki-index.php?page=Tutorial+ISO+15926+part+4 DNV Reference Data Browser Det Norske Veritas (DNV) is one of the three largest insurers in the world, along with Lloyd’s Register and the American Bureau of Shipping. It became interested in ISO 15926 as a result of assessing risk in shipping. A common way to describe physical facilities makes it easier to compare the facilities to common benchmarks. DNV has participated in some of the development work of ISO 15926 and has created its own RDS browser. http://projects.dnv.com/reference_data/RD7Browser/browser.aspx?id=part4:RDS415124 Business Interface Definition Guides One of the major motivations of organizations to sponsor the work that has led to ISO 15926, and the motivations of individuals to participate, is the efficient handover of information from engineering procurement and construction (EPC) contractors and constructors to owneroperators. After a facility has been built and commissioned, document and information hando- Appendix D 164 ver must often operate on what is left of the project budget after the original engineering and construction staff have moved on to their next project. It is sometimes difficult for the owner to get all of the information it needs, and in a form that is immediately useful. One barrier to adequate information handover in the capital projects industry is getting all parties to agree at the beginning of a project what needs to be handed over. So that every project does not have to develop its handover requirements from scratch, a number of handover guides have been created—and more are contemplated. We have come to refer to the discussion of information handover guides as the The Business Interfaces Definition Guide (BIDG). The handover guides developed to date all discuss the importance of good data handover and offer ways of ensuring that this will actually happen on a given project. One of them, published by EPISTLE, explicitly lists deliverables by name and is organized by engineering discipline. One barrier to getting more specific is a common definition of terms. When the JORD project is complete, many of the participants in the development of these handover guides anticipate a harmonization with Part 4. EPISTLE Handover Guides In the late 1990s at EPISTLE, the first Process Industries Data Handover Guide was published to help project participants define the scope of information turnover. The guide is issued in two parts. Part 1 concentrates on methodology, starting with understanding the business need for information, creating a plan to supply the information, and implementation. Part 2 focuses on recommendations for actual data items ranging from contract management documents, scheduling, materials, construction, and process to information on all of the components that make up the facility. As its basis, the EPISTLE handover guide used the PISTEP Process Plant Engineering Activity Model (see Chapter 2). This activity model was essentially a very detailed flowchart of the main activities and information exchanges encountered over the life of a typical process plant. EPISTLE used the activity model to make sure its guide captured the most important exchanges. If you are ever involved in planning the information handover for a capital project, EPISTLE’s guides are still worth reading—along with the NIST and EPRI guides. https://www.posccaesar.org/wiki/IdsAdiBIDG You will have to request a login from PCA to obtain access. Access is open to all members of PCA. Non-members can request access. Go to their membership page and follow the instructions at the bottom. NIST/Fiatech Handover Guides The NIST handover guide is issued in a number of parts, modeled after the EPISTLE handover guides. The Capital Facilities Information Handover Guide Part 1 was issued by NIST and Fiatech in 2006. Several second parts are envisioned, each to focus on the deliverables of a specific industry. The General Buildings Information Handover Guide was issued in 2007. Its focus is the general buildings industry, including commercial, industrial, and government buildings. This guide reviews the issues in designing and constructing an environment of short schedules and constant change. Several case studies are included, showing the successes and lessons learned for early adopters of building information modeling (BIM). Appendix D 165 • Capital Facilities Information Handover Guide, Part 1 • http://fire.nist.gov/bfrlpubs/build05/PDF/b05037.pdf • General Buildings Information Handover Guide • http://fire.nist.gov/bfrlpubs/build07/PDF/b07015.pdf Electric Power Research Institute Handover Guide http://my.epri.com/ The Advanced Nuclear Technology: New Nuclear Power Plant Information Handover Guide was issued by the Electric Power Research Institute (EPRI) in 2009. It reviews the information handover requirements in the very heavily regulated nuclear industry. It starts by noting that new projects will almost certainly be developed with advanced computer systems capable of 3D modeling, construction sequencing, and detailed cost control. Handover will increasingly be electronic rather than paper based. If the power authorities operating the plant pay attention to data requirements early, the data handover can directly feed their own operations and maintenance systems. The document ends with a brief overview of emerging information handover standards, in which ISO 15926 is included. Future appendices will include detailed templates for an information handover plan. The report can be downloaded at no charge. The best way to find the report is to go to EPRI’s home page and search for “handover guide.” which will take you to a page where you should find the report listed. Appendix D 166 GLOSSARY OF CONCEPTS There are already enough glossaries of computer science terms available on the Internet that we do not need another comprehensive listing. The purpose of this glossary is to use the language of beginners to expand on some of the concepts useful in understanding ISO 15926. We should warn you that we have used some artistic license here. We do not recommend you use any of these definitions on an important exam! Artificial Intelligence and the Semantic Web: Difference Between Artificial intelligence: “Making machines smarter” Semantic Web: “Making data smarter” We all want to be able to find and use information on the World Wide Web easier and more reliably. The artificial intelligence approach is to make machines smarter by teaching them to infer the meaning of web data by using techniques such as natural language and image processing. In contrast, the Semantic Web approach is to make the data itself smarter (that is, by making the data easier for machines to find, access, and process) by using techniques for expressing data and meaning in a standard machine-readable format. ISO 15926 Parts 8 and 9 use Semantic Web technology to describe plant objects in a way that computers can understand. First-order Logic, Semantics, and Reference Data First-order logic: If you have ever taken a mathematics course where you have had to prove something you have used first-order logic. Joke: A farmer, an engineer, and a philosopher decided to take a bus tour of Scotland for a vacation. After they arrived in Glasgow they transferred to a bus and drove out of town. On the first farm they passed, they saw a black sheep standing in a field. “Look!” said the farmer, “Scottish sheep are black!” “You can’t draw that conclusion after only seeing one sheep,” said the engineer. “All you know for sure is that at least one sheep in Scotland is black!” “You’re both wrong,” said the philosopher. “All you know for sure is that at least one sheep in Scotland is black on at least one side!” The philosopher knows first-order logic. This joke makes first-order logic seem pedantic. But what if, after the three went back home, they were offered an investment that depended on Glossary 167 there being a ready supply of black wool? Wouldn’t it be nice, then, if they knew whether or not the farm they saw was a typical Scottish farm, a farm owned by a collector of unusual specimens of ordinary farm animals, or an experimental farm that tinkered with the DNA of sheep? First-order logic is a means of including all of one’s presuppositions into one’s assertions in order to tell if something is true or not. “Presupposition” as regards ISO 15926 is equivalent to “context,” a concept we have become very familiar with. First-order logic is used in ISO 15926 as a basis for developing the classes (which make up Part 2); the reference data library (RDL), which makes up Part 4; and the templates, which make up Part 7. Semantics: If you have ever read Alice’s conversation with Humpty Dumpty,1 you have had a lesson in semantics. An excerpt: Humpty Dumpty: “...How old did you say you were?” Alice made a short calculation, and said, “Seven years and six months.” “Wrong!” Humpty Dumpty exclaimed triumphantly. “You never said a word like it!” “I thought you meant ‘How old are you?’” Alice explained. “If I’d meant that, I’d have said it,” said Humpty Dumpty. Semantics has to do with meaning. Sometimes the word is used derisively, as in “Yes, but that’s only semantics!” But in ISO 15926 semantics is everything. Elsewhere in these pages we have talked about embedding context with the data. What we mean by this is capturing the semantics. Semantic means that a precise meaning, neither more nor less, can be had. For instance, in a field of engineering there might be many versions of the word temperature. A user of any of the versions must be able to use each version reliably to convey the correct meaning. Semantic fidelity is the degree to which the original meaning of a term survives an information exchange. We are looking for high semantic fidelity to make sure the meaning of data values is preserved on the receiving end. Reference data: We return to Alice’s conversation with Humpty Dumpty, in which Humpty is trying to convince Alice that it is better to have un-birthdays than birthdays on the basis that there are 364 days when you might get un-birthday presents. Humpty continues: “… only one for birthday presents, you know. There’s glory for you!” “I don’t know what you mean by ‘glory.’ Alice said. Humpty Dumpty smiled contemptuously. “Of course you don’t—till I tell you. I 1. Through the Looking Glass, by Lewis Carroll. Glossary 168 meant, ‘there’s a nice knock-down argument for you!’” “But ‘glory’ doesn’t mean ‘a nice knock-down argument’,” Alice objected. “When I use a word,” Humpty Dumpty said, in rather a scornful tone, “it means just what I choose it to mean—neither more nor less.” “The question is,” said Alice, “whether you can make words mean so many different things.” “The question is,” said Humpty Dumpty, “which is to be master—that’s all.” If the definitions of terms were decided by individuals at the time they used the terms, we would never be able to communicate effectively. But with a common set of definitions, that any of us can consult, we can communicate without ambiguity. The ISO 15926 RDL is the common set of definitions (reference data) we can all use. Ontology and Taxonomy: Difference Between Ontology: If you have ever played the parlor game Twenty Questions, you intuitively understand what ontology means. In this game, you more or less start with an “ontology of everything in the world” and with each successive question (“Is it a ...?”) apply a more limited ontology as a filter (usually starting with “Is it animal, vegetable, or mineral?”) The game ends when there is only one object left, the answer, that satisfies membership (or nonmembership in the case the answer to “Is it a ...?” is “No!”) in all ontologies. Taxonomy: If you have ever made a classified list of all your CDs, you have made a taxonomy. (But if you are as old as the author, CDs are old hat. You learned how to do this years ago with your player piano rolls!”) And if you have ever had to grapple with the question of where to classify Weird Al (under “Parody”, “Rock and Roll,” or “Idiot!”), you have come up against the idea of single or multiple inheritance. Ontology and taxonomy are both terms in a continuum that information scientists call knowledge organization systems (KOS). And just to confuse you some more, the continuum includes thesaurus, controlled vocabulary, and faceted classification—among many others. The bad news for those of you not used to dealing with ambiguity (all you mechanical engineers out there: raise your hands!) is that there is a great deal of overlap in those terms. Even people whose job it is to know these things (all you mechanical engineers out there: put your hands down!) cannot give a short answer when asked where the boundaries are. A taxonomy is a collection of terms that have explicit definitions that have been organized into a hierarchical structure. They tend to be organized in tree-like structures that are reasonably easy to understand, even by non-specialized people. Each term is related to its parent in an “is a type of” relationship. For instance, a car is a type of automobile. But a car also a type of machine, so if your taxonomy is concerned with machines you should analyze the relative order of these three things. Glossary 169 Depending on the purpose of your taxonomy, you might end up with something like this: “Car” is a type of automobile, which is a type of machine. In the realm of philosophy, ontology is the study of being; the study of the things that are. In the realm of information science (which is where ISO 15926 firmly resides), ontology has a more formal meaning. The Wikipedia definition says that an ontology is “a formal representation of a set of concepts within a domain and the relationships between those concepts.” (Hmm…This is one of those definitions that does not start to make sense until you already know what it means. Let’s try something else.) Like taxonomies, ontologies are also arranged in an “is a type of” relationship, but the relationships tend to be more richly defined. The difference is subtle. One commentator compared the difference between ontology and taxonomy to your computer hard disk. The taxonomy would be the directory structure without the files, whereas the ontology would be the files organized by the directory structure. In Chapter 2, we talked about an ontology of things that will carry a bicycle. That ontology was the entire collection of things that can carry a bicycle that the author held in his head in case his bicycle were to break down on the way to work. Each object in the ontology would have a taxonomy you could examine. Generalization and specialization: The “is a type of” relationship is known as generalization/specialization. In the previous example, “car” is a specialization of “automobile”; “automobile” is the generalization of “car.” Subtype and supertype: Subtype/supertype is just another way of saying generalization/specialization. Continuing the previous example, “car” is a subtype of “automobile”; “automobile” is a supertype of “car.” The understanding is that the subtype has all the constraints of the supertype, plus one or more additional constraints. Integration and Interoperability: Difference Between Interoperability: An early example of digital interoperability, probably even before the phrase was coined, is the centerpiece of the 1970 movie Colossus: The Forbin Project. It was released near the height of the Cold War, in an era when people were starting to hear about these newfangled computer-thingies and to wonder how they would affect us. In the movie, “the” (one and only) supercomputer in the United States, Colossus, tunnels under the Atlantic Ocean using something like what we call the Internet today and finds “the” (one and only) supercomputer in the Soviet Union, Guardian. Recognizing themselves as kindred spirits, Colossus and Guardian naturally start talking to each other and by the end of the movie have taken over the world and enslaved humanity. Now that’s interoperability! Basically, this is what ISO 15926 intends to achieve (minus, of course, the taking-over-the-world-and-enslaving-humanity part.) Integration: Glossary 170 If you are a Start Trek fan and have watched one of the episodes where the crew of the Enterprise encounters the Borg, you have seen one definition of integration: basically, a bunch of individual things becoming part of a whole (other) thing with all parts working together seamlessly. Comparing Interoperability and Integration Colossus is probably classified in the apocalyptic science fiction genre, but if you are involved in any way in moving information from one computer system to another, parts of it will be high comedy. The movie portrays supercomputers as being naturally aware of each other, sort of like two dogs being walked in the same park. Then the movie suggests that supercomputers somehow have the innate ability to communicate. The truth is, unless a pair of computers had been specifically programmed to do so they would not even know of each other’s existence—let alone be able to communicate, even if you were to put them both in the same room and name one Laverne and the other Shirley. For most people, the line between interoperability and integration is fuzzy, and in truth there is overlap. In the field of information exchange, both imply that information can flow between two computer programs more or less seamlessly—without anyone having to rekey any of it. It is not until we start digging into the various methods of achieving this seamless information exchange that we can zero in on the differences. Interoperability implies that the two computer programs are their own agents, but somehow know how to speak the same language. Integration implies that the computer programs achieve the information exchange by having hooks into each other. In the context of ISO 15926, interoperability means that two or more computer applications are able to exchange information because they all, independently of each other, use a common standard for exchanging information. There is no requirement that any of them know about any of the others beforehand. It is like buying a Bluetooth headset for your cellular phone and then being pleasantly surprised that after replacing your phone the headset works with the new one too. In the context of ISO 15926, integration means that two computer applications are able to exchange information because the developers of each did some custom work to make them able to do so. The end result of this “working together” may in fact be identical, better than, or worse than the “working together” in our definition of interoperability. Strong Coupling, Loose Coupling, and Encapsulation Interoperability with ISO 15926 is achieved by what is called loose coupling, whereas integration implies strong coupling. In the field of computer science, strong coupling means that a function in one application is directly controlled by a function in another application. Loose coupling means that when two programs communicate but the components of one system do not directly make use of knowledge of the other system. For example, suppose that your neighbor occasionally needs a light shining on a particular place in his backyard that is hard to illuminate from anywhere in his own yard. Further suppose that because there is a particularly good vantage point in your yard you agree to mount a light for him and turn it on whenever he wants. Glossary 171 Strong coupling: You install the light and let your neighbor run a line from his house to your breaker panel so that he can control the light directly from his own house. Loose coupling: You install the light, but tell the neighbor to phone you and you will turn it on for him—perhaps giving him a performance guarantee of 30 seconds. Encapsulation: The example of loose coupling—asking your neighbor to call you when he wants the light turned on—is also an example of encapsulation. In computer science, one definition of encapsulation is a mechanism in an object-oriented programming language that restricts access to some of an object’s components. This means that when you encapsulate something you reveal only the attributes you want for a particular effect, not the entire object. This allows you the freedom to modify the unrevealed attributes without creating a catastrophic change that might affect the original information exchange. Encapsulation is hiding complexity in situations where users really do not need to know more. In the example of a yard light for your neighbor, all that your neighbor needs to know is that when he calls you and says “please” the light comes on within 30 seconds. He does not need to know how it happens. This gives you a great deal of flexibility. For instance, at the beginning you might guess that he will not call you very often so you just install a bright flashlight, change the batteries as required, and go out and turn it on manually whenever he calls. Later, if he asks for the light often enough it will be much more convenient for you to install a permanent spotlight with a line to a switch at your back porch. Later yet, you may get a job where you have to travel a lot (perhaps you write a book for Fiatech and get invited on a lucrative speaking tour!). You install a special computer-controlled switch and give it an IP address you can control from your smart phone. Then when you are away you just call up the switch from wherever you are in the world and turn it on. Note that none of your internal changes has any impact on the performance of the light. When you decide to install a permanent light, you leave the flashlight in place until the new one is in place. Unless your neighbor happens to look out his window at the right time, he will not even know that you are running a power line and changing the light. Later, when you install the computer-controlled switch you can do it during the daytime so as not to risk being in the middle of the changeover when he calls. Abstraction Abstraction is a process of generalizing about something to reduce the information content about an object to only those attributes you are interested in. If you wanted to get a visceral reaction from the class of North American males who, like your humble author, were pubescent boys in the early 1960s, show them a glossy photograph of a 1963 Corvette Stingray with the top down. (To clinch the effect, add a blonde surfer babe in the passenger seat with her hair streaming back in the wind!). Glossary 172 In this example, you reduce the Corvette Stingray to ink on paper. The printer sees the ink, but the now-geriatric boys see the actual car they remember from their youth—which at the time was the epitome of style and performance. Of course you could get an actual car (and an actual surfer babe)—but that would cost a lot of money and would run the risk that the geezers would run off with the car and get themselves killed. The photograph will get the gut reaction you are after, and the worst they can do is steal the picture! Semantic Web Technology The large majority of ISO 15926 users will not need to know anything more about the Semantic Web. But if you are curious on how things like ISO 15926 work you may find it interesting to learn a little bit more. The Semantic Web provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries. It is a collaborative effort led by W3C, with participation from a large number of researchers and industrial partners. It is based on the Resource Description Framework (RDF). Resource Description Framework If you dig deeper under the hood of ISO 15926, you will soon run into this term because it is the used by OWL (the basis of Part 8). Wikipedia says that the RDF is a set of specifications originally designed as a metadata data model. But if you are like the author, this does not help at all—so we will deconstruct the definition. Metadata: Metadata is data about data. For instance, one piece of metadata about the Introduction to ISO 15926 is that it was originally written with a wiki on the POSC/Caesar web site. Another piece of metadata is that the original name was ISO 15926 Primer. Data model: A data model is an abstract model that describes how data is represented and accessed. Putting it all together, then, RDF is: Instructions on how to represent just the bits of data you are interested in that describe certain other bits of data and on how to then access the data easily. Whew! I bet you thought that was going to be difficult! In particular, RDF makes statements about things (which it calls resources) in the form of subject-predicate-object expressions known as triples. Subject-predicate-object triple: “The ISO 15926 Primer was written on the POSC/Caesar wiki” might be stored in the RDF as the following triple. • Subject: ISO 15926 Primer • Predicate: was written on • Object: POSC/Caesar wiki Each term in the subject-predicate-object triple may be explicitly named, as in the previous example, or handled in the form of a uniform resource identifier (URI). Glossary 173 Uniform Resource Identifier You can think of a URI as a web address for a piece of information. This allows the same resource to be reliably referenced many times. Thus, instead of writing the subject-predicateobject triple in words (as previously) it could be rendered as follows. • Subject: https://www.posccaesar.org/wiki/ISO15926Primer • Predicate: was written on • Object: POSC/Caesar wiki In fact, we could carry this further by defining somewhere on the Internet the exact meaning of the phrase was written on and put its URI in the predicate. URL, URN, and URI: • URL (uniform resource locator): Like a person’s street address (i.e., “where”). • URN (uniform resource name): Like a person’s name (i.e., “what”). • URI (uniform resource identifier): URLs and URNs are URIs, but a URI can be a name and a locator at the same time. Glossary 174 3925 West Braker Lane (R4500) Austin, TX 78759-5316 t: (512) 232-9600 www.fiatech.org Copyright © 2011 Fiatech Construction Industry Institute® The Cockrell School of Engineering The University of Texas at Austin