OntologySummit2013 Hackathon Clinics ISO 15926 RefDataValidation

This is one of the "OntologySummit2013_Hackathon_Clinics" Projects ...

= Project (HC-06): ISO 15926 Reference Data Validation =

Project Co-champions: VictorAgroskin

Event Date / Time: (Day-1) Sat 2013.03.30 (possibly also Sun 2013.03.31) ... see: [ details]

Archived work output can be found here

Participant Volunteers:

Add your name below, along with any time constraints or other participation-related comments. Thank you!



see: Work Plans

Abstract
This ontology clinic is aimed at the evaluation of publicly available ISO 15926 reference data, viewing it as an ontology for the engineering domain. We will look for compliance to upper ontology constraints, diagnose problems in reference data, evaluate ease of understanding and use of existing data, and make suggestions for ontology improvement. Another goal is to apply formal ontology quality metrics for data in question.

The effort will develop rules and algorithms to support generic verification tests, and also attempt to invent and implement specialized checks and quality metrics for ISO 15926 reference data.

see: [ [ Project document] ]

Ontology involved
The ISO 15926 is a standard for engineering data integration, sharing, exchange, and hand-over. The standard defines a generic data model as an upper ontology for an engineering domain. Extensive Reference Data Libraries (ontology data) for process plants are developed by community of users, which includes equipment manufacturers, engineering companies and owner/operator companies in oil and gas, nuclear power, petrochemical industries, and others. Standard development and application is jointly managed as JORD project by two industry associations: POSC Caesar Association and FIATECH.

Upper ontology of the ISO 15926 is publicly available as an OWL representation of ISO 15926-2 at:


 * http://rds.posccaesar.org/2008/02/OWL/ISO-15926-2_2003
 * see https://www.posccaesar.org/wiki/ISO15926inOWL for more details

JORD Reference Data Library is available as a reference data service:


 * the human browser access & query page is http://posccaesar.org/endpoint/
 * the SPARQL endpoint is http://posccaesar.org/endpoint/sparql

A snapshot file with full content of an endpoint is available for download at:


 * http://rds.posccaesar.org/downloads/PCA-RDL.owl.zip

Various resources for study of ISO 15926 are available from


 * http://www.infowebml.ws/, http://www.15926.info/,
 * http://www.15926.org/ (see self-education guide at
 * http://levenchuk.com/2012/10/01/iso-15926-self-education-sequence/).   (012)

Goals
Evaluate publicly available ISO 15926 reference data:


 * compliance to the best practices of ontology engineering;
 * compliance to logical model and upper ontology constraints;
 * diagnose problems in reference data;
 * make suggestions for data improvement;
 * evaluate ease of access, understanding and use of existing ontology data.

We'd also like to attempt calculation of ontology quality metrics for ISO 15926 upper ontology and Reference Data Library in various software environments or by standalone tools. We can use existing methodologies (like OntoClean, OQuaRE or OntoQA) or brainstorm for new metrics.

Deliverables
We are panning to get:


 * the list of "hard" problems (errors, inconsistencies, missing data, etc.) in existing JORD Reference Data Library;
 * characterization of any "soft" problems with data identification, availability, understandability, etc.;
 * adopted or new ontology quality metric methodologies for ISO 15926 ontology;
 * ontology quality metrics calculated for JORD Reference Data Library;
 * suggestions for improvement of data consistency and quality in JORD Reference Data Library;
 * algorithms, scripts, software tools proven to be useful for verification and quality assessment of ISO 15926 data, made available to ISO 15926 community.

Project results will be presented 11-14 of April at ISO 15926 round table during 4th annual Systems Engineering Challenges workshop of INCOSE Russian chapter.

Tools
Availability of data in OWL, RDF/XML and at a SPARQL endpoint allows the use of both public and proprietary tools in this clinic project. General purpose ontology management and Semantic Web tools can be used along with special purpose tools developed for work with ISO 15926 data (like iRING Tools http://iringug.org/wiki/index.php?title=IRINGTools or .15926 Editor http://techinvestlab.ru/dot15926Editor)

Collaborators
Project initiators are TechInvestLab.ru, a Moscow (Russia) based company, developer of the .15926 software environment. Another team of tool developers from Moscow (http://agentlab.blogspot.ru) has also expressed interest in participating.

We are looking for collaborators  ontology evaluation experts commanding any generic or specialized software tools to work on proposed ISO 15926 datasets. We will welcome software tool developers willing to show use of their instruments for project tasks.

Resources and facility
Our own tool .15926 Editor is freely available (http://techinvestlab.ru/dot15926Editor), well documented, is designed for exploratory programming and includes examples of use for data verification. Basic knowledge of the Python programming language will be enough to join us in the project with this tool.

We will be happy to provide office space for real world collaboration in Moscow, well connected to the virtual environment of the Hackathon.

Work Plans
from: (email to participants) VictorAgroskin / 2013.03.26

A. Project schedule:

We propose to start on Sat 2013.03.30 at 2:00pm Moscow / 11:00am CET / 10:00 UTC. At this time we'll open communication infrastructure of the project from our Moscow office (we propose Skype for voice and Mikogo http://www.mikogo.com/ for screen sharing). We'll work for some 5-6 hours, after that we can make a break for 1 hour and will be ready to continue work from our homes if team members wish so, or new people from other time zones join us.

Project continuation on Sat 2013.04.06 will be discussed later.

B. General framework:

Our proposal is to do work in short iterations within the loop:

1. Selection of a Test or a Metric (we've to prepare the list before the start of the meeting).

2. Short presentation of a Test or a Metrics from the most competent participant (should be prepared in advance, I think?).

3. Discussion of its application to ISO 15926 upper ontology or reference data, suitable algorithm and tools (software environments) selection. We'll always try to do each Test or Metric in at least two environments - to compare results and get better understanding of tool applicability.

4. Implementation (performed concurrently if several tools/software environments selected).

5. Test on real data, discussion of results.

6. Preliminary documenting of results (errors, inconsistencies, metric values, etc).

7. Go to (1).

If only one team team takes implementation of some Test or Metric, steps 1-3 can be performed in a loop until all people/teams capable to do implementation have some task assigned.

Final documentation and packaging of results, suggestions and useful code snippets for presentation to ISO 15926 community and to JORD project will be done after the main event. We (TechInvestLab.ru) are ready to do the job.

C. Ontology data sources:

Upper ontology of the ISO 15926 is publicly available as several OWL representations of various aspects of ISO 15926-2 at:

http://rds.posccaesar.org/2008/02/OWL/ISO-15926-2_2003, 221 kB http://rds.posccaesar.org/2008/07/OWL/ISO-15926-2_2003_annotations, 110 kB http://rds.posccaesar.org/2008/07/OWL/ISO-15926-2_2003_entityMembership, 26 kB http://rds.posccaesar.org/2008/07/OWL/ISO-15926-2_2003_entityMembershipCandidates, 14 kB http://rds.posccaesar.org/2008/09/OWL/ISO-15926-2_2003_inverseRoles, 16 kB http://rds.posccaesar.org/2008/09/OWL/ISO-15926-2_2003_chainedRoles, 38 kB

see https://www.posccaesar.org/wiki/ISO15926inOWL for more details

JORD Reference Data Library is available as a reference data service:

the human browser access & query page is http://posccaesar.org/endpoint/ the SPARQL endpoint is http://posccaesar.org/endpoint/sparql

A snapshot file with full content of an endpoint is available for download at:

http://rds.posccaesar.org/downloads/PCA-RDL.owl.zip

D. Initial list of problems:

We (TechInvestLab.ru) can suggest several Tests to perform on ISO 15926 upper ontology and/or reference data:

a) Verification of upper ontology (currently distributed between 6 OWL files). This representation is heavily criticised and a new effort to do a standard representation in OWL 2 is currently suggested to ISO. The list of problems detections and suggestions for improvement will be of great value to the community.

b) Reference data compliance to upper ontology. There are many requirements to reference data library content coming from ontology restrictions: abstract classes and disjoint classes, mandatory relationships (properties), membership relations between classes of classes and classes, etc. Simple scripts to find most common mistakes can be found in the documentation to .15926 Editor (section 6.4).

c) Filling gaps in entities and relations. This effort can be based on methods outlined in http://ontolog.cim3.net/file/work/OntologySummit2013/2013-03-07_OntologySummit2013_OntologyEvaluation-IntrinsicAspects-2/OntologySummit2013_debugging_is-a_structure--PatrickLambrix_20130307.pdf and it'll be great if this team will be convinced to join our project. In ISO 15926 reference data libraries classes of individuals, classes of classes and classes of relationships can be viewed as corresponding taxonomy systems, thus suggestions for missing members in these taxonomies can be deduced from the existing correspondence relations.

E. Questions to participants:

1. Please suggest more Tests and especially Metrics - our knowledge of these comes only from the Summit materials.

2. Please indicate your readiness to apply your tools to the tasks identified in the project (or suggested by you). If some initial introduction to data or its transformation are required - notify us in advance, we can discuss it together, by email or by arranging a preparation call.

3. If you are interested to work in Python data exploration environment of .15926 Editor - please take a look (download from http://techinvestlab.ru/dot15926Editor ) and ask questions before the event.

--- [part of above content transferred from: http://ontolog.cim3.net/forum/ontology-summit/2013-03/msg00011.html ]