Versioning Ontology

The actor authority provides base classes for anything that can perform or have behaviors or responsibilities - people, systems, dogs, organizations, groups, gods, etc.

Class Diagram for all Versioning

The core of versioning works for structured and unstructured assets. Version artifact

A knowledge base is a container for a set of managed data assets that may change over time based on the actions of an authority.  These data assets contain the knowledge of the knowledge base.  Structuring the knowledge base as a set of data assets allows each asset to be individually managed, tracked, versioned validates, approved and accepted.  This approach carries forward units of management like “files” and “ontologies” so that knowledge is in manageably sized chunks.  Anything that is “said” in the knowledge base is said with respect to such a data asset.  The knowledge base makes no assumption about the granularity of data assets – they may be as big as an enterprise DBMS or as small as a single business rule.  To capture the common concept of a data asset container (or folder) for data assets, a hierarchy of Data asset containers may be used – this is a synonym for a folder in a file system.  Use of folders is, however, optional.

Anything that affects the knowledge base begins with a transaction.  A transaction is performed by some authority (human or automated performer) that makes (states) a set of coordinated Data asset changes.  Note that any speech act (such as a transaction) may state other propositions, including other speech acts  Authority and authorization is the subject of another ontology.  A data asset change records the authority that stated the changes being proposed that move the asset from one data asset version to the next (prior_version to later_version), based on this change.

Assuming there are no errors or conflicts the data asset change and subsequent transaction is marked as a successful act or a failed act. Note: Structure for reasons for failure still to be developed.

Structured Asset Versioning

Structured asset versioning extends basic versioning to provide for changes between structured assets being recorded.Structured asset versioning

Structured data assets augment generic data asset management by understanding the changes that take place within the asset.  For example, in a time ontology it would be recorded that the concept of “timezone” was added.  Each such change is a proposition about the structured data asset, stated as part of a speech act.

Consider that we start with a transaction that states a structured data change about a structured data asset.  That change will take the structured data asset from one structured data version to the next. The structured data change is a proposition group that states a set of changes about the asset.  That set of changes may:

Through these changes every triple in the resulting version should be able to be tracked back to a specific change made as part of a speech act.


Asset Syncronization

Asset syncronization models different assets that are kept "in sync" using syncronization rules.

As EKB will manage different representations and mappings of data assets when the structure of those assets is understood based on rules about synchronizing those assets.  Structured data may have three different representations in the EKB, each of which is considered a separate asset: 1) The “raw” file in some external format. 2) An isomorphic representation of the same information expressed in RDF, but retaining the source files meta model and terminology. 3) A shared concept representation where representations of shared concepts have been normalized.


These different representations of the same information are maintained as separate but synchronized data assets.  The mappings between these assets are expressed as synchronization rules and the resulting changes to the assets “data synchronization changes”, based on the following ontology:

 Logical connections between assets that are representations of the same underlying information are represented as a “Data Synchronization Rule”.  The data synchronization rule connects two data assets in a “hub and spoke” pattern and states that these two assets are to be kept in synchronization.  A subtype of data synchronization rule is “Isomorphic synchronization rule” which states that the “spoke” asset is unstructured (at least the structure is not represented in the EKB, but it must be known in some way), and the “hub” asset is a structured data asset represented in RDF.  The isomorphic synchronization rule will preserve the original structure, meta model and terminology, but will map the artifact to RDF in a deterministic and reversible way.

A “Shared Concept Synchronization Rule” changes the semantic representation but preserves meaning by mapping the isomorphic model to and from a “shared concept” model.  The shared concept model will have those concepts that are defined in the EKB mapped to their normalized counterparts (the shared concepts).  Note that subtypes of a shared concept mapping rule may “narrow down” the situation within the shared concept model that is mapped.

Both the isomorphic and shared concept synchronization rules specify what the EKB should do, not what actually happened.  When an asset with a synchronization rule is changed is will trigger the synchronization rule to perform the mapping and thus make a change to the mapped asset.  Each version of a mapped asset will trigger a corresponding version of each asset it is mapped to.  The mapping may be done based on either the changes or a wholesale mapping of the asset.

A “Structured Synchronization Change” records the specific effect of a data synchronization rule that is a result of some other change to a data asset.  So when one asset changes it triggers a rule which creates a data synchronization change, resulting in a new version of the synchronized asset.  A data synchronization change references a data synchronization rule as its “reason”.  This results in a “forward chaining” effect on the synchronized assets.




This information is made available through the OSERA program (http://www.osera.gov) of the U.S. General Services Administration under the terms and conditions of the license located at http://www.osera.gov/license.html. Copyright © 2008 by Data Access Technologies as an unpublished work. All other rights reserved, Worldwide. Further information and community support may be found on http://www.ModelDriven.org