Portability, Traceability and Data Ownership - Part II
[return to Part I]
The Dilemma of Missing Information
Here is a four minute video interview of Chris Saad, Co-founder and CEO of Faraday Media. If you are pressed for time, just catch the first minute and a half. Chris is also Co-founder and Chairperson of Dataportability.org of which Faraday Media is a sponsor. In Part I of this multi-entry blog I began with the video clip called Data Portability – Video that is a promo for Dataportability.org.
Learning from the Future at the Next Web with Chris Saad from Maarten on Vimeo.
Right after the Facebook/Scoble incident, Dataportability.org gained momentum and membership from individuals associated with the likes of Google, Plaxo, Facebook, LinkedIn, Twitter, Flickr, SixApart and Microsoft. At Chris' suggestion I, too, have just recently joined their DataPortability Policy Action Group.
Henry Story, a staff engineer for Sun Microsystems, made the following interesting comments on the Sun Babelfish blog about Chris Saad’s Data Portability group and the Data Portability – Video.
“Will the Data Portability group [at Dataportability.org] get the best solution together? …. [O]ne wonders whether XML is not the solution to their problem. Won't XML make data portability possible, if everyone agrees on what they want to port? Of course getting that agreement on all the topics in the world is a never ending process....
But the question is also whether portability is the right issue. Well in some ways it is. Currently each web site has information locked up in html formats … [which makes] it difficult to export the data, which each service wants to hold onto as if it was theirs to own.
Another way of looking at this is that the Data Portability group cannot so much be about technology as policy. The general questions it has to address are question of who should see what data, who should be able to copy that data, and what they should be able to do with it. As a result the policy issue of Data Portability does require one to solve the technical problem of distributed identity: how can people maintain the minimum number of identities on the web? (ie not one per site) Another issue that follows right upon the first is that if one wants information to only be visible to a select group of people - the "who sees what" part of the question - then one also needs a distributed way to be able to specify group membership, be it friendship based or other. The [Data Portability – Video] … makes that point very clearly why having to recreate one's social network on every site is impractical.
Story’s comments are a good setup for what I want to address. And what I want to address is how to make a connection between data portability and what I call the ‘frayed ends and laterals’ of complex product supply chains.
Along the way I want to pay attention to those readers (i.e., the vast majority of the regular, non-techie folks in the world) who are hanging back wondering what an XML object is. Let’s weave in a little history with a simple example, shall we?
The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web. W3C is headed by Sir Tim Berners-Lee, creator of the first web browser and the primary author of the original Uniform Resource Locator (URL), HyperText Transfer Protocol (HTTP) and HyperText Markup Language (HTML) specifications. These are the principal technologies that form the basis of the World Wide Web.
For example, consider this product pedigree written in natural language.
Product Pedigree Document
Manufacturer ID = Safe Toy Company
Product Serial Number = STOY991
Product Description = Painted Toy
Product Info To Supply Chain = 0% lead in paint
Product Info To Govt Regulator = Less than 600ppm of lead in paint by weight
A beneficial characteristic of the World Wide Web is that you can read language like the foregoing example in a natural way but ‘behind the scenes’ (i.e., behind the web browser interface) this natural language representation can be constructed in different ways for different purposes.
The same natural language representation written as an HTML information object using an HTML authoring software application (also called an HTML editor) would read behind the scenes as follows.
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN">
<body><p>
Product Pedigree Document<br>
Manufacturer ID = Safe Toy Company<br>
Product Serial Number = STOY991<br>
Product Description = Painted Toy<br>
Product Info To Supply Chain = 0% lead in paint<br>
ProductInfo To Govt Regulator = Less than 600ppm of lead in paint by weight
</p></body></html>
Because HTML objects are designed primarily for creating static websites, and not for dynamic information sharing, W3C has further developed standards for structured electronic sharing in the form of Extensible Markup Language (XML) objects for facilitating the emerging Semantic Web.
With gracious assistance from my good friend and collaborator, Dr. Marvin Stone, here’s an example of a granular XML information object created in an XML editor that would be naturally represented through a web browser as above.
<?xml version="1.0" encoding="UTF-8" ?>
<Pedigree>
<ManufacturerID>Safe Toy Company</ManufacturerID>
<ProductSerialNumber>STOY991</ProductSerialNumber>
<ProductDescription>Painted Toy</ProductDescription>
<ProductInfoToSupplyChain>0% lead in paint</ProductInfoToSupplyChain>
<ProductInfoToGovtRegulator>Less than 600ppm of lead in paint by weight</ProductInfoToGovtRegulator>
<OtherData>Document Type Definitions</OtherData>
</Pedigree>
This type of granular XML object works fine for short, vertically integrated supply chains covered by one or two enterprise systems where a small number of supply chain participants agree on what they want to port. But due to prevalent fear factors (and other policies) that prevent or otherwise affect information sharing along lengthy, complex information supply chains, there is a critical need for a more refined XML tool.
Here’s an example of a hypothetical, author-controlled XML object that would be created/authored/constructed using an extension to the foregoing XML editor that we could call an A-XML editor extension (i.e., author-controlled XML editor extension).
<?xml version="1.0" encoding="UTF-8" ?>
<Pedigree>
<PedigreeID UniquePointer =" 99087 "/>
<ManufacturerID UniquePointer =" 00372 "/>
<ProductSerialNumber UniquePointer =" 43229 "/>
<ProductDescription UniquePointer =" 23444 "/>
<ProductInfoToSupplyChain UniquePointer =" 66221 "/>
<ProductInfoToGovtRegulator UniquePointer =" 66333 "/>
<Permissions UniquePointer =" 37911 "/>
<!-- Manufacturer information sharing permissions -->
<OtherData>Document Type Definitions</OtherData>
</Pedigree>
In the process of being authored by the toy manufacturer, this A-XML object would be constructed to point to a central repository of uniquely identified data containing the toy manufacturer's unique ID, the unique identifiers of the painted toy’s pedigree, and a unique identifier of the toy manufacturer's information sharing permissions.
Once distributed by the manufacturer/author to a lengthy supply chain, this A-XML object would provide greater control, visibility and traceability one-share, two-shares, three-shares, etc. away from the author. As other supply chain participants access the A-XML object (using a compatible XML editor) to confirm the toy’s pedigree, the toy manufacturer would be provided with supply chain visibility never before experienced.
For instance, the data element "0% lead in paint" uniquely identified as 66221 would be accessible by any supply chain participant registered with the central repository and using a compatible XML editor. The data element "Less than 600ppm of lead in paint by weight" uniquely identified as 66333 would only be accessible by permitted government regulators also registered with the central repository. (For those of you concerned with the ethics of representing one thing to consumers while reporting something else to the government, check out Are Food Labels Reliable?)
In my first journal entry to this blog I offered this:
“Unscrupulous supply chain participants will always try to hide in the ‘fog’ of their supply chains. The manufacturers of safe products want to differentiate themselves from the manufacturers of unsafe products. But, again, fear factors keep the good manufacturers from posting information online that may put them at a competitive disadvantage to downstream competitors.”
There’s a chicken and egg effect here, isn’t there? That is, which comes first, policy or technology?
Here’s one answer.
Don’t throw the baby out with the bath water. Don’t get rid of the supply chain enterprise and legacy systems that are already providing useful information sharing without the data ownership characteristics of a tool like A-XML. But in the context of an emerging Semantic Web that will lean heavily upon software-as-a-service, consider the missing and incomplete information that is not being shared from the frayed ends and laterals of complex product supply chains.
And, ask yourself, could there be both a technological and socio-political connection made between data portability and supply chain traceability?
[continued in Part III]