The dita2rdf DITA OT plugin |
The dita2rdf DITA OT plugin (a.k.a. the dita2rdf plugin) adds a new transtype to the DITA OT: rdf.
In short, running the rdf transtype on a DITA map or topic runs an extraction of the metadata of the content and stores it in RDF/XML format, a serialization of RDF. The transformation not only extracts the metadata of the input file, it also follows the references (@href and resolved @keyref) to cover the whole documentation set.
Once the metadata is extracted as RDF, you should upload it to an RDF repository. To do so, just follow the instructions. Once the upload is configured, all the RDF output you generate with the rdf transtype will be uploaded to the repository. I will soon add fun things to do with a repository fill of DITA metadata ;-) You'll get updates here.
I will speak about this project it during the DITA Europe conference and the DITA OT day. Let's meet there!
The dita2rdf DITA OT plugin only extracts the metadata at the level where it's expressed and not to apply the metadata cascading rules in the resulting RDF. I'd rather extract incomplete metadata than zealous inexact statements. The only metadata that is currently inherited is the language, as per the XML standard. For two reasons:
I have ideas to distinguish the metadata that genuinely describes a topic or a map from the metadata that is inherited, but I will not apply them until I'm sure it is robust enough.
The dita2rdf plugin was developped reusing the methodology of the pdf2 plugin for the build and the customization, as I thought many DITA-OT users would feel more comfortable with a familiar plugin structure.
However, the dita2rdf plugin differs on a couple of points:
For the long term, I aim at removing the preprocess completely and using as much XSLT as possible to make the code more portable.