SCEP: | 103 |
---|---|
Title: | Standard filesystem representation method |
Version: | 8b028d25775b16452084cca78d96d730904b294c |
Last modified: | 2014-06-16 10:23:18 UTC (Mon, 16 June 2014) |
Author: | Raphael ‘kena’ Poss |
Status: | Draft |
Type: | Standards Track |
Created: | 2014-06-16 |
Source: | scep0103.rst (fp:0cYMtlAA_T4_vG2NBmtEeB7uh26b1tpzb-0qiDGHxGrIMw) |
The Structured Commons object model is defined semantically in SCEP 101 [1], independently from its particular representation in a computer system. Conversely, there may exist multiple valid representations for an object.
This SCEP defines the canonical representation of Structured Commons objects in a filesystem, with method name "fs".
The fs representation maps objects to a filesystem (eg. on disk).
Object files are represented by filesystem files.
Object dictionaries are represented by filesystem directories.
Names in dictionaries are translated using UTF-8 [2] to file/directory names, with an optional percent-encoding [3] to support characters that are not acceptable in file names on a particular platform.
For example, the name "helló / world?" can be translated "hell%C3%B3 %2F world?" on a Unix filesystem or to "hell%C3%B3%20%2F%20world%3F" for a directory served over HTTP.
Dictionary names that map to objects are represented by directory entries that point to either files or directories.
Dictionary names that map to fingerprints are represented by a regular file containing the binary representation of the fingerprint, with a filesystem name that prepends "%00" to the dictionary name representation.
Caution!
The names "." and ".." are valid dictionary names in Structured Commons dictionaries, however they are (usually) not valid filesystem entry names. In general, names starting with a "." should always be encoded to a filesystem name starting with "%2E".
PROCEDURE encode(path, object) BEGIN IF object IS-A FILE-OBJECT THEN SAVE BYTE-DATA(object) TO path ELSE MAKE-DIRECTORY name FOR name IN DICTIONARY-LABELS(object) DO item := DICTIONARY-VALUE(object, name) npath = URL-QUOTE(UTF-8-ENCODE(name)) IF item IS-A FINGERPRINT THEN fpath := path + '/%00' + npath SAVE BYTE-DATA(item) TO fpath; ELSE fpath := path + '/' + npath CALL encode(fpath, item) END IF END FOR END IF END PROCEDURE
FUNCTION decode(path) BEGIN IF IS-REGULAR-FILE(path) THEN data := LOAD-BYTES-FROM(path) ELSE data := MAKE-EMPTY-DICTIONARY() FOR fname IN DIRECTORY-LISTING(path) DO IF fname STARTS-WITH "%00" THEN label := UTF-8-DECODE(DROP-FIRST-CHAR(URL-UNQUOTE(fname))) fpdata := FINGERPRINT-VALUE(LOAD-BYTES-FROM(fname)) SET data KEY label VALUE fpdata ELSE label := UTF-8-DECODE(URL-UNQUOTE(fname)) data := CALL decode(path + '/' + fname) SET data KEY label VALUE data END IF END FOR END IF RETURN data END FUNCTION
Example code in Python is provided separately:
[1] | SCEP 101. "Structured Commons Object Model and Fingerprints". (http://www.structured-commons.org/scep0101.html) |
[2] | RFC 3629. "UTF-8, a transformation format of ISO 10646". (https://tools.ietf.org/html/rfc3629) |
[3] | RFC 3986, "Uniform Resource Identifier (URI): Generic Syntax". (https://tools.ietf.org/html/rfc3986) |
This document has been placed in the public domain.