ZWI file format
The ZWI file format is used to represent wiki articles. It was developed to store wiki articles (and associated data) and to facilitate their exchange between different wiki software. A ZWI file contains the Wikitext of the final revision of an article, names of contributors, old revisions, embedded media and ready-to-use HTML files, as well as other (optional) file formats derived from the original Wikitext file.
|Internet media type|
|Developed by||S.V. Chekanov|
|Initial release||March, 2021|
|Type of format||Compressed container format|
ZWI files are compact (zipped) and can be shared over the network. The format is used for exchange between wikis and has been deployed on the HandWiki encyclopedia. A registered user can use the “ZWI export” button (above the editor area) to download a wiki page. A ZWI file can be unzipped like any ZIP archive. ZWI files have the extension *.zwi.
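Because a ZWI file is an ordinary ZIP archive, it can be unpacked with any standard ZIP library. The following is a minimal sketch using Python's standard zipfile module; the function name extract_zwi and the file name Example.zwi are hypothetical and are not part of the ZWI specification.

```python
import zipfile
from pathlib import Path


def extract_zwi(zwi_path, dest_dir):
    """Extract a ZWI file into dest_dir and return the sorted member names.

    Hypothetical helper: it relies only on the fact that a ZWI file
    is a ZIP archive.
    """
    zwi_path = Path(zwi_path)
    # Reject files that are not ZIP archives before trying to open them.
    if not zipfile.is_zipfile(zwi_path):
        raise ValueError(f"{zwi_path} is not a ZIP archive")
    with zipfile.ZipFile(zwi_path) as archive:
        archive.extractall(dest_dir)
        return sorted(archive.namelist())


if __name__ == "__main__":
    # "Example.zwi" is a placeholder for a file downloaded via "ZWI export".
    if Path("Example.zwi").exists():
        for name in extract_zwi("Example.zwi", "Example_unpacked"):
            print(name)
```

Only archives obtained from a trusted wiki should be extracted this way, since the sketch does not inspect member paths before unpacking.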
ZWI file structure
A ZWI file is a ZIP archive, so it can be manipulated using standard ZIP compression tools. A typical ZWI file has the following structure:
- article.wikitext - Wikitext of the latest revision of the article, in MediaWiki syntax. It is the main source from which HTML, XHTML and other possible formats are derived.
- article.html - HTML file to view in a browser (with all headers). It is a secondary format, derived from article.wikitext.
- article.xhtml - HTML portion with the article content (without headers, navigation etc.) (optional)
- article.tex - article in the LaTeX file format (optional)
- article.dokuwiki - article in the DokuWiki file format (optional)
- metadata.json - a JSON file with information about the article (editors, revisions, namespaces, abstract etc.)
- plugins.json - a JSON file with the information about plugins used by software that creates this file (used for a consistency check)
- media.json - a JSON file with the list of linked media files (images)
- data/media/[namespace]/ - directory with images associated with the article (only if they are available from the local server)
- data/attic/[namespace]/ - directory with files with older revisions of article.wikitext. Each file has the name:
The most important file, which contains the description of the ZWI file, is metadata.json. It states the version of the ZWI format specification and which file is the primary source of derivations (article.wikitext for the MediaWiki software). All other files, such as article.html, article.tex and article.dokuwiki, are secondary conversions, since they are obtained by running converters on the original article.wikitext file.
If a ZWI file is created using DokuWiki software, it is likely that the primary file is article.dokuwiki while article.wikitext is a result of internal conversion. This should be stated in metadata.json.
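Since metadata.json describes the rest of the archive, a reader of a ZWI file would typically load it first. The sketch below reads it directly from the archive without unpacking; the function name read_zwi_metadata is hypothetical, and no particular key names are assumed because the source does not list the fields of metadata.json.

```python
import json
import zipfile


def read_zwi_metadata(zwi_path):
    """Return the parsed content of metadata.json from a ZWI file.

    Hypothetical helper: raises KeyError if the archive has no
    metadata.json member.
    """
    with zipfile.ZipFile(zwi_path) as archive:
        raw = archive.read("metadata.json")
    # JSON text is decoded as UTF-8 here (an assumption about the encoding).
    return json.loads(raw.decode("utf-8"))


def print_metadata_keys(zwi_path):
    """Print the top-level keys so the available fields can be inspected."""
    metadata = read_zwi_metadata(zwi_path)
    for key in sorted(metadata):
        print(key)
```

Inspecting the keys first, rather than assuming them, is a cautious way to find the field that names the primary source file for a given archive.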
Generally, all article revisions should be stored. In some cases (like for HandWiki), only the first revision is stored.
A ZWI file can include the images linked in the article. They are stored in the directory "data/media/[namespace]/". Images are included only if they are located on the local server (i.e. where the wiki with the article is installed). The ZWI export mechanism does not attempt to extract images that are linked from Wikimedia Commons. However, the ZWI creation mechanism attempts to identify cached images.
If there are no other (older) revisions of the article, the directory data/attic/[namespace]/ is not created.
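The directory layout described above can be checked programmatically. The following sketch groups the members of a ZWI file into top-level files, media files and old revisions, based only on the data/media/ and data/attic/ path prefixes; the function name group_zwi_members and the group labels are hypothetical.

```python
import zipfile


def group_zwi_members(zwi_path):
    """Group archive members by their location inside a ZWI file.

    Returns a dict with the keys "top_level", "media" and "attic".
    The "attic" list is empty when the archive stores no older revisions.
    """
    groups = {"top_level": [], "media": [], "attic": []}
    with zipfile.ZipFile(zwi_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip explicit directory entries
            if name.startswith("data/media/"):
                groups["media"].append(name)
            elif name.startswith("data/attic/"):
                groups["attic"].append(name)
            else:
                groups["top_level"].append(name)
    return groups
```

For example, `bool(group_zwi_members(path)["attic"])` indicates whether the archive carries any older revisions of the article.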
The ZWI file format was initially implemented for the SandBox of the HandWiki encyclopedia in March 2021. A proof of the basic principles for the creation and insertion of ZWI files was demonstrated using the DokuWiki wiki software. In April 2021, ZWI file export was deployed as a standard feature of the HandWiki encyclopedia.