DSpace

DOCUMENT ARCHIVE

 

NDLTD Document Archive >
International Conferences, Workshops and Symposia >
ETD 2004 >

Please use this identifier to cite or link to this item: http://hdl.handle.net/2340/712

Title: From DTD generation to XML conversion: Structured ETDs at the Document and Publication Server of the Humboldt University
Authors: Uwe Müller
Creation Date: 3-Jun-2004
Issue Date: 21-Jan-2008
Publisher: NDLTD
Place: Lexington, Kentucky, USA
Abstract: Since 1997 PhD students at Humboldt University Berlin can fulfil their publishing duty concerning their dissertations by using a digital publication. While we were one of the first universities having extended the bunch of accepted publishing possibilities to this method nowadays institutional document and publication servers are regarded as a standard service provided by almost every German university. In contrast to the vast majority of domestic university libraries Humboldt's electronic publishing group has pursued a structured document approach from the very beginning of its activities in this area. The originally developed DiML (Dissertation Markup Language) derived and adapted from an SGML-DTD evolved at Virginia Tech for Electronic Theses and Dissertations has now been transformed to an XML-DTD (xDiML). In this context a DTD generation system has been developed allowing for the compilation of individually assembled DTDs. For this purpose the elements of the DTD have been grouped into modular units. These modules which are XML files themselves are stored in the DTDBase. Using the DTDSys - a transformation system on the basis of XSLT and Java - the modules can be combined to an individual DTD - e.g. the xDiML DTD used for Theses and Dissertations. Due to its modularity the system can easily be used to supply new publication series with appropriate DTDs, which can contain special elements and which are as slim as possible and thus more easily applicable than a universal "Mega"DTD. The DTDSys also facilitates the integration of externally managed (standard) DTDs such as SVG, SMIL, MathML, or MusicML and thereby allows the generation of DTDs with multimedia extensions. The use of a controlled and centrally managed set of modules provides the advantages of shared semantics beyond the borderline of different DTDs - a feature which is used e.g. for qualified fulltext retrieval. The XML based publishing approach is currently applied for dissertations and master theses, university serial publications, as well as a few electronic journals and conference proceedings. Different approaches have been developed for the conversion process from text processing systems to XML. They were adapted for the various requirements of the particular author groups or editors. The developed methods include styles and add-ons for OpenOffice/StarOffice, FrameMaker and MS Office. They especially exploit the XML support most software vendors of office systems have newly integrated as a genuine standard interface into their products. The generated XML files form the source for different presentation formats, the basis for longterm preservation activities, and a prerequisite for value added retrieval techniques. The paper will reveal both, the DTD generation system and the XML conversion process.
Citation: In Proceedings of Seventh International Symposium on Electronic Theses and Dissertations
URI: http://www.uky.edu/ETD/ETD2004/uwe/mueller.ppt
http://hdl.handle.net/2340/712
Appears in Collections:ETD 2004

Files in This Item:

File SizeFormat
mueller.ppt1630KbMicrosoft PowerpointView/Open

All items in the NDLTD Document Archive are protected by copyright, with all rights reserved.

 

Networked Digital Library of Theses and Dissertations - Feedback