TARO - Texas Archival Resources Online Administrative Mode
menu map

How Do I...?

TARO | Administrative Pages | Retro convert schema XML to dtd XML

Retro Convert schema XML to dtd XML

Several people have adopted the ead schema and started producing all their XML files this way either using Archon or Archivist Toolkit. These repositories have stopped submitting their files to TARO due to the fact that they would have to essentially redo hours of work to make them dtd compliant.

Thanks to Amanda Focke, CA - Rice University this is no longer an obstacle. Amanda has kindly shared a stylesheet they use to convert their schema compliant files to dtd. It may be that when people test this for themselves, they have questions. So she has generously allowed us to put her contact info here for help if you like. You can email Amanda at afocke@rice.edu or call at (713)348-2124

Initial note: remember that you need your TARO file # (such as urn:taro:rice.wrc.00533) in the file you send to TARO. In AT, that should be recorded in the field "EAD FA unique identifier" under the Finding Aid Data tab in the Resource.

You can get the ead-schema-to-dtd.xsl stylesheet here (do a right click and save target, you might need to change the extension from "xml" to "xsl" after downloading since the browser recognizes it as an "xml")

Basic instructions for setting up the conversion in Oxygen (a typical xml editor)

  1. Save the stylesheet file (ead-schema-to-dtd.xsl) in a folder that makes sense for your workflow - that could be the same folder as the files you will be converting or a different folder. It doesn't matter as long as you direct the "Transformation Scenario" correctly to find the stylesheet.
  2. In Oxygen, look for the "Configure Transformation Scenario" icon (monkey wrench with a red triangle) or use Ctrl+Shift+C. Click "New." Cloose "XML transformation scenario with XSLT."
  3. On the XLST tab:
    1. Name the transformation scenario something like "my-TARO-schema-to-dtd-transformation"
    2. Leave the XML URL settings as the default "${currentFileURL}".
    3. For the XSL URL: navigate to where you saved the stylesheet and select it.
  4. On the FO Processor tab, leave that alone since we are not making PDFs.
  5. On the Output tab, select "Save as" so that it asks you to provide a name for each resulting file (the new dtd-compliant file) and where you want to save it. You can go back and change that later, as you like, if you prefer to have it save files a different way.
  6. Ok, save and close.

The basic instructions for individual file conversion:

  1. In Archivists' Toolkit, select the collection / record group from the Resources list and click "Export EAD," checking the boxes for "Number component levels," "Suppress components and notes when marked as internal only," "Include DAO's in output," and "Use Digital Object ID as HREF."
  2. Open the resulting schema-compliant xml file in an xml editor, such as Oxygen and confirm that it is valid, which it should be, coming straight out of AT.
  3. Click the icon for Apply Transformation Scenario (red triangle) or use Ctril+Shift+T. Select your transformation scenario (which you might have named "my-TARO-schema-to-dtd-transformation"). Click to Apply it. Save the file in a way that you can tell which file is the new dtd-compliant file.
  4. Open the dtd-compliant file and confirm that it is valid.
  5. Send to TARO as usual and check the file after it is processed to make sure it looks correct.


Copyright © The University of Texas at Austin.
Produced by the University of Texas Libraries.
News How Do I...? <ead> Staff