<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Digital Preservation for Beginners</title>
	<atom:link href="http://easydigitalpreservation.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://easydigitalpreservation.wordpress.com</link>
	<description>Ideas behind the field</description>
	<lastBuildDate>Fri, 14 Oct 2011 04:29:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='easydigitalpreservation.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Digital Preservation for Beginners</title>
		<link>http://easydigitalpreservation.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://easydigitalpreservation.wordpress.com/osd.xml" title="Digital Preservation for Beginners" />
	<atom:link rel='hub' href='http://easydigitalpreservation.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Digitization Specifications Compilation</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/12/23/digitization-specifications-compilation/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/12/23/digitization-specifications-compilation/#comments</comments>
		<pubDate>Thu, 23 Dec 2010 22:29:49 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=382</guid>
		<description><![CDATA[Really briefly, I wanted to direct you to the new page I added to this blog, Digitization Specs. Currently there is a compilation of various specs for digitizing photographic prints and negatives. Just for fun (and reference) I compiled the photograph digitization specifications that I could locate from the websites and publications of various libraries [...]]]></description>
			<content:encoded><![CDATA[<p>Really briefly, I wanted to direct you to the new page I added to this blog, <a href="http://easydigitalpreservation.wordpress.com/dig-specs/">Digitization Specs</a>. Currently there is a compilation of various specs for digitizing photographic prints and negatives.</p>
<p style="padding-left:30px;">Just for fun (and reference) I compiled the photograph digitization  specifications that I could locate from the websites and publications of  various libraries and cultural heritage institutions.  I thought it  would be neat to see them all side by side, and as far as I know, there  isn’t such a resource yet.  I’ve included <strong>specifications for color and black &amp; white photos, in print and film form.</strong></p>
<p>I also intend to eventually compile specifications for digitizing text/print-based documents, but that&#8217;s for another day!</p>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/12/23/digitization-specifications-compilation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>
	</item>
		<item>
		<title>File Formats and Preservation</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/10/05/file-formats-and-preservation/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/10/05/file-formats-and-preservation/#comments</comments>
		<pubDate>Tue, 05 Oct 2010 16:42:01 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Tech]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=335</guid>
		<description><![CDATA[File formats are the rock stars of digital preservation.  After all, one of the goals of digital preservation is to prevent a loss of access to files due to file format obsolescence.  If you are using a file format migration strategy for preservation, then you will be converting the digital files over time to keep [...]]]></description>
			<content:encoded><![CDATA[<p>File formats are the rock stars of digital preservation. After all, one of the goals of digital preservation is to prevent a loss of access to files due to file format obsolescence. If you are using a file format <strong>migration</strong> strategy for preservation, then you will be converting the digital files over time to keep the content stored in formats that are readable by current technology. If you are practicing a software <strong>emulation</strong> strategy for preservation, then you are maintaining software that can read the old file formats.</p>
<p>When a digital object is deposited into a digital repository, its file type is declared by its extension (.jpg, .pdf, etc.). The type of file you are dealing with has big implications for how preservation practices can be applied to it now and in the future. This is because being able to access the contents of a digital object depends on the ability to store, read, and edit the digital files &#8211; actions that depend on the file format’s specification and on the software needed to understand that format. The specification is a description of the file format that includes its basic building blocks and a technical, byte-by-byte description of its layout. Cornell’s digital preservation <a href="http://www.icpsr.umich.edu/dpm/dpm-eng/oldmedia/obsolescence1.html">tutorial</a> says a bit more about it.</p>
<h4><span style="color:#000000;">Extinction</span></h4>
<div id="attachment_339" class="wp-caption alignright" style="width: 250px"><a href="http://easydigitalpreservation.files.wordpress.com/2010/10/extinct.jpg"><img class="size-full wp-image-339 " title="extinct" src="http://easydigitalpreservation.files.wordpress.com/2010/10/extinct.jpg?w=240&#038;h=135" alt="dinosaur bones" width="240" height="135" /></a><p class="wp-caption-text">Photo by Charles Tilford, CC license</p></div>
<p>As you know, when a software program creates a file, the program can re-open the file to view it, edit it, etc. This is because the program knows the file format’s specifications and was designed to work with it. As software programs get upgraded or disappear, the ability to read the files they created becomes less certain. Software upgrades happen all the time, and it is usually possible to open a file created with the previous version of a program. But over time and numerous updates, this might not always be the case. And it certainly won’t be possible if the software stops getting upgraded and eventually can no longer run on new machines.</p>
<p>To  illustrate this point, let’s look at the old Mac program MacPaint,  which was a basic painting program that shipped with Apple computers  from 1984-1988.  Files created with this program were “MacPaint bitmap  images,” and received the extension .mac (there were a few other  extensions for this format, but let’s focus on this one).  MacPaint  won’t run on modern machines, and there are certainly no programs from  after 1988 that were designed to read this format.  So all .mac files  became orphaned, and the only way to read them was to boot up an old  machine with MacPaint on it.  (Happily, Apple released the source code  of MacPaint to the <a href="http://www.computerhistory.org/highlights/macpaint/">Computer History Museum</a>, meaning that with a little work these files are readable.)</p>
<h4><span style="color:#000000;">Open &amp; Proprietary Formats</span></h4>
<p>But we’ve come to an interesting juncture in this discussion. File formats can be grouped into two categories: open and proprietary. Open file formats are those whose specifications are publicly available. When this information is available, programs other than the one that created the file can be made to interpret the file’s format (or migrate an old file into a newer format), and we are not dependent on the original program. This makes it more likely that the file will remain usable in its original format. Some open file formats that I’m sure you’ve come to love include .pdf, .jpg, and .tif.</p>
<p>When  a file format is proprietary, the format’s specifications are not  available because they are usually guarded as property of the company  that created the program that creates the files.   If the .mac file  format had been open, then it is far less likely that content would have  ever gotten trapped in this extinct format.</p>
<p>With digital preservation, the rule of thumb is to move your content into file formats that are 1. open, and/or 2. popular. When a file format is open, we can get inside its structure and know what’s going on, even if the software that a file was originally created with no longer functions. The thinking behind choosing a popular file format over one that is used less frequently is that someone is bound to find a way to “get inside” the format, since so many people will have invested their content in it. Someone will find a way in, and hopefully share their secret.</p>
<p>Here is a case demonstrating the issue of open versus proprietary formats. The University of Minnesota’s University Digital Conservancy explicitly determines how much preservation action they can put into specific files based on their format:</p>
<blockquote><p>More  extensive actions will be taken to preserve usability for objects in  file formats that are fully disclosed, well documented, widely adopted,  and are most accessible for migration, emulation, or normalization  actions. Fewer actions will be taken to preserve usability for file  formats that are proprietary and/or undocumented, and those that are  considered working formats (e.g., Photoshop .psd) and/or are not widely  adopted.</p></blockquote>
<p>You can view the tables outlining their levels of preservation support per file format <a href="http://conservancy.umn.edu/pol-preservation.jsp">here</a>.<br />
I  also liked this table of recommended formats put together by the  Florida Digital Archive (<a href="http://www.fcla.edu/digitalArchive/pdfs/recFormats.pdf">PDF</a>).</p>
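<p>To make the rule of thumb concrete, here is a minimal Python sketch that assigns a support level to a format from three yes/no properties. The <code>FormatProfile</code> fields, the level names, and the example values are all assumptions made for illustration; they are not taken from the Conservancy&#8217;s or the Florida Digital Archive&#8217;s actual tables.</p>

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FormatProfile:
    """Hypothetical description of a file format, for illustration only."""
    extension: str
    open_spec: bool        # specification is publicly available
    widely_adopted: bool   # many people and tools use the format
    working_format: bool   # meant for editing rather than long-term storage


def support_level(fmt: FormatProfile) -> str:
    """Return an illustrative preservation support level for a format."""
    if fmt.working_format:
        return "limited"
    if fmt.open_spec and fmt.widely_adopted:
        return "full"
    if fmt.open_spec or fmt.widely_adopted:
        return "partial"
    return "limited"


if __name__ == "__main__":
    # Example values are illustrative guesses, not an authoritative registry.
    examples = [
        FormatProfile(".tif", open_spec=True, widely_adopted=True, working_format=False),
        FormatProfile(".psd", open_spec=False, widely_adopted=True, working_format=True),
        FormatProfile(".mac", open_spec=False, widely_adopted=False, working_format=False),
    ]
    for fmt in examples:
        print(fmt.extension, support_level(fmt))
```

<p>A real policy would weigh more factors than these three, but the shape is the same: the more open and widely used a format is, the more a repository can promise to do for it.</p>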
<h4><span style="color:#000000;">File Format Resources</span></h4>
<p><a href="http://www.nationalarchives.gov.uk/PRONOM/Default.aspx">PRONOM</a> is a remarkable  project of the UK’s National Archives.  They have created a  comprehensive directory of file formats and the programs that can  understand them.  It’s truly a great resource for digital archivists  because a search for a file format will yield information about its  origin, its particular specification signatures, associated rights, and  more.  The National Archives also developed <a href="http://www.nationalarchives.gov.uk/aboutapps/PRONOM/tools.htm">DROID</a> to work in conjunction with PRONOM.  DROID can automatically identify file formats in batch operations.</p>
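<p>The idea behind signature-based identification can be sketched in a few lines of Python. This is only a toy: the short signature table below is my own selection of well-known leading bytes, and the function names are hypothetical. It is not how DROID is implemented, and PRONOM&#8217;s signature definitions are far richer.</p>

```python
from typing import Optional

# A few well-known leading byte signatures ("magic numbers").
# This tiny table is an illustration, not a substitute for a format registry.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"II*\x00", "TIFF (little-endian)"),
    (b"MM\x00*", "TIFF (big-endian)"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
]


def identify_bytes(head: bytes) -> Optional[str]:
    """Return a format name if the leading bytes match a known signature."""
    for magic, name in SIGNATURES:
        if head.startswith(magic):
            return name
    return None


def identify_file(path: str) -> Optional[str]:
    """Read the first 16 bytes of a file and try to identify its format."""
    with open(path, "rb") as handle:
        return identify_bytes(handle.read(16))


if __name__ == "__main__":
    # The extension is only a claim; the leading bytes are the evidence.
    print(identify_bytes(b"%PDF-1.4\n"))         # PDF
    print(identify_bytes(b"\xff\xd8\xff\xe0"))   # JPEG
    print(identify_bytes(b"plain text"))         # None
```

<p>Checking the bytes rather than the extension matters because an extension can be missing, renamed, or simply wrong by the time a file reaches the repository.</p>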
<p>Growing from a partnership between PRONOM and the <a href="http://www.gdfr.info/">Global Digital Format Registry</a> (GDFR) is the forthcoming <a href="http://www.udfr.org/">Unified Digital Format Registry</a> (UDFR). The aim of this project is to create a larger, open registry, based on the PRONOM database, to which community participants can add formats.</p>
<p>If  you’re looking for new fodder for your RSS feed, here is a <a href="http://fileformats.wordpress.com/">blog</a> that is entirely devoted to  discussing file formats in the context of digital preservation.  It’s  written by Gary McGath, who worked on the <a href="http://hul.harvard.edu/jhove/">JHOVE</a> and <a href="https://confluence.ucop.edu/display/JHOVE2Info/Home">JHOVE2</a> projects,  which validate file format claims upon repository ingest.  Here is an  older <a href="http://easydigitalpreservation.wordpress.com/2009/09/04/jhove-and-jhove2/">post</a> about the projects.</p>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/10/05/file-formats-and-preservation/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/10/extinct.jpg" medium="image">
			<media:title type="html">extinct</media:title>
		</media:content>
	</item>
		<item>
		<title>METS for Transferable Metadata</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/06/30/mets-for-transferable-metadata/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/06/30/mets-for-transferable-metadata/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 20:14:02 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Standards]]></category>
		<category><![CDATA[METS]]></category>
		<category><![CDATA[OAIS]]></category>
		<category><![CDATA[PREMIS]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=312</guid>
		<description><![CDATA[METS is the Metadata Encoding and Transmission Standard, which is applied to encoding metadata via a standardized XML schema.  METS handles all types of metadata that are relevant to preservation: descriptive, administrative, and technical/structural metadata are all included in the schema, and a METS document will serve as the container for all of this information [...]]]></description>
			<content:encoded><![CDATA[<p>METS is the Metadata Encoding and Transmission Standard, which is applied to encoding metadata via a standardized XML schema.  METS handles all types of metadata that are relevant to preservation: descriptive, administrative, and technical/structural metadata are all included in the schema, and a METS document will serve as the container for all of this information about a digital object.</p>
<p>The schema was initially developed for the digital library community, and has since spread to the digital repository and preservation communities.  The fact that METS consolidates the varying types of an object’s metadata into one standard XML-based file type is excellent news for sharing and preserving resources.</p>
<div id="attachment_318" class="wp-caption alignright" style="width: 250px"><a href="http://easydigitalpreservation.files.wordpress.com/2010/06/93715447_8bcd6b1c85_m.jpg"><img class="size-full wp-image-318" title="Boat Transfer" src="http://easydigitalpreservation.files.wordpress.com/2010/06/93715447_8bcd6b1c85_m.jpg?w=240&#038;h=145" alt="Boat Transfer" width="240" height="145" /></a><p class="wp-caption-text">Photo by Flickr user SyN+H, CC license</p></div>
<p>As is evident from the experience of many current digital preservation programs, collaboration among multiple institutions is a strategic move for a successful digital preservation program.  Using METS as a guideline for creating readable and transferable metadata ensures a more seamless sharing experience.  It also aids in escape strategies should the repository, or the institution hosting it, fail and the digital objects need to be transferred to someone else’s care.</p>
<h4>History</h4>
<p>The beginnings of a standardized metadata scheme for collections of digital objects can be traced back to 1997, when UC Berkeley and the Digital Library Federation (DLF) initiated a project to further the concept of digital libraries sharing resources.  By 2001, the DLF-sponsored METS schema had emerged.  It is supported by the Library of Congress, was made a NISO standard in 2004, and was renewed in 2006.</p>
<p>By 2006, it had become clear that METS could serve not only as an answer to the interoperability needs associated with sharing digital objects, but also as a valuable tool for preservation.  Jerome McDonough (2006) states that “the METS standard can be considered one of many efforts to try to determine…how complex sets of data and metadata might best be encoded to support both information exchange and information longevity.”</p>
<h4>How it&#8217;s Used</h4>
<p>The <a href="http://easydigitalpreservation.wordpress.com/2009/07/29/oais-reference-model-part-i-background-and-influence/">OAIS reference model</a> considers an acceptable digital object as one that includes the original content as well as the metadata required to understand the content, its structure, its rendering needs, and its preservation history.  This information plus the actual content forms a complete “information package,” which comes in the flavors of SIPs, AIPs, and DIPs, depending on the object’s role in a repository, as discussed in the OAIS reference model.  The metadata that comes in each of these flavors is referred to as the Preservation Description Information (PDI). (Note: DIPs do not always have PDIs since they are the distribution versions.)</p>
<p>We know from the OAIS model that a PDI categorizes a digital object’s metadata into reference, provenance, context, and fixity categories.  METS is capable of fulfilling these metadata requirements with corresponding sections in each METS document:</p>
<ul>
<li>Descriptive &lt;dmdSec&gt;</li>
<li>Administrative &lt;amdSec&gt; (covers provenance and rights)</li>
<li>File Section &lt;fileSec&gt; (lists, in &lt;fileGrp&gt; groups, any and all files that comprise the digital object)</li>
<li>Structural Map &lt;structMap&gt;</li>
<li>Structural Links &lt;structLink&gt;</li>
<li>Behavior &lt;behaviorSec&gt;</li>
</ul>
<p>It is important to realize, however, that according to the METS standard, the only required part of a METS document is the Structural Map.  So in order for METS to be effective when applied to preservation, there must be information in each of these sections (FYI &#8211; a truly complete METS file will also include a header &lt;metsHdr&gt;).<a href="http://easydigitalpreservation.files.wordpress.com/2010/06/partsangles.jpg"><img class="alignright size-medium wp-image-313" title="The Seven METS sections" src="http://easydigitalpreservation.files.wordpress.com/2010/06/partsangles.jpg?w=300&#038;h=240" alt="The Seven METS sections" width="300" height="240" /></a></p>
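<p>A quick way to see which sections a given METS file actually contains is to parse it and look for each top-level element. The Python sketch below does that with the standard library. The sample document and the function names are made up for illustration, and the check only looks for the presence of each section, not for schema validity. Note that in the METS schema the top-level file element is named <code>fileSec</code>, and it holds the <code>fileGrp</code> groups.</p>

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"

# Top-level METS sections, in the order the schema lists them.
SECTIONS = ["metsHdr", "dmdSec", "amdSec", "fileSec",
            "structMap", "structLink", "behaviorSec"]


def section_report(mets_xml: str) -> dict:
    """Map each top-level METS section name to True if it is present."""
    root = ET.fromstring(mets_xml)
    return {name: root.find(f"{{{METS_NS}}}{name}") is not None
            for name in SECTIONS}


def missing_sections(mets_xml: str) -> list:
    """List the sections that the document leaves out."""
    report = section_report(mets_xml)
    return [name for name in SECTIONS if not report[name]]


# A made-up document containing only the one required section.
SAMPLE = """<mets xmlns="http://www.loc.gov/METS/">
  <structMap>
    <div LABEL="Example object"/>
  </structMap>
</mets>"""

if __name__ == "__main__":
    print(missing_sections(SAMPLE))
```

<p>For the sample above, everything except the Structural Map is reported as missing, which is exactly the gap between a technically acceptable METS file and one that is useful for preservation.</p>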
<p>So where do we get this information to fill up a METS file?  The answer is PREMIS.</p>
<h4>METS and PREMIS &#8211; A Perpetual Preservation Honeymoon</h4>
<p>You may recall that <a href="http://easydigitalpreservation.wordpress.com/2009/10/30/premis-for-preservation-metadata/">PREMIS</a> is also an XML schema that has been developed for preservation metadata.  The PREMIS structure is based on entities and semantic units that will harbor information about a digital object that is necessary for supporting and recording digital preservation actions.</p>
<p>What’s important here is that PREMIS will sit inside the METS document.  You can see an example of this <a href="http://www.loc.gov/standards/premis/louis-2-0.xml">here</a>.  All of the preservation information will be present in the PREMIS file, and by nesting the PREMIS data into the METS file, the metadata becomes transferable to other repositories.</p>
<p>The flexibility of both of these schemas implies that there are variations and complications with integrating PREMIS and METS.  The Library of Congress created a working draft of guidelines for this process, which is viewable <a href="http://www.loc.gov/premis/guidelines-premismets.pdf">here</a> (PDF, 25K).</p>
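<p>As a rough picture of what &#8220;sitting inside&#8221; means, the Python sketch below builds a skeletal METS document with one PREMIS object wrapped in the administrative section. It assumes the PREMIS 2.x namespace and one common arrangement (a <code>techMD</code> holding an <code>mdWrap</code> of type <code>PREMIS:OBJECT</code>); the IDs, the identifier value, and the helper names are invented, and the result is deliberately too sparse to be schema-valid. The Library of Congress guidelines remain the place to look for the real rules.</p>

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
PREMIS_NS = "info:lc/xmlns/premis-v2"  # PREMIS 2.x namespace (assumed version)

ET.register_namespace("mets", METS_NS)
ET.register_namespace("premis", PREMIS_NS)


def m(tag: str) -> str:
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def p(tag: str) -> str:
    """Qualify a tag name with the PREMIS namespace."""
    return f"{{{PREMIS_NS}}}{tag}"


def build_mets_with_premis(identifier: str) -> ET.Element:
    """Build a skeletal METS tree with one PREMIS object nested inside it."""
    mets = ET.Element(m("mets"))

    # Administrative metadata: the PREMIS data lives inside mdWrap/xmlData.
    amd = ET.SubElement(mets, m("amdSec"), {"ID": "AMD1"})
    tech = ET.SubElement(amd, m("techMD"), {"ID": "TECH1"})
    wrap = ET.SubElement(tech, m("mdWrap"), {"MDTYPE": "PREMIS:OBJECT"})
    xml_data = ET.SubElement(wrap, m("xmlData"))

    # A pared-down PREMIS object: just an identifier.
    obj = ET.SubElement(xml_data, p("object"))
    obj_id = ET.SubElement(obj, p("objectIdentifier"))
    ET.SubElement(obj_id, p("objectIdentifierType")).text = "local"
    ET.SubElement(obj_id, p("objectIdentifierValue")).text = identifier

    # The required structural map points back at the metadata by ID.
    struct = ET.SubElement(mets, m("structMap"))
    ET.SubElement(struct, m("div"), {"ADMID": "TECH1"})
    return mets


if __name__ == "__main__":
    tree = build_mets_with_premis("demo-001")
    print(ET.tostring(tree, encoding="unicode"))
```

<p>Because the PREMIS elements keep their own namespace inside the wrapper, a receiving repository can pull them back out of the METS file without losing anything.</p>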
<h4>Helpful METS Resources</h4>
<ul>
<li><a href="http://www.loc.gov/standards/mets/METSPrimerRevised.pdf">METS Primer</a> (Revised 4/2010) (PDF, 1.53MB) – Readable, and has color images and examples.</li>
<li>PREMIS in METS <a href="http://pim.fcla.edu/">toolbox</a>, information about the project <a href="http://digitalarchivist.wordpress.com/2010/04/23/premis-in-mets-toolkit-released-by-library-of-congress-and-florida-center-for-library-automation/">here</a>.</li>
<li><a href="http://www.loc.gov/standards/mets/mets-tools.html">METS Creation Tools.</a></li>
</ul>
<h5>McDonough, J. (2006). METS: Standardized Encoding for Digital Library Objects. <em>International Journal on Digital Libraries</em>, 6(2), 148-158.</h5>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/06/30/mets-for-transferable-metadata/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/06/93715447_8bcd6b1c85_m.jpg" medium="image">
			<media:title type="html">Boat Transfer</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/06/partsangles.jpg?w=300" medium="image">
			<media:title type="html">The Seven METS sections</media:title>
		</media:content>
	</item>
		<item>
		<title>Video Digital Preservation Workshop</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/06/09/video/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/06/09/video/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 22:37:57 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Digitization]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Standards]]></category>
		<category><![CDATA[OAIS]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=298</guid>
		<description><![CDATA[On Monday, I was thrilled to attend a workshop entitled Digital Preservation for Video, presented by Linda Tadic for Independent Media Art Preservation (IMAP).  The workshop was held in San Francisco at the Bay Area Video Coalition (BAVC).  The scope of the event was to cover some of the key considerations in digitizing video [...]]]></description>
			<content:encoded><![CDATA[<p>On Monday, I was thrilled to attend a workshop entitled Digital Preservation for Video, presented by Linda Tadic for Independent Media Art Preservation (<a href="http://www.imappreserve.org/">IMAP</a>).  The workshop was held in San Francisco at the Bay Area Video Coalition (<a href="http://www.bavc.org/">BAVC</a>).  The scope of the event was to cover some of the key considerations in digitizing video and creating a digital preservation program at the DIY level (i.e. without a huge IT department backing you up).  A few of the institutions represented by attendees included BAVC, the Pacific Film Archive, the California Institute of the Arts, the California Academy of Sciences, the Sierra Club, the San Francisco Symphony, and the California Film Institute.</p>
<p>Prior to this workshop, I hadn&#8217;t had a great deal of exposure to the digital preservation challenges of moving visual materials.  In fact, I confess that I hardly knew anything about the current physical formats used for video storage, nor much about the hard work that is necessary for digitizing them.  Most of the attendees have done their share of digitizing moving images (or of outsourcing the digitization), and I think that most of us were there to explore the answers to the question of &#8220;now what?&#8221;</p>
<h3>The Move to File-Based Video Storage</h3>
<p>Physical moving image storage formats are on death row.  We spent the bulk of the morning going over the characteristics of different physical media and their expiration dates, which served as an effective motivator for digitization and instilled something close to panic among the attendees.</p>
<div id="attachment_301" class="wp-caption alignright" style="width: 209px"><a href="http://easydigitalpreservation.files.wordpress.com/2010/06/film-reel.jpg"><img class="size-medium wp-image-301 " title="film reel" src="http://easydigitalpreservation.files.wordpress.com/2010/06/film-reel.jpg?w=199&#038;h=300" alt="film reel" width="199" height="300" /></a><p class="wp-caption-text">Photo by Serena Epstein, CC Attribution-Noncommercial-Share Alike 2.0 Generic license</p></div>
<p>Unlike paper, the magnetic tapes, reels, and discs that moving images are physically stored on are on a very tight deadline; aside from succumbing to format obsolescence, most of the media is reaching the end of its life expectancy, after which the images on them will simply not exist anymore.  To give some examples of formats that I was more familiar with, the life span of VHS is approximately 15 years, while MiniDV, DVCam, and Video are 5-10 years.  This illustrates that, in some cases, the oldest things do not need to be digitized first.</p>
<p>Digitization is increasingly, and arguably, the best preservation option for some of these formats, and it is important that the road to digitization doesn&#8217;t result in a dead end.  That is why we need to ensure that once digitization has occurred, there is a digital preservation plan in place so that the video content will continue to survive, especially since the original physical sources of the content will be dead in a short while.</p>
<p>Indeed, we are observing a shift from format-based physical video storage to the file-based storage of digital video content.  Preservation will no longer be about making the tapes last as long as possible, but about caring for the digital files representing the content that the tapes once held.</p>
<h3>Preservation Concerns for Digital Video</h3>
<p>I appreciate how Linda was adamant in reminding us that digital preservation is not a one-time fix for digital video longevity.  She was very clear in telling us that it requires a constant guardianship consisting of a deliberate, scheduled management of the digital files.  To use her phrasing, there is no &#8220;store-and-ignore&#8221; solution.  Preservation activities involve keeping file formats current so that they can be accessed by the software of the now.  It also involves exercising the hard drives that your files may be stored on and not letting them sit idle for more than 6 months.  It requires diligent updating of the files&#8217; accompanying preservation metadata so that changes to the files can be tracked and managed.</p>
<p>Linda also stated what nobody likes to hear about digital preservation: that there is no one way to do things, and that there is no one set of instructions to follow that will help you save your content.  As with all file types, the preservation decisions you make will depend on your content, your file types, your storage, and your intended access methods.  So, in the case of making storage selections and creating a plan, knowledge is power.  I&#8217;ll try to summarize some of the key points covered.</p>
<p><span id="more-298"></span></p>
<h3>File Formats</h3>
<p>Before committing to a single file format for all of your files, you need to consider something truly important for maintaining the integrity of your master files: extra copies!  These come in a couple of flavors.  You&#8217;ll want a copy of your master file that can be used to make other copies (referred to as a mezzanine-level copy).  You really don&#8217;t want to touch your master files (preservation-level copies).  Depending on how you are providing access or distributing your content, you may also want to create compressed copies of your files that aren&#8217;t quite so large (distribution copies).  Distribution copies could very well be in a completely different file format than your preservation copies.</p>
<p>In any preservation activity, there is a clear preference for storing your content in open file formats.  This tends to make migrations easier down the road, and it decreases your dependence on proprietary vendors.  It is prudent to be wary of file formats that are too open, however, in the sense that everyone uses the format in a different way.  This seems to be the case with MXF wrappers.</p>
<p>One open file format discussed was JPEG2000, which, surprisingly, seems like a pretty good contender for the preferred file format for video.  Most people in the US aren&#8217;t aware that JPEG2000 can be used for video (I sure wasn&#8217;t), but talk to a European involved in storing and preserving digital video content, and they&#8217;ll wonder why you aren&#8217;t already using it.  It has also been adopted by Digital Cinema Initiatives.  JPEG2000 offers roughly 3:1 lossless compression, is good for storage, and plays back uncompressed (making it unsuitable as a file format for most types of distribution copies).  Data can also be stored in the wrapper in an XML stream.  A potential downfall is that use of JPEG2000 in the US is currently largely led by the adoption of the SAMMA hardware system, which seems to be necessary in order to make the format compatible with anything else.  (Update: It was also pointed out to me in an email that educational institutions using the SAMMA system tend to use JPEG2000, and that there are production implementations of JPEG2000 as well.)</p>
<p>Other open file formats discussed were Uncompressed 8-bit and 10-bit, and DV25 and DV50, which are better candidates for mezzanine-level files.  Proprietary formats include Apple&#8217;s ProRes and a wide range of wrappers such as AVI (Microsoft), QuickTime, and Windows Media.</p>
<h3>Storage</h3>
<p>Once you have your files you&#8217;ll want to have a solid file naming scheme.  The important points to consider when creating a naming convention are to be consistent, avoid punctuation, and allow the file names to reflect context to the extent that the name can still make sense outside of the hierarchy of its intended storage collection.  Basically, don&#8217;t use file names like 0001.mov.  Try to include acronyms for institutions and subcollections along with an object ID: ab_cd_12345_20100602.mpg</p>
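<p>As a rough illustration of the naming advice above, here is a minimal Python sketch.  The function name <code>build_filename</code>, its parameters, and the exact pattern (institution acronym, subcollection acronym, object ID, date) are assumptions modeled on the example ab_cd_12345_20100602.mpg; the workshop did not prescribe any particular implementation.</p>

```python
import re
from datetime import date


def build_filename(institution, subcollection, object_id, created, extension):
    """Join the naming parts with underscores, e.g. ab_cd_12345_20100602.mpg.

    Hypothetical helper: the part order is an assumption based on the
    example file name, not a rule given at the workshop.
    """
    parts = [
        institution.lower(),
        subcollection.lower(),
        str(object_id),
        created.strftime("%Y%m%d"),
    ]
    for part in parts:
        # Allow only letters and digits so no punctuation sneaks into a name.
        if not re.fullmatch(r"[a-z0-9]+", part):
            raise ValueError(f"invalid name component: {part!r}")
    return "_".join(parts) + "." + extension.lower().lstrip(".")


print(build_filename("AB", "CD", 12345, date(2010, 6, 2), "mpg"))
```

<p>Generating names from one function, rather than typing them by hand, is one way to stay consistent across a whole collection.</p>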
<div id="attachment_303" class="wp-caption alignleft" style="width: 310px"><a href="http://easydigitalpreservation.files.wordpress.com/2010/06/film-tape.jpg"><img class="size-medium wp-image-303" title="film tape" src="http://easydigitalpreservation.files.wordpress.com/2010/06/film-tape.jpg?w=300&#038;h=225" alt="film tapes" width="300" height="225" /></a><p class="wp-caption-text">Photo by Tobias Steinhoff, CC Attribution-Noncommercial-Share Alike 2.0 Generic license</p></div>
<p>A good thing to know about your collection of digital video is how much of it you have so that you can evaluate your options for storing it.  Depending on size, you&#8217;ll want to choose different physical storage carriers.  Options discussed included optical media like discs, external hard drives, RAID systems, digital linear tapes (e.g. LTO), and cloud storage.  Optical media and linear tapes will face the same life expectancy problems that the original sources of digitized video are currently facing, so these are relatively short-term solutions, but at least they have known lifespans that you can pretty much count on.</p>
<p>Some notes about external hard drives: the smaller the drive, the more prone it will be to failure, but very large drives should also be avoided so that if one fails, you won&#8217;t lose too much.  Also, don&#8217;t purchase the least expensive drives out there; brands like Western Digital and Samsung were noted to be more reliable.  The hard drive option to watch for in the future is the solid-state drive, which has no moving parts &#8211; but at this point these are still pretty new, expensive, and prone to heat problems.  Current hard drives need to be exercised every 6 months or so, and you&#8217;ll want to replace them every 3 to 5 years.</p>
<p>Cloud storage wasn&#8217;t recommended as a realistic option due to security issues, considerations related to downloads of large files, and the fact that in terms of digital preservation, cloud storage takes you right out of the Trusted Digital Repository realm.</p>
<p>The take-home message from this section of the workshop is that redundancy is critical: you will have failures.  It&#8217;s a good idea to make copies of all of your files in all their formats (preservation, mezzanine, and access levels), and store your copies in different geographic locations.  It is also wise to diversify your storage media.</p>
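<p>Redundant copies only help if they are still identical to one another.  The sketch below, in Python, shows one possible way to check that: hash each copy with SHA-256 and compare the results.  The function names and the choice of SHA-256 are assumptions for illustration; the workshop did not name a specific tool or algorithm for this.</p>

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in chunks so large video files never sit fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        while True:
            chunk = handle.read(chunk_size)
            if not chunk:
                break
            digest.update(chunk)
    return digest.hexdigest()


def copies_match(paths):
    """Return True when all copies hash to the same SHA-256 digest."""
    digests = {sha256_of(Path(p)) for p in paths}
    return len(digests) == 1
```

<p>In practice, <code>copies_match</code> would be handed the paths of the same file on each storage carrier or at each location; a result of False means at least one copy has drifted and should be replaced from a good one.</p>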
<h3>Metadata</h3>
<p>When librarians and archivists think of metadata, they are probably first thinking about descriptive metadata: what is the content of the item, what subjects are covered?  Digital preservation unleashes a whole new type of technical metadata that must be kept for each item, in addition to any descriptive metadata.  It&#8217;s aptly referred to as Preservation Metadata.  It will include information about the original source file, what software it was created with, when it was ingested into your storage system, how big it is, its checksum and the algorithm used to compute it, and each and every change that is made to the file after ingestion.</p>
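<p>To make that list a little more concrete, here is a minimal Python sketch of what one such record might look like.  The class name, the field names, and the sample values are all hypothetical; they are not taken from PREMIS or any other scheme, and a real program would follow whichever scheme it adopts.</p>

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone


@dataclass
class PreservationRecord:
    """Hypothetical record; field names are illustrative, not from PREMIS."""
    source_file: str
    creating_software: str
    ingested_at: str
    size_bytes: int
    checksum_algorithm: str
    checksum: str
    events: list = field(default_factory=list)

    def log_event(self, action, detail):
        # Append rather than overwrite, so the full history is kept.
        self.events.append({
            "when": datetime.now(timezone.utc).isoformat(),
            "action": action,
            "detail": detail,
        })


# Stand-in bytes so the sketch runs without a real video file.
payload = b"stand-in bytes for a video file"
record = PreservationRecord(
    source_file="ab_cd_12345_20100602.mpg",
    creating_software="unknown",
    ingested_at="2010-06-02T00:00:00+00:00",
    size_bytes=len(payload),
    checksum_algorithm="sha256",
    checksum=hashlib.sha256(payload).hexdigest(),
)
record.log_event("migration", "hypothetical example of a format migration")
print(json.dumps(asdict(record), indent=2))
```

<p>The point of the <code>events</code> list is that every change after ingestion gets its own dated entry, so the record doubles as a history of the file.</p>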
<p>Linda says that &#8220;you can do anything, as long as your metadata tracks it.&#8221;  The preservation metadata will become the history of your files, and the breadcrumb trail to follow if you need to go to Plan B.  There are many schemes that can be used for deciding what is important to include in your preservation metadata, but Linda pointed out that you&#8217;ll probably end up making your own scheme anyway, likely drawing from some of the pre-existing schemes to suit your specific needs.  Some of the preservation metadata schemes discussed included <a href="http://easydigitalpreservation.wordpress.com/2009/10/30/premis-for-preservation-metadata/">PREMIS</a>, SMPTE and PBCore.</p>
<h3>More Preservation Actions</h3>
<p>The other parts of the workshop that discussed digital preservation procedures were more general, and applicable to most types of digital content.  We went over the merits of Trusted Digital Repositories (TDRs), and basic procedures to run through when ingesting and migrating files (file type characterization and validation, checksums, etc).</p>
<p>The <a href="http://easydigitalpreservation.wordpress.com/2009/07/29/oais-reference-model-part-i-background-and-influence/">OAIS model</a> was breezed over, which is understandable due to its complexity and given the overall scope of the workshop.  But it made me think about how scalable the OAIS model actually is: even if you are dealing with a smaller collection, you can still implement the basic workflows and concepts.  I acknowledge that there are certainly situations where the model may be overkill depending on the size and type of the collection being managed, but there are still some incredibly valuable components that could be pulled from it: the digital preservation vocabulary and information packages concepts in particular.</p>
<h3>Reactions</h3>
<p>I think it is worth noting some of the concerns expressed by the attendees, since they are probably common to many people&#8217;s reactions when faced with initiating a digital preservation program.  One attendee acknowledged that &#8220;ten years suddenly seems like a long time&#8221; with regard to format life expectancy.  Some shared concerns were related to the sheer size of the task, the need to prioritize because of the cost-prohibitive activities involved, lack of personal technical expertise, the challenges of creating or locating a TDR, and an urgent feeling that action needed to be taken immediately.</p>
<p>It may also have felt discouraging to the attendees that the act of putting together a digital preservation program isn&#8217;t a final fix, because it will require constant staff monitoring and action &#8211; which is a particular challenge to smaller organizations.  It is worth noting that a feasible solution, to at least this final concern, may be around the corner.  Linda is heading a project in development called the Audiovisual Archive Network (<a href="http://www.archivenetwork.org/">AVAN</a>), which is aiming to provide hosted digital video preservation services as a non-profit organization.</p>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/06/09/video/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/06/film-reel.jpg?w=199" medium="image">
			<media:title type="html">film reel</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/06/film-tape.jpg?w=300" medium="image">
			<media:title type="html">film tape</media:title>
		</media:content>
	</item>
		<item>
		<title>iPRES and 02010</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/03/17/ipres-and-02010/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/03/17/ipres-and-02010/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 18:39:28 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[iPres]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=277</guid>
		<description><![CDATA[My Internet radar is starting to pick up some buzz about iPRES 2010. iPRES is an annual, international conference on &#8220;the preservation of digital objects&#8221; (see my previous iPRES gushing from when I was an intern).  The 2010 call for papers for the October 2010 meeting has been issued, and this year, there is also [...]]]></description>
			<content:encoded><![CDATA[<p>My Internet radar is starting to pick up some buzz about <a href="http://www.ifs.tuwien.ac.at/dp/ipres2010/index.html">iPRES 2010</a>.  iPRES is an annual, international conference on &#8220;the preservation of digital objects&#8221; (see my previous iPRES <a href="http://easydigitalpreservation.wordpress.com/2009/08/19/ipres/">gushing</a> from when I was an intern).  The 2010 call for papers for the October 2010 meeting has been issued, and this year, there is also a call for leading workshops and tutorials for digital preservation activities.  This will surely lead to great opportunities to learn and share experiences and skill sets.</p>
<p>iPRES 2010 is being held in Vienna and is hosted by the Austrian National Library and the Technical University of Vienna.  I think the statement that this year&#8217;s organizers have made with their logo design is very apt for the conference subject matter.  It reads &#8220;iPRES 02010.&#8221;  Expressing the year in five digits instead of four is an excellent reminder that we exist at only one particular point in time.<a href="http://easydigitalpreservation.files.wordpress.com/2010/03/ipres20101.jpg"><img class="alignleft size-full  wp-image-282" style="margin:5px;" title="ipres2010" src="http://easydigitalpreservation.files.wordpress.com/2010/03/ipres20101.jpg?w=220&#038;h=70" alt="iPRES 2010 logo" width="220" height="70" /></a> As climactic as the year 2010 seemed to us on January 1st, it really isn&#8217;t any sort of finale.</p>
<p>Speak to a person involved in digital preservation, and they may be  able to forecast what the next five years of digital information  preservation management will look like.  Maybe.  The five-digit year  expresses the future, and encourages thoughts about the people coming  after us who will stand to benefit from our digital output.  Thinking  that far ahead when talking about digital preservation is rather lofty, I  know.  But it illustrates a pervasive point.</p>
<p>It is not likely that I will attend iPRES this year, so I&#8217;ve appeased myself by reliving some of the great presentations I saw last year in San Francisco.  The host of last year&#8217;s iPRES, the California Digital Library, recently put up the proceedings of iPRES 2009 on their open access publishing platform, eScholarship.  The slides from the presentations have been available for a while on the <a href="http://www.cdlib.org/services/uc3/iPres/confsched.html">website</a>.</p>
<p>Here  are some 2009 papers that have been influential in my own thinking  since the conference:</p>
<ul>
<li>Maureen Pennock and Richard Davis: <a href="http://escholarship.org/uc/item/7zs156mb">ArchivePress: A Really Simple Solution to Archiving Blog Content</a>, which inspired me to write a paper about blog preservation in general. (coming soon!)</li>
<li>Emmanuelle Bermès and Louise Fauduet: <a href="http://escholarship.org/uc/item/6bt4v3zs">The Human Face of Digital Preservation: Organizational and Staff Challenges, and Initiatives at the Bibliothèque nationale de France</a>.  A really excellent reflection by France&#8217;s national library, BnF, on their transition to a more digital existence and the implications for the library staff in terms of training and organization.</li>
<li>Tyler Walters, Liz Bishoff, Emily B. Gore, Mark Jordan, and Thomas C. Wilson: <a href="http://escholarship.org/uc/item/38g232wc">Distributed Digital Preservation: Technical, Sustainability, and Organizational Developments.</a> This panel relayed their experiences in participating in private LOCKSS systems with geographically distributed institutions.  Really great for looking at benefits and challenges of joining a network versus going solo.  (Also, I&#8217;m taking a course from Tyler Walters this semester, which is an additional bonus.)</li>
</ul>
<p>The entire proceedings are  available <a href="http://escholarship.org/uc/cdl_ipres09">here</a> for free, individual downloading.</p>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/03/17/ipres-and-02010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/03/ipres20101.jpg" medium="image">
			<media:title type="html">ipres2010</media:title>
		</media:content>
	</item>
		<item>
		<title>Briefly Exploring Digitization</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/02/05/briefly-exploring-digitization/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/02/05/briefly-exploring-digitization/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 01:11:38 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Digitization]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=268</guid>
		<description><![CDATA[To be clear, digitization and digital preservation are not the same thing.  Digitization is the process of making digital copies of physical items.  Digital preservation refers to the activities associated with maintaining the viability of, and access to, digital files over time.  Thus, the activities of digitization will result in things that can be (need [...]]]></description>
			<content:encoded><![CDATA[<p>To be clear, digitization and digital preservation are not the same thing.  Digitization is the process of making digital copies of physical items.  Digital preservation refers to the activities associated with maintaining the viability of, and access to, digital files over time.  Thus, the activities of digitization will result in things that can be (need to be?) included in a digital preservation project.</p>
<div id="attachment_271" class="wp-caption alignright" style="width: 250px"><a href="http://easydigitalpreservation.files.wordpress.com/2010/02/book-scanner.jpg"><img class="size-full wp-image-271 " title="book scanner" src="http://easydigitalpreservation.files.wordpress.com/2010/02/book-scanner.jpg?w=240&#038;h=180" alt="Automated book scanner" width="240" height="180" /></a><p class="wp-caption-text">Automated book scanner</p></div>
<p>I like to think about all of the large scale book scanning that is happening.  Massive amounts of digital files are being created from physical books.  If these files aren&#8217;t taken care of properly, then over time they will become unusable&#8230;and all the news coverage of the Google Books settlement will seem like a laughable waste of time.</p>
<p>This semester, I&#8217;m taking a course specifically on Digitization.  One of the first questions posed to us was whether or not a physical item (book, document, etc.) that has been digitized can be discarded.  Now, I am aware of the intrinsic value that a book hand-printed in the 1500s has, but in my response, I chose to focus on the intellectual (or informational) content.  This question made me think about how intricately tied digitization projects should be to digital preservation programs.  Why would we risk the total loss of an item&#8217;s content by relying on a digital version of it that is not receiving any stewardship after the physical copy has been tossed?  So here are my conditions for tossing a physical copy once it&#8217;s been digitized.</p>
<ul>
<li>The digitized copy should be of preservation quality, meeting (what seems to be) the non-standardized requirements of 600+dpi, TIFF file format, etc.</li>
<li>The organization charged with keeping the digital file of the digitized item should have a solid and reliable digital preservation program in place.  In a successful digital preservation program, the issues related to file format obsolescence, file corruption, and crashed hard drives will be nullified, as the program should account for these disasters ahead of time and be ready with plans to prevent such events.  Under this condition, it is safe to assume the analog copy would no longer need to be retained since its informational content is safe in a digital format.</li>
<li>Access to the digitized copy must be equal to or greater than the access that was allowed with the physical copy.  Preferably, access should be increased, as the new format enables more avenues of access, by nature.  As Rieger (2007) points out, the investments made in large scale digitization initiatives to aggregate and store digitized collections are huge.  &#8220;Such investments will be more worthwhile if discovery, access, and delivery are given equal emphasis.&#8221;  The argument could be made that increased access to content is as much of a justification for digitization as are any reasons associated with preservation of the content of the physical item.</li>
<li>Finally, it must be determined that the physical copy of a digitized item has no other value than what can be conveyed through its digital copy.  If the physical item is valuable for more than its informational content, then perhaps discarding it after it has been digitized is not a reasonable option.</li>
</ul>
<p><em>Rieger, Oya, <em>Preservation in the Age of Large-Scale Digitization</em> (DRAFT). Washington, DC: Council on Library and Information Resources, 2007. <a href="http://www.clir.org/pubs/abstract/pub141abst.html">http://www.clir.org/pubs/abstract/pub141abst.html</a>.</em></p>
<h5>Book scanning photo by <a href="http://www.flickr.com/photos/cogdog/">cogdogblog</a> on Flickr, Creative Commons Attribution 2.0 Generic license.</h5>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/02/05/briefly-exploring-digitization/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2010/02/book-scanner.jpg" medium="image">
			<media:title type="html">book scanner</media:title>
		</media:content>
	</item>
		<item>
		<title>Copyright and Digital Preservation</title>
		<link>http://easydigitalpreservation.wordpress.com/2010/01/06/copyright-and-digital-preservation/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2010/01/06/copyright-and-digital-preservation/#comments</comments>
		<pubDate>Wed, 06 Jan 2010 16:41:33 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Standards]]></category>
		<category><![CDATA[Copyright]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=246</guid>
		<description><![CDATA[Before you read any further, please note that I am not an expert in copyright law&#8230;by a long shot.  What you&#8217;ll find below is a discussion about how copyright law affects digital preservation as I understand it.  Copyright law is very complex, especially in regards to dealing with the &#8220;new&#8221; issues presented by the digital [...]]]></description>
			<content:encoded><![CDATA[<p>Before you read any further, please note that I am not an expert in copyright law&#8230;by a long shot.  What you&#8217;ll find below is a discussion about how copyright law affects digital preservation <em>as I understand it</em>.  Copyright law is very complex, especially in regards to dealing with the &#8220;new&#8221; issues presented by the digital environment.  Hopefully you will find the references I have listed below, and the items on the Resources page, useful in getting started.</p>
<h4>The Problem with Copyright:</h4>
<p>The absolute biggest barrier that copyright presents to preserving digital materials is the copyright owner&#8217;s exclusive right to reproduce and adapt a work.  Making copies of digital items and adapting them in various ways are generally the first steps of preservation &#8212; think of making copies to back things up, and the act of making changes to digital objects during digital format migrations.</p>
<p>Another impediment to digital preservation efforts is the set of dissemination restrictions that copyright law upholds.  Digital preservation is closely tied to access, yet this main goal of any preservation effort is restricted by current copyright law.  The glory of digital items is that they can theoretically be accessed from anywhere, and by multiple simultaneous users.  But copyright law hasn&#8217;t quite caught up to accommodate the digital environment and allow us to (legally) use and preserve digital items in the full capacity that the medium allows.</p>
<p>Determining the duration of copyright is somewhat confusing since it depends on when the work was created (or in some cases, when it was published versus when it was created).  Various acts of legislation over many years complicate the law because they have resulted in different copyright durations and renewal lengths.  <a href="http://www.bitlaw.com/copyright/duration.html">Bitlaw</a> provides a concise write up for the summary-inclined among us.</p>
<h4>Exceptions to Copyright Law:</h4>
<p>Libraries and archives follow the copyright provisions laid out by Section 108 of Title 17 (The Copyright Law) of the US Code (available <a href="http://www.copyright.gov/title17/">here</a>).  Libraries and archives are strong candidates for hosting digital preservation initiatives, so that&#8217;s why I&#8217;m focusing on them.  If the library or archive making the copies is open to the public or allows access to researchers from non-affiliated institutions, then it is <em>not </em>an infringement to make copies for preservation or replacement purposes under the conditions that:</p>
<ul>
<li>the item is currently held in the collections</li>
<li>the item &#8220;is damaged, deteriorating, lost, or stolen [not good for digital items, as it will be too late once damage has occurred] or if the existing format in which the work is stored has become obsolete.&#8221;</li>
<li>the copy is <em>not distributed in a digital format </em>outside the walls of the library (italics added to emphasize the impracticality of this rule)</li>
</ul>
<p>Additionally, libraries and archives are allowed to make up to three copies of unpublished works for preservation purposes, and up to three copies of published works for replacement purposes.  So, even with the compromises made for libraries in Section 108, there are problematic implications for digital preservation.  Since digital preservation is so closely tied to accessibility, libraries would be extremely limited in how they can preserve &#8211; and then share &#8211; digital material.</p>
<p>There is hope; people are aware of these limitations.  In March 2008, the Section 108 Study Group <a href="http://www.loc.gov/today/pr/2008/08-063.html">released a report</a> of suggestions to improve Section 108 and advance it into a more digitally-oriented mindset.  These suggestions include allowing copies of works to be made prior to damage or loss; allowing copies of publicly accessible websites to be made, with an opt-out option (see the Internet Archive in the following section); and lifting the three-copy preservation or replacement limit.</p>
<p>And let&#8217;s not forget about Fair Use.  I won&#8217;t get into it deeply here, but it&#8217;s a doctrine within Title 17 (Section 107) that actually limits the copyright holder&#8217;s exclusive rights.  It allows people to reproduce parts of copyrighted works.  It is a vague and subjective doctrine, and seems to be more of a defense against infringement lawsuits than a right.</p>
<h4>An Aside about Copyright and the Web:</h4>
<p>There are many web-archiving projects, the resulting files of which will need to be included in preservation processes.  Like content that is created off the web, web-based content is also protected under copyright law unless it is stated otherwise.  The Internet Archive&#8217;s approach to harvesting web content for archiving is to collect everything from which its crawlers are not excluded, and to provide an opt-out policy for anyone who specifically does not want to be included.  While the legality of this method is up for debate, the Internet Archive has avoided many infringement suits via their &#8220;willingness to respect the wishes of those copyright owners who want to limit and control the reproduction of their copyrighted works&#8221; (<a href="http://fairuse.stanford.edu/commentary_and_analysis/2003_11_hirtle.html">Hirtle, 2003</a>).</p>
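<p>As a rough illustration of the &#8220;collect unless excluded&#8221; approach described above, here is a minimal Python sketch that checks a site&#8217;s crawler exclusion rules before harvesting a page.  It uses the standard library&#8217;s <code>urllib.robotparser</code>.  The sample rules, the crawler names, the example.org URLs, and the <code>may_harvest</code> helper are all assumptions made for illustration; this is not a description of how the Internet Archive&#8217;s own crawler is implemented.</p>

```python
from urllib.robotparser import RobotFileParser

# Hypothetical exclusion rules a site owner might publish to opt out.
# The crawler name "ia_archiver" is used here only as an example value.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""


def may_harvest(robots_txt: str, crawler_name: str, url: str) -> bool:
    """Return True if the exclusion rules do not bar this crawler from the URL."""
    parser = RobotFileParser()
    # parse() takes the rules as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(crawler_name, url)


if __name__ == "__main__":
    # The site owner has opted out of this crawler entirely.
    print(may_harvest(ROBOTS_TXT, "ia_archiver", "http://example.org/post"))
    # Another crawler is only kept out of /private/.
    print(may_harvest(ROBOTS_TXT, "other-bot", "http://example.org/post"))
    print(may_harvest(ROBOTS_TXT, "other-bot", "http://example.org/private/x"))
```

<p>A check like this only honors the technical opt-out; it says nothing about whether a given copy is legally permitted.</p>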
<p>Since the introduction of Web 2.0, web content on a given web page may also have more than one creator.  So, obtaining copyright permission for preservation purposes may be more challenging than contacting one person.  In the case of blogs, for example, blog writers do not own the copyright to comments other people have left (Biederman &amp; Andrews, 2008).  To take this one step further up the difficulty scale, think of the challenges introduced by anonymous comments with no clear author.</p>
<h4>Additional Restrictions:</h4>
<p>Outside of the general US copyright law that is applied to a work, we must also take into consideration the licensing restrictions that may be associated with subscription materials.  These will likely have their own rules and implications for preservation, especially in light of the Digital Millennium Copyright Act (DMCA).  The DMCA prohibits &#8220;<span style="font-family:verdana,arial,helvetica,sans-serif;">circumventing technological access controls to obtain access to copyrighted works,&#8221; meaning if access to the work is password-protected, you cannot create a work-around to allow others to get to it (<a href="http://www.clir.org/pubs/reports/pub112/contents.html">Besek, 2003</a>).</span></p>
<h4>No Real Precedents:</h4>
<p>Finally, I think another basic challenge with copyright is that there are no precedents for many of the issues that digital preservation activities bring to the surface.  This is especially true with regard to the Fair Use exemptions, which are judged on a subjective basis.  The Fair Use exemption could be a saving grace for preservation activities, but until it has proven to be so in an infringement challenge or lawsuit, it is a very big risk to assume that this can be the case for all instances.  It&#8217;s likely not the right preservation decision to wait until copyright law catches up with the needs of our digital environment.  So&#8230;who wants to try first?</p>
<h5>Biederman, C. J., &amp; Andrews, D. (2008, May 1). Applying copyright law to user-generated content. <em>Los Angeles Lawyer</em>, 12.</h5>
<h5>Besek, J. (Jan 2003). Copyright issues relevant to the creation of a digital archive: A preliminary assessment.  CLIR.  Retrieved Jan 5, 2010 from http://www.clir.org/pubs/reports/pub112/contents.html</h5>
<h5>Hirtle, P.B. (2003).  Digital preservation and copyright.  Retrieved Jan 5, 2010 from http://fairuse.stanford.edu/commentary_and_analysis/2003_11_hirtle.html</h5>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2010/01/06/copyright-and-digital-preservation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>
	</item>
		<item>
		<title>A Budding Branch?</title>
		<link>http://easydigitalpreservation.wordpress.com/2009/12/08/a-budding-branch/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2009/12/08/a-budding-branch/#comments</comments>
		<pubDate>Wed, 09 Dec 2009 04:44:33 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=234</guid>
		<description><![CDATA[This evening I sat in on a lecture given by John Phillips, a Management Consultant at Information Technology Decisions.  John was giving an overview of what he saw as the similarities and differences between the three main branches of information management professionals: librarians, archivists, and records managers.  What was not included in this list were [...]]]></description>
			<content:encoded><![CDATA[<p>This evening I sat in on a lecture given by <a href="http://www.state.sc.us/scdah/BioJTPITDv6.htm">John Phillips</a>, a Management Consultant at Information Technology Decisions.  John was giving an overview of what he saw as the similarities and differences between the three main branches of information management professionals: librarians, archivists, and records managers.  What was not included in this list were digital preservationists.</p>
<p>Now, as someone who is not actually working in the field, I may be remiss in assuming that digital preservation has yet earned professionals who carry that title.  But I think if this is not yet the case, then it certainly will be in the future&#8230;once it becomes clear that professionals from the other three branches of information management cannot all be expected to have expert-level knowledge of digital preservation practices&#8230;which will become clear because everyone in information management really needs to start thinking about technological obsolescence.</p>
<p>The point of this post, though, is to point out a major correlation between records managers and what I would be inclined to think of as digital record preservationists.  As John pointed out, records managers differ from librarians and archivists because, <strong>1)</strong> they tend to work in business or corporate environments, and <strong>2) </strong>they are OK with &#8211; and are expected to &#8211; throw things away after they are no longer of value to the owning organization.</p>
<div id="attachment_237" class="wp-caption alignleft" style="width: 268px"><a href="http://easydigitalpreservation.files.wordpress.com/2009/12/cans2.jpg"><img class="size-medium wp-image-237" title="cans2" src="http://easydigitalpreservation.files.wordpress.com/2009/12/cans2.jpg?w=258&#038;h=171" alt="" width="258" height="171" /></a><p class="wp-caption-text">photo by Sebastiano Pitruzzello</p></div>
<p>Upon an item&#8217;s accession into a repository, records managers will assess the value of an object, and then revisit that assessment later on in the course of retention decisions.  If the item is no longer worth keeping, it is discarded.  This is also the (theoretical) case with digital preservationists.  In digital preservation, <a href="http://wp.me/pArq0-g">OAIS</a>-type repositories are intended to preserve digital items for as long as those items are of value to their designated communities.  This implies that at some point, a digital item may no longer have value, and therefore continued preservation efforts for that item are not economically justified.  Throwing things away is a dirty job, but just as we can&#8217;t possibly collect everything out there, perhaps we can&#8217;t keep it all, either.</p>
<p>But let&#8217;s not discredit those clingy librarians.  John gave an interesting guesstimate regarding the types of repositories each group of information professionals works with.  Among records managers, archivists, and librarians, librarians deal with the highest proportion of electronic to physical records.  (John&#8217;s guesstimate was 40% electronic / 60% physical, in comparison to IT professionals, who are 100% electronic by nature.)  The numbers for records managers were 30% electronic / 70% physical, which is still quite a lot of paper to be dealing with.<br />
So if librarians are handling the highest proportion of electronic items out of these three groups, we can make a big case for libraries to be the battlegrounds for creating leaders in digital preservation.  Technological and file format obsolescence will hit libraries the hardest if these numbers are accurate.  As contenders with the most to lose, libraries are poised to harbor the most institutional support for digital preservation initiatives&#8230;and perhaps spawn the fourth major branch of information professionals.</p>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2009/12/08/a-budding-branch/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2009/12/cans2.jpg?w=300" medium="image">
			<media:title type="html">cans2</media:title>
		</media:content>
	</item>
		<item>
		<title>PREMIS for Preservation Metadata</title>
		<link>http://easydigitalpreservation.wordpress.com/2009/10/30/premis-for-preservation-metadata/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2009/10/30/premis-for-preservation-metadata/#comments</comments>
		<pubDate>Sat, 31 Oct 2009 03:48:40 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PREMIS]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=211</guid>
		<description><![CDATA[The OAIS reference model calls for digital files to have preservation metadata, or &#8220;preservation descriptive information.&#8221;  This preservation metadata would outline the significant technical and historical (think format migrations, etc.) information about a given digital file that will be useful for the effective preservation of that file. PREMIS is intended to help institutions produce this [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://easydigitalpreservation.wordpress.com/2009/07/29/oais-reference-model-part-i-background-and-influence/">OAIS</a> reference model calls for digital files to have preservation metadata, or &#8220;preservation descriptive information.&#8221;  This preservation metadata would outline the significant technical and historical (think format migrations, etc.) information about a given digital file that will be useful for the effective preservation of that file.<br />
<a href="http://www.loc.gov/standards/premis/index.html">PREMIS</a> is intended to help institutions produce this metadata.</p>
<p>PREMIS is several things:  First, it&#8217;s an acronym for Preservation Metadata: Implementation Strategies.  Second, it&#8217;s a very active working group with members from many countries and institutions who are trying to answer how to best describe digital files for preservation purposes.  They are often involved in periodic informational workshops.  Third, it is a very detailed description of a metadata scheme that is available in the PREMIS Data Dictionary (<a href="http://www.loc.gov/standards/premis/v2/premis-2-0.pdf">PDF</a>), which is what I&#8217;m going to be referring to for the rest of this post.  Version 2.0 of the Data Dictionary was released in 2008.</p>
<p>Put simply, the PREMIS Data Dictionary provides a clear guide to what specific information needs to be known about a digital collection and its individual objects in order to best support any digital preservation activities.  Following the PREMIS guidelines would result in a specific and formulaic set of metadata for preservation purposes.  PREMIS also attempts to create a standardized set of preservation metadata, which would strengthen communication between management teams of different repositories, and also allow for the easy sharing and interoperability of this metadata with other PREMIS-conformant repositories.</p>
<p>The PREMIS metadata structure is based on filling in a lot of blanks which are specific to each file to be preserved.  There are five categories (called entities) of blanks to complete:</p>
<div id="attachment_209" class="wp-caption alignright" style="width: 100px"><img class="size-medium wp-image-209 " title="premis" src="http://easydigitalpreservation.files.wordpress.com/2009/10/premis.jpg?w=90&#038;h=117" alt="PREMIS Data Dictionary cover" width="90" height="117" /><p class="wp-caption-text">PREMIS Data Dictionary</p></div>
<ul>
<li> Intellectual Entities</li>
<li> Objects</li>
<li> Rights</li>
<li> Agents</li>
<li> Events</li>
</ul>
<p>The blanks to be filled in under each of these entities are referred to as <em>semantic units</em>, and they were identified by the PREMIS team.  The semantic unit entries for each digital file are very specific pieces of information that are important to the preservation process.  The PREMIS Data Dictionary presents and describes all of these semantic units for each entity.</p>
<blockquote><p><span style="color:#808080;">Some examples of a digital file&#8217;s potential semantic units would include:<br />
-the program on which the file was created<br />
-the version of that program<br />
-the operating system on which that program ran<br />
-who created the file<br />
-the rights associated with the file<br />
-when the file was ingested into the preservation system<br />
-dates the file was validated<br />
-and so on.</span></p></blockquote>
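<p>To make the &#8220;blanks&#8221; idea a little more concrete, here is a minimal Python sketch that models the example semantic units listed above as fields on a record.  The class names, field names, and sample values are all hypothetical and chosen for readability; they are not the official semantic unit names from the Data Dictionary, and a real PREMIS record holds far more than this.</p>

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class PreservationEvent:
    """One thing that happened to the file, such as ingestion or validation."""
    event_type: str
    event_date: date


@dataclass
class DigitalObjectRecord:
    """Illustrative 'blanks' for a single digital file (names are hypothetical)."""
    identifier: str
    creating_application: str   # the program on which the file was created
    application_version: str    # the version of that program
    operating_system: str       # the operating system that program ran on
    creator: str                # who created the file
    rights_statement: str       # the rights associated with the file
    events: List[PreservationEvent] = field(default_factory=list)

    def dates_of(self, event_type: str) -> List[date]:
        """Return the dates of every recorded event of the given type."""
        return [e.event_date for e in self.events if e.event_type == event_type]


# Sample values below are invented purely for illustration.
record = DigitalObjectRecord(
    identifier="example-0001",
    creating_application="Example Word Processor",
    application_version="3.1",
    operating_system="Example OS 10",
    creator="J. Doe",
    rights_statement="In copyright; preservation copies permitted",
    events=[
        PreservationEvent("ingestion", date(2009, 10, 1)),
        PreservationEvent("validation", date(2009, 10, 15)),
        PreservationEvent("validation", date(2010, 4, 15)),
    ],
)

if __name__ == "__main__":
    print(record.dates_of("validation"))
```

<p>Notice that the ingest and validation dates are stored as events attached to the file rather than as single fields, which loosely mirrors the way PREMIS separates the Objects and Events entities.</p>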
<p>It is very detailed, and I&#8217;d really just recommend that you flip through it, unless you are the one responsible for implementing it.  It is so detailed, in fact, that it might even be a good place to start when you are in the beginning stages of developing a digital preservation program, as it will tell you the kind of preservation information that is important to collect.</p>
<p>I can definitely understand that there is quite a large learning curve for using PREMIS at your institution.  There is a lot of information and implementation training to go through, and it is likely that such training would be unprecedented.  (Not to mention that your whole digital preservation program is likely unprecedented at your institution!)  Perhaps, though, learning PREMIS can be approached just as the learning of other new metadata schemas has been approached in the past.</p>
<p><img class="alignleft size-medium wp-image-210" style="border:1px solid black;" title="suitcases" src="http://easydigitalpreservation.files.wordpress.com/2009/10/suitacases.jpg?w=212&#038;h=140" alt="Suitcases" width="212" height="140" /></p>
<p>To wrap up, I&#8217;d like to say that the good news is that PREMIS is designed so that much of this preservation metadata can be collected automatically!  It is also important to know that the PREMIS Data Dictionary is supported by an XML schema.  This is relevant because it allows PREMIS records to be shared or transferred between preservation systems&#8230;which has excellent implications for cross-institutional cooperation and collaboration.  I hope that once my understanding of these processes grows, I will be able to share it in a future post (Update: See my post on <a href="http://easydigitalpreservation.wordpress.com/2010/06/30/mets-for-transferable-metadata/">METS</a>).</p>
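<p>To give a feel for why an XML representation helps with sharing, here is a small Python sketch in which one system writes a fragment of object metadata as XML and another reads a value back out.  The element names are modeled on semantic unit names from the Data Dictionary, but this is a simplified assumption: it omits namespaces and required elements, so the output is not a schema-valid PREMIS record.  The function names and sample values are hypothetical.</p>

```python
import xml.etree.ElementTree as ET


def object_to_xml(identifier_type, identifier_value, format_name, format_version):
    """Serialize a few object-level values to a simplified, PREMIS-like XML string."""
    obj = ET.Element("object")

    ident = ET.SubElement(obj, "objectIdentifier")
    ET.SubElement(ident, "objectIdentifierType").text = identifier_type
    ET.SubElement(ident, "objectIdentifierValue").text = identifier_value

    chars = ET.SubElement(obj, "objectCharacteristics")
    fmt = ET.SubElement(chars, "format")
    designation = ET.SubElement(fmt, "formatDesignation")
    ET.SubElement(designation, "formatName").text = format_name
    ET.SubElement(designation, "formatVersion").text = format_version

    return ET.tostring(obj, encoding="unicode")


def read_format_name(xml_text):
    """What a receiving system might do: parse the XML and pull out one value."""
    root = ET.fromstring(xml_text)
    return root.findtext("objectCharacteristics/format/formatDesignation/formatName")


if __name__ == "__main__":
    # Sample values are invented for illustration.
    xml_text = object_to_xml("local", "example-0001", "image/tiff", "6.0")
    print(xml_text)
    print(read_format_name(xml_text))
```

<p>Because both sides agree on the element names and nesting, the receiving system needs no knowledge of how the sending system stores its metadata internally.</p>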
<p>For further exploration, the Library of Congress has a non-intimidating page full of resources for everything PREMIS.  This includes an overview by Priscilla Caplan (<a href="http://www.loc.gov/standards/premis/understanding-premis.pdf">PDF</a>) and a <a href="http://www.loc.gov/standards/premis/tutorials.html">tutorial</a> that is much more in-depth than this post.  And finally, an open PREMIS implementation fair was held earlier this month, and the presentation slides are posted <a href="http://www.loc.gov/standards/premis/premis-implementation-fair2009.html">here</a>.</p>
<h3>Sample PREMIS records (Updated July 2010)</h3>
<p>I&#8217;ve noticed that many people arrive at this post by searching for an example of a PREMIS record.  I originally didn&#8217;t include one, but I want to do so now.  The two links below represent segments of a single PREMIS record provided by the Library of Congress.  You&#8217;ll see the semantic units affiliated with each entity (i.e. the &#8220;blanks&#8221; that need to be filled in for each category of metadata within the PREMIS record, as defined by the Data Dictionary).  The two examples pertain to the same digital object, which is a portrait of Louis Armstrong, viewable <a href="http://lcweb2.loc.gov/cocoon/test-ihas/loc.natlib.gottlieb.09601/default.html">here</a>.</p>
<ul>
<li>This first link will take you to the PREMIS information for the <a href="http://www.loc.gov/standards/premis/ObjectsExercise-Rome.pdf">Object entity</a>.  The semantic units are listed, and the &#8220;Value&#8221; column contains the information that has been entered about the characteristics of this specific digital object.</li>
<li>This second link will take you to the metadata for the <a href="http://">Events entity</a> of this same digital object.  The &#8220;Value&#8221; column will again contain the institutionally-added information, this time about actions and preservation events related to this specific digital object.</li>
</ul>
<p>To come back to the bigger picture, keep in mind that these two links only represent segments of what would be a single PREMIS record.  As listed above, there are 5 separate entities, each with a slew of semantic units to hold specific information about the object.</p>
<h6>Suitcase photo by masochismtango on <a href="http://www.flickr.com/photos/masochismtango/">Flickr</a>, Creative Commons Attribution-Share Alike 2.0 Generic license.</h6>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2009/10/30/premis-for-preservation-metadata/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2009/10/premis.jpg?w=231" medium="image">
			<media:title type="html">premis</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2009/10/suitacases.jpg?w=300" medium="image">
			<media:title type="html">suitcases</media:title>
		</media:content>
	</item>
		<item>
		<title>Why There is No Single Preservation Strategy</title>
		<link>http://easydigitalpreservation.wordpress.com/2009/10/01/why-there-is-no-single-preservation-strategy/</link>
		<comments>http://easydigitalpreservation.wordpress.com/2009/10/01/why-there-is-no-single-preservation-strategy/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 17:26:41 +0000</pubDate>
		<dc:creator>M.Amaral</dc:creator>
				<category><![CDATA[Standards]]></category>

		<guid isPermaLink="false">http://easydigitalpreservation.wordpress.com/?p=190</guid>
		<description><![CDATA[The following are some thoughts that I had about why there isn&#8217;t one digital preservation strategy that can be applied to all digital preservation programs.  As wonderful as it would be to find one standardized solution that fits everyone&#8217;s needs, it&#8217;s essentially an impossibility.  What&#8217;s below is something I wrote up for some coursework, but [...]]]></description>
			<content:encoded><![CDATA[<p>The following are some thoughts that I had about why there isn&#8217;t one digital preservation strategy that can be applied to all digital preservation programs.  As wonderful as it would be to find one standardized solution that fits everyone&#8217;s needs, it&#8217;s essentially an impossibility.  What&#8217;s below is something I wrote up for some coursework, but I thought I&#8217;d share it here, too.</p>
<p>There are two ways to answer the question of why there is no universally applicable digital preservation strategy.  The first is at the institutional level, and the second is at the level of the digital objects intended for preservation.</p>
<p>Digital preservation efforts so far have been tied to institutions interested in maintaining access to digital objects over time.  Being tied to an institution for funding and support will come with governance, policies, administrations, departmental units, stakeholders to please, and service missions which will all be very specific to the institution.  These factors will all be guiding principles in the way a digital preservation strategy will be created at a given institution.  And this is fine; <a href="http://www.jisc.ac.uk/media/documents/publications/digitalpreservationbp.pdf">Pennock (2006)</a> even goes so far as to state that &#8220;digital preservation policies are <strong>most effective</strong> when integrated into the overall organisational policy framework.&#8221;  But this would prevent a universal digital preservation model from being possible simply due to all the &#8220;personalizations&#8221; that would need to take place in order to meet the needs of the institution as well as the <img class="size-medium wp-image-195 alignleft" style="border:1px solid black;" title="bubbles" src="http://easydigitalpreservation.files.wordpress.com/2009/10/2908186658_055fb448ba.jpg?w=236&#038;h=158" alt="bubbles" width="236" height="158" />capabilities allowed by whatever funding is available.</p>
<p>The second way to answer the question of why there is not a single digital preservation method that can be applied to everything is at the level of the digital objects.  When it comes to determining the actual preservation method, some ways work better than others depending on the type of file at hand, and the needs associated with that individual file.  For example, <a href="http://jeffrey.famvdhoeven.nl/dd/Researchtask%20IBM%20TU%20Delft%20-%20J.R.%20van%20der%20Hoeven.pdf">van der Hoeven (2004)</a> points out that migration is an effective preservation method for widely supported file formats, but it might not be good for files that must maintain high levels of authenticity.  He even goes on to state that &#8220;&#8230;no one size fits all solution is possible.  Digital documents differ from each other in too many ways and are used for many different purposes by many different users.&#8221;<br />
If we are to come up with an effective digital preservation strategy (at both the institutional and document levels), we must remain aware of the options, and expect to employ more than one method, strategy, and tool set.</p>
<h5>Pennock, M. (2006).  &#8220;JISC Briefing paper: digital preservation, continued access to authentic digital assets.&#8221;  Retrieved September 30, 2009 from http://www.jisc.ac.uk/media/documents/publications/digitalpreservationbp.pdf</h5>
<h5>van der Hoeven, J. R. (2004). “Permanent Access Technology for the virtual heritage.”  Retrieved September 30, 2009 from http://jeffrey.famvdhoeven.nl/dd/Researchtask%20IBM%20TU%20Delft%20-%20J.R.%20van%20der%20Hoeven.pdf</h5>
<h6>Photo by <a href="http://www.flickr.com/people/tambako/">Tambako the Jaguar</a> under a Creative Commons Attribution-No Derivative Works 2.0 Generic license.</h6>
]]></content:encoded>
			<wfw:commentRss>http://easydigitalpreservation.wordpress.com/2009/10/01/why-there-is-no-single-preservation-strategy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/95c267667946ad011c97519d855e8f1d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">M. Amaral</media:title>
		</media:content>

		<media:content url="http://easydigitalpreservation.files.wordpress.com/2009/10/2908186658_055fb448ba.jpg?w=300" medium="image">
			<media:title type="html">bubbles</media:title>
		</media:content>
	</item>
	</channel>
</rss>
