

Thank You For more information: DuraSpace Organization: Wiki: DuraCloud project page: Next Steps(Feb-April) V.2 release complete Replication, web access and viewing, file format conversion, J2K image server, bit integrity checking Launch Fedora and DSpace plug ins V.3 release primary features Synchronization with local repository( Fedora and DSpace) Expand pilot in April to include 15 new users, to connect with current repositories Continue to test robustness and performance of commercial cloud partners

ANCESTRY J2K FILES SOFTWARE
Timeline Begin pilots– September 2009 DuraCloud Alpha Pilot release- Oct 2009 Pilot data loading and testing – Fall 2009 Beta for repository community – Q2 2010 Pilot testing with software services Q2 2010 Cloud partner evaluations complete-Q3 2010 Hosting service pricing and SLA’s complete-Q3 2010 Report pilot results – Q3 2010 Code available open source-Q3 2010 Launch production service Q4 2010
ANCESTRY J2K FILES ARCHIVE
WGBH Media Library and Archives Archive large video files Provide public access to streaming versions Transcode files in cloud Edit files where appropriate to sell clips Give third party access to cloud store for processing and access Use case: Provide backup preservation for video files from repository and other sources, and create derivative files for access and streaming.Ĭhallenges Provisioning bandwidth at local institution to transfer data Transferring large files over the wire ( over 5 GB is rejected, found issues in transfer over 1 GB) Consistency of operation of 2 nd tier providers (EMC, RackSpace) Enabling others to easily build on platform Best process for integration of 3 rd party applications into hosting service Cost effective bit integrity checking Balancing ease of use and more sophisticated functionality Īdvantages of hosted platform Strategic partnerships with cloud providers Better pricing Transparency Early notification Ease of implementation for end user Multiple copies in multiple geographies/administrations through one interface Access to broad number of services relevant to the repository community NYPL pilot -back up copy all TIFF images (10 TB data) -transformation from Tiff to JPEG 2000 using Imagemagick -run J2k image server in cloud -Push JPEG 2000 back into Fedora Repository Digital Gallery Collection Use case: back up online preservation copy to Fedora, file format transformationīHL pilot -back up copy entire corpus (40 TB data-JPEG, Tiff -have multiple copies including Europe -Run J2K image server in cloud BioDiversity Heritage Library Use case: Find the best cost competitive solution for keeping multiple copies in multiple geographies, easily accessible. Partners and Pilots Selected initial cloud providers Selected 3 initial pilot partners Preservation Services -ability to replicate content to multiple providers and locations -ability to synchronize backup with primary store or repository system -access to content through web based interface -ability to do bit integrity checking -ability to do file format transformations Services in the cloud for durable digital content DuraCloud Platform: Allow organizations to utilize cloud infrastructure easily offering data storage, data replication, preservation support and access services Institutional needs: managing digital collections Likely to use cloud services in next 12 months Key Challenges completed 145 participants higher ed Key Advantages completed 145 participants higher ed It is not being done well where it is not mission critical.” “ We have not invested enough to make it a service of which we are proud…” “ Collection development and storage are more important than computing. There is no system effort.” “ Where it is prioritized as mission critical, it is being done well. Implications for our future work more distributed more collaborative more web - oriented more open more interoperableĬhallenges (from survey ) Preservation support is hard to implement consistently “ Our preservation support is collection based where we have had grants or specific initiatives. DuraCloud - Open technologies and services for managing durable data in the cloudĭuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace “ Repositories in the Cloud” Seminar, Feb 2010
