Archiving the Web: How to Support Research of Future Heritage?
Transcription
Archiving the Web: How to Support Research of Future Heritage?
Archiving the Web: How to Support Research of Future Heritage? NWO-CATCH Meeting, hosted by WebART April 19, 2013 William G. LeFurgy Library of Congress @blefurgy LAMs… GLAMs! We Are in The Big Data Business Focus from Items to Aggregates Born Digital Cultural Heritage Data Challenge Infrastructure Improvement: Perpetually Iterative White House RFI Input Instructive • Request for Information on Public Access to Federally Funded Scientific Research Data, Nov. 2011 • Interested organizations comment on how best to ensure long-term stewardship and public access • Input provided to inform development of agency policies and standards for managing big data Summary of Responses • 118 individual responses – 50% from academic research departments, professional organizations – 35% from libraries, repositories and allied organizations – 10% from publishers and commercial organizations – 5% other • Excellent (unstructured!) data set to analyze current thinking on big data stewardship Top-Level Policy Recommendations • Remarkable degree of congruence among comments – Broadly allocate resources for data stewardship – Extend national digital stewardship infrastructure – Institute and enforce a data preservation mandate – Encourage policies to support secondary use • But… conflicted about IP, copyright, privacy Need: Path to Greater Resources • Funders to include money in awards for data stewardship • Need cost models, other guidance for estimating data life cycle costs • Allocate expanded resources to support national data repositories Need: National Digital Stewardship Infrastructure • Leverage current institutional efforts to define best practices, tools, services • Extend community of practice for data stewardship through collaborative action across disciplines • Develop a skilled workforce with data stewardship expertise Support: Secondary Use, Respect for Data • Broadly apply a citation mechanism for data sets (e.g., DataCite, DOIs) • Criteria for evaluating grant applications tied to secondary use of data • Give equal credit for publishing articles and data sets • Develop robust metrics to track data publication and use Update How We Feel About Our Work How Some Perceive Our Work “If you go down to the British Library today, you‟re sure of a big surprise.” “„Capturing the nation‟s digital memory‟ – that‟s the phrase they are using about the venture. Your first response might be: …” „„The internet archives itself, doesn‟t it? It‟s called Google.‟ People Love Libraries (In Principle) How is Our Brand Different? Trustworthy! Unique! Informed! Helpful! Innovative! Promoting Web Archiving at Library of Congress • Post on our blog, The Signal • Tweets via @ndiipp • Media mentions : Mashable, The Economist, New York Times • Active involvement in IIPC; supporting theme for this year’s meeting, Scholarly Access to Web Archives: Progress, Requirements, and Challenges Library of Congress Digital Preservation YouTube Channel Cultivating Brand Awareness “A city’s conserved historic core can also differentiate that city from competing locations—branding it nationally and internationally—thus helping the city attract investment and talented people.” Related Approaches Are Good, Too Researcher Engagement A new model for Research IT services, University of Melbourne Entwined: Branding and Supporting Cultural Heritage Data Research Credits-1 Slide 2: 1970s Glam Rock, http://www.flickr.com/photos/cutandchicvintage/8514461334/ Slide 3: Forging a Digital Roadmap: The Preservation, Curation and Stewardship Nexus, http://www.slideshare.net/lipscombb/forging-the-digital-roadmap-the-preservation-curation-and-stewardship-nexus Slide 6: Estimate of Internet users worldwide, http://www.educause.edu/ero/article/egranary-digital-library Slide 7: International Internet Preservation Consortium, http://netpreserve.org/ Slide 8: Company Information System, http://www.flickr.com/photos/jnicho02/2182824162/ Slide 9: Pearson Think Tank Publications, http://www.flickr.com/photos/louismm/; Enter the Matrix, http://www.flickr.com/photos/jurvetson/389673106/ Slide 10: Library Challenges in a 2.0 world, http://www.flickr.com/photos/etches-johnson/516140669/ Slide 11: Elliot computer, http://www.retronaut.com/2013/04/norwich-city-councils-first-computer-being-delivered/ Slide 12: Request for Information: Public Access to Digital Data Resulting From Federally Funded Scientific Research, http://ow.ly/ePB93; White House, DC, http://www.flickr.com/photos/22240293@N05/3735172478/ Slide 13: Your Comments on Access to Federally Funded Scientific Research Results, http://ow.ly/ePBb9 Slide 18: Cite: White House Memo, Increasing Access to the Results of Federally Funded Scientific Research, http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf Slide 19: Graphic: brand, http://www.flickr.com/photos/russelldavies/1360875397/ Slide 20: Graphic: opportunities, http://www.flickr.com/photos/mrvelocipede/2898098906/ Slide 21: Graphic: Pop!Tech 2007, http://www.flickr.com/photos/kk/2675666099/; Graphic: Pop!Tech 2007, http://www.flickr.com/photos/kk/2675678117/ Slide 22: Graphic: ?, http://www.flickr.com/photos/sublime_dharma/1912192266/; Graphic: bored, http://www.flickr.com/photos/greggoconnell/36973681/ Credits-2 Slide 23: Cite: http://blogs.spectator.co.uk/books/2013/04/the-british-library-goes-digital/ Slide 24: Cite: http://blogs.spectator.co.uk/books/2013/04/the-british-library-goes-digital/ Slide 25: Cite: http://blogs.spectator.co.uk/books/2013/04/the-british-library-goes-digital/ Slide 26: Cite and graphic: http://www.pewinternet.org/Presentations/2013/Apr/Greatest-Hits-from-Pew-InternetsLibrary-Research.aspx Slide 29: Cite and graphic: America's Young Archivists: The K-12 Web Archiving Program; http://www.youtube.com/watch?v=Gob7cjzoX3Y; Tim O'Reilly - Keynote for 2011 NDIIPP/NDSA Partners Meeting, http://www.youtube.com/watch?v=bC3qapPwcdI Slide 30: Cite and graphic: European Public Sector Information Platform, Topic Report No. 2012 /04, Open Data in Cultural Heritage Institutions, http://epsiplatform.eu/sites/default/files/Final%20TR%20Open%20Data%20in%20Cultural%20Heritage%20Instituti ons.pdf; Framework for the development of a brand strategy for The European Library, Europeana and Europeana Libraries, http://www.europeana-libraries.eu/documents/868553/9397e084-fb76-4624-94b6-278a8576fa7b; What is DPLA?, http://lj.libraryjournal.com/2013/04/future-of-libraries/whats-is-the-dpla/; Investing in Historical Assets for Sustainable Development, http://siteresources.worldbank.org/EXTSDNET/Resources/Economics_of_Uniqueness.pdf. Slide 31: Cite and graphics: Measuring the Impact of Digital Resources: The Balanced Value Impact Model http://www.kdcs.kcl.ac.uk/fileadmin/documents/pubs/BalancedValueImpactModel_SimonTanner_October2012.pdf; WebART, http://www.webarchiving.nl/home; IIPC, Web Archiving Use Cases, http://netpreserve.org/sites/default/files/resources/UseCases_Final_1.pdf; Altmetrics, http://www.digitalscience.com/products/altmetric/; Common Crawl, http://commoncrawl.org/. Slide 32: Cite and graphics, A new model for Research IT services, http://dfflanders.wordpress.com/2013/04/15/resbaz/ Slide 33: Graphic: Tendril #2, http://www.flickr.com/photos/blueridgekitties/7340012008/