<div class="field field-name-field-guest-author field-type-text field-label-hidden"><div class="field-items"><div class="field-item even">Stefana Breitwieser, Intern, Digital Services Division</div></div></div><div class="field field-name-body field-type-text-with-summary field-label-hidden"><div class="field-items"><div class="field-item even"><p>Websites are important records of institutional history, but they are also always being updated, redesigned, or taken down. How do we access important information from outdated versions of websites? The Archives is currently using <a href="https://www.archive-it.org/" target="_blank">Archive-It</a>, a tool created by the <a href="https://archive.org/index.php" target="_blank">Internet Archive</a>, to capture Smithsonian websites and social media accounts for future use. Archive-It uses a <a href="http://en.wikipedia.org/wiki/Web_crawler" target="_blank">crawler</a> - a program that browses the Internet like Google - to replicate a website at that specific moment. These “crawls” are later accessible using the <a href="https://archive.org/web/">Wayback tool</a>. While the research potential for these crawls is enormous, two areas stand out in particular; to document the evolution of website features and to capture public participation during a specific event or program through social media.</p> <p><a href="https://wayback.archive-it.org/3340/20140625164626/http:/invertebrates.si.edu/echinoderm/" target="_blank"><img src="http://siarchives.si.edu/sites/default/files/styles/body-image-450/public/blog-attached-images/IZ_Newsletter.jpg?itok=TJ843600" alt="A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives." title="A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives." width="450" height="249" /></a></p> <p>Crawls show the progress of how technology is used and how websites have evolved over time. Above and below, we have two examples from the National Museum of Natural History (NMNH). This is the <a href="http://invertebrates.si.edu/echinoderm/" target="_blank">Virtual Echinoderm Newsletter</a>, which was last updated in 2002. Though it may seem simplistic to us today, this is very representative of a typical website from the early 2000s. </p> <p><a href="https://wayback.archive-it.org/3340/20140625164626/http://invertebrates.si.edu/echinoderm/" target="_blank"><img src="http://siarchives.si.edu/sites/default/files/styles/body-image-450/public/blog-attached-images/IZ_Newletter_2.jpg?itok=fxoguRam" alt="A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives." title="A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives." width="450" height="253" /></a></p> <p>Fast-forward to 2014: With the new <a href="http://humanorigins.si.edu/" target="_blank">Human Origins Initiative website</a>. We have a slideshow of features, live updates from Facebook and Twitter, and a text box that allows visitors to participate in the project - all located on the first page. While both of these sites are pretty typical for the respective years they were created in, they also are demonstrative of how much websites have changed in just over a decade. </p> <p><a href="https://wayback.archive-it.org/3340/20131122030535/http://humanorigins.si.edu/" target="_blank"><img src="http://siarchives.si.edu/sites/default/files/styles/body-image-450/public/blog-attached-images/HumanOrigins.jpg?itok=exyNzYEf" alt="A screenshot of the website for the Human Origins Initiative, crawled November 22, 2013, Accession 14-079 - National Museum of Natural History, Website Records, 2013, Smithsonian Institution Archives." title="A screenshot of the website for the Human Origins Initiative, crawled November 22, 2013, Accession 14-079 - National Museum of Natural History, Website Records, 2013, Smithsonian Institution Archives." width="450" height="250" /></a></p> <p>The Archive-It tool is also being used to capture certain programs and events using social media. A great example of this is the crawl of the National Museum of American History’s <a href="https://wayback.archive-it.org/3393/20130607181237/http://talkback.americanhistory.si.edu/" target="_blank">#HistoryTalkBack Tumblr page</a>. This site documented an ongoing project at the museum where curators invited visitors to respond to a question every day and to post their answers on a wall at the museum. The Tumblr page broadcasts some of the favorite posts and then invites commenters to respond to the question as well. We were pleased with the amount of public participation captured in our crawl - not only do we have the visitors’ comments, but because the site is Tumblr-based, we also captured the number of likes and re-blogs. Now that this site is defunct, this crawl becomes important for documenting the scope and impact of this project.</p> <p><a href="https://wayback.archive-it.org/3393/20130607181237/http://talkback.americanhistory.si.edu/" target="_blank"><img src="http://siarchives.si.edu/sites/default/files/styles/body-image-450/public/blog-attached-images/HistoryTalkBack.jpg?itok=P8xxOuEG" alt="A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives." title="A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives." width="450" height="252" /></a></p> <p>I especially like these social media crawls. Social media - instantaneous, constantly updated, and therefore often thought of as transient - is transformed into something more lasting. By looking at crawls from blogs, Facebook, Twitter, Tumblr, and Flickr, we can examine the public’s response to a project and the strategies museums use to engage with their audiences. The #HistoryTalkBack crawl shows this. Tumblr users spread these images, sharing the posts to express their own love of history to friends and followers, while the National Museum of American History used this platform to engage both their real-life and virtual visitors. Capturing these moments using social media gives us a greater understanding of how the public participates in museum programs, and also how museums reach out to people. </p> <p><a href="https://wayback.archive-it.org/3393/20130606201303/http://talkback.americanhistory.si.edu/page/3" target="_blank"><img src="http://siarchives.si.edu/sites/default/files/styles/body-image-450/public/blog-attached-images/HistroyTalkBalk_2.jpg?itok=hSdtzgiM" alt="A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives." title="A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives." width="450" height="250" /></a></p> <p>The Archive-It tool promises incredible potential in the coming years, especially as the Archives continue to grow. If you’d like to learn more, you can check out the <a href="https://archive-it.org/organizations/660" target="_blank">Archives’ Archive-It crawls</a>. </p> <h3>Related Resources</h3> <ul> <li><a href="http://siarchives.si.edu/blog/smithsonian-now-using-archive-it-crawl-websites" style="line-height: 1.538em;">Smithsonian Now Using Archive-It to Crawl Websites</a><span style="line-height: 1.538em;">, The Bigger Picture blog, Smithsonian Institution Archives</span></li> <li><a href="http://siarchives.si.edu/blog/connecting-dots-issues-preserving-complex-websites" style="line-height: 1.538em;">Connecting the Dots: Issues with Preserving Complex Websites</a><span style="line-height: 1.538em;">, The Bigger Picture blog, Smithsonian Institution Archives</span></li> <li><a href="http://siarchives.si.edu/blog/saving-smithsonians-web" style="line-height: 1.538em;">Saving the Smithsonian’s Web</a><span style="line-height: 1.538em;">, The Bigger Picture blog, Smithsonian Instituion Archives</span></li> </ul> <p> </p> <h3>Related Collections</h3> <ul> <li><a href="http://www.siarchives.si.edu/collections/siris_arc_366586" style="line-height: 1.538em;">Accession 14-039 - National Museum of American History, Website Records, 2011-2013</a><span style="line-height: 1.538em;">, Smithsonian Institution Archives</span></li> <li><a href="http://www.siarchives.si.edu/collections/siris_arc_366664" style="line-height: 1.538em;">Accession 14-079 - National Museum of Natural History, Website Records, 2013</a><span style="line-height: 1.538em;">, Smithsonian Institution Archives</span></li> </ul> </div></div></div><div class="field field-name-taxonomy-vocabulary-3 field-type-taxonomy-term-reference field-label-above"><div class="field-label">Blog Categories: </div><div class="field-items"><div class="field-item even"><a href="/blog/category/behind-scenes"><span>Behind the Scenes</span></a></div></div></div><div class="field field-name-taxonomy-vocabulary-4 field-type-taxonomy-term-reference field-label-above"><div class="field-label">Blog Tags: </div><div class="field-items"><div class="field-item even"><a href="/blog/tag/archive"><span>Archive</span></a></div><div class="field-item odd"><a href="/blog/tag/webtech"><span>Web/Tech</span></a></div></div></div>