Category Archives: Technical Resources

Anatomy of a Web Archive | The Signal: Digital Preservation

By gretchen | November 8, 2013

by Nicholas Taylor. I’m inclined to blame the semantic flexibility of the word “archive” for the fact that someone with no previous exposure to web archives might variously suppose that they are: the result of saving web pages from the browser, institutions acting as repositories for web resources, a navigational feature of some websites allowing for browsing of past content, […]

Automating Extracting GIS Data from Scanned Maps – GIS Lounge

By gretchen | November 8, 2013

by Caitlin Dempsey. The New York Public Library Labs (NYPL Labs) has posted on GitHub the code to its open source map-vectorizer project. NYPL Labs’ map-vectorizer project is seeking to automate (“like OCR for maps”) the process of extracting polygon and attribute information from old scanned maps. The code was developed with the purpose of extracting building information from […]

Les mots croisés vénitiens (Venetian Crosswords)

By gretchen | October 24, 2013

by Frederic Kaplan. Last week I was in Venice to organize the first doctoral school between Ca’Foscari University and EPFL. Nearly thirty students from the historical sciences and the information sciences worked in groups on projects related to the Venice Time Machine. We created a website dedicated to the event. I […]

Abstractualized: Slavic-specific resources for digital scholarship

By gretchen | October 24, 2013

by Seth Bernstein. I’ve just had a lesson come out on automatic transliteration of Cyrillic sources in The Programming Historian, so I thought that I would devote this post to shameless self-promotion. Then I decided I should also write a little about some of the tools I use to build databases from web information and create visualizations. […]
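To give a rough sense of what automatic transliteration of Cyrillic involves, here is a minimal Python sketch. It is not the code from the lesson mentioned above: the `CYRILLIC_TO_LATIN` table and the `transliterate` function are hypothetical names, and the mapping is a simplified romanization chosen for illustration only (no diacritics, soft and hard signs dropped).

```python
# Hypothetical, simplified Cyrillic-to-Latin table (illustration only;
# real romanization standards differ in several letters).
CYRILLIC_TO_LATIN = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e", "ё": "e",
    "ж": "zh", "з": "z", "и": "i", "й": "i", "к": "k", "л": "l", "м": "m",
    "н": "n", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t", "у": "u",
    "ф": "f", "х": "kh", "ц": "ts", "ч": "ch", "ш": "sh", "щ": "shch",
    "ъ": "", "ы": "y", "ь": "", "э": "e", "ю": "iu", "я": "ia",
}


def transliterate(text):
    """Replace each Cyrillic letter with its Latin equivalent from the table."""
    out = []
    for ch in text:
        lower = ch.lower()
        if lower in CYRILLIC_TO_LATIN:
            latin = CYRILLIC_TO_LATIN[lower]
            if ch != lower:
                # The source letter was uppercase: capitalize the replacement.
                latin = latin.capitalize()
            out.append(latin)
        else:
            # Leave anything outside the table (Latin text, digits, spaces) as-is.
            out.append(ch)
    return "".join(out)


print(transliterate("Москва"))
```

A character-by-character lookup like this is easy to audit, but it ignores context-dependent rules, so any real project would need a table matched to the romanization standard it has to follow.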

Google’s Ngram Viewer Goes Wild

By gretchen | October 24, 2013

by Ben Zimmer. It’s been nearly three years since Google rolled out its Ngram Viewer, allowing armchair historians to plot the trajectories of words and phrases over time based on an enormous corpus of data extracted from the Google Books digitization project. Since then, there have been numerous studies seeking to glean some cultural significance from […]
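The idea of plotting a phrase's trajectory over time can be sketched in a few lines of Python. This is not Google's implementation: the function names `ngrams` and `relative_frequency` are hypothetical, tokenization is a plain whitespace split, and the three-sentence `toy_corpus` is made up purely to show the calculation.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens, joined by single spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def relative_frequency(texts_by_year, phrase):
    """Share of same-length n-grams in each year's text that equal the phrase."""
    target = " ".join(phrase.lower().split())
    n = len(target.split())
    result = {}
    for year, text in sorted(texts_by_year.items()):
        grams = ngrams(text.lower().split(), n)
        counts = Counter(grams)
        result[year] = counts[target] / len(grams) if grams else 0.0
    return result


# Made-up toy corpus, one short "text" per year (illustration only).
toy_corpus = {
    1900: "the horse pulled the cart",
    1950: "the car passed the horse",
    2000: "the car passed the car",
}

print(relative_frequency(toy_corpus, "car"))
```

Dividing by the number of n-grams per year, rather than reporting raw counts, keeps years with more text from dominating the curve.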

Forced Labour, Five Tours on Berlin History « Museummedia

By gretchen | October 14, 2013

“Forced Labour. The Testimony App” offers five tours which take you through forced labour in Berlin. Witnesses tell of factories and forced labourers’ camps. You can follow their path with a multimedia guide on foot, by bicycle or with the Berlin S-Bahn. Download the tours over Wi-Fi and experience the everyday life of forced labourers throughout Berlin. […]

Data exploration through visualisation – Digital scholarship blog

By gretchen | September 26, 2013

by Ben O’Steen. The impact of a thoughtful visualisation cannot be overestimated. However, it’s easy to forget how tremendously useful visualisations are for understanding your own data, before you even know what you have. [Figure: “Textual cross references found in the Bible” © Chris Harrison] The questions “Is there…?” and “What if…?” drive the exploration of data. […]

The Programming Historian 2

By gretchen | September 26, 2013

by Kellen Kurschinski. Technical Reviewers: Nick Ruest, Konrad Lawson. Literary Reviewer: Ian Milligan. Background and Lesson Goals: Now that you have learned how Wget can be used to mirror or download specific files from websites via the command line, it’s time to expand your web-scraping skills through a few more lessons that focus on other […]

TextBlob: Simplified Text Processing — TextBlob 0.6.3 documentation

By gretchen | September 26, 2013

Release v0.6.3. TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. Features: Noun […]

Europeana Professional – Europeana Cloud – Shared Infrastructure for European Cultural Content – eCloud Blog

By gretchen | September 26, 2013

by Pavel Kats, Europeana Development Manager. Unless you’ve been living in a cave for the last few years, you’ve probably noticed that cloud computing technologies are rapidly transforming our lives. Cloud computing, or ‘the cloud’, for simplicity, have become buzzwords in everyday talk – every kid on the block knows that there are things out […]