Freedom of Information Archive

CRL Status:
Not set yet.
Feedback:
0 User comments 0 0

CRL licensing and community input features are only available with a CRL member login.

If your institution is a CRL Member please:

log in or sign up
    Overview

    A web-based digital archive and accompanying analytical tools, developed by the HistoryLab Project at the Department of History, Columbia University,of declassified records and documents obtained from the U.S. and other national governments through freedom of information act requests and other means.

    Mar 4, 2016 1:31pm
    Details
    Collection Content

    As of February 2016 the database included the following groups of U.S. government records: 

    • The Foreign Relations of the United States (1945-1980) includes 75,263 documents to date, from a curated collection of the most important declassified documents selected by State Department historians with access to every government department and agency. It also includes many documents drawn from the personal papers of Presidents and key policymakers.
    • The State Department Central Foreign Policy Files (1973- ) As an early adopter of electronic record systems, the U.S. Department of State has highly structured communications. This core collection continues up to the present -- it was the source of the Wikileaks cables, for instance. But whereas Wikileaks was a relatively small and non-random sample (~7% of all cables), we have over 2.6 million cable records, including: ~1.76 million fully released cables, ~411k unreleased cables for which we have metadata, and over 500k records with metadata for documents that were delivered by diplomatic pouch.
    • Henry Kissinger Telephone Conversations (1973-1976) . The database holds 4,552 transcripts of Kissinger telephone conversations during his tenure as Secretary of State. A collection from a high-level policymaker, it affords users the ability to draw connections and contrasts with the larger institutional archives in FRUS and the State Department Central Foreign Policy Files. 
    • The Hillary Clinton Emails (2009-2012). As of November 2015 the database held over 16,246 documents comprising 40,737 individual email messages as released through the State Department’s FOIA Reading Room. History Lab researchers have done extensive work to parse these documents, which were originally released as PDFs, in order to extract structured information for conducting data analyses.
    Delivery

    The Freedom of Information Archive brings together in a single database documents that are now isolated and scattered in multiple online repositories and commercial databases, and only accessible through stand-alone search engines, which in most cases are quite primitive. History Lab has integrated millions of these documents in a common database, extracted metadata, and built tools to reveal the most important persons, places, the topics. Users can now obtain an overview of different collections, overlaying one atop another, and use document counts over time to chart the rise and fall, and patterns of historical activity. After identifying documents sharing common features, the Merriam tool automatically retrieves similar texts from all collections based on desired criteria, such as simultaneity, semantic features, and social networks.

    The archive interface enables users to identify documents within a desired date range, determine and graph the number of documents in the archive over time, filter by country, map the geographical distribution of countries referenced in documents, search collections by document topic and visualize results, 

    While the content is open access, only IP-address authenticated researchers are able to create their own password-protected user accounts. This system,which is now being tested by students at Columbia, allows users to save individual documents, tag them with labels (either new ones they devise on their own or auto-completed from previous labels), highlight and annotate sections of texts.  These are individual accounts, but can also be used for class assignments or collaborative projects.

     

     

     

    Community Ratings

    Content scope and completeness
    No votes yet
    Cost and price-structure
    No votes yet
    Platform and user interface
    No votes yet