Center for Research Libraries - Global Resources Network

Resources for

Overview

Abstract

Gale Cengage has designed Nineteenth Century Collections Online (NCCO) to be an extensive database with multiple content types, covering most regions of the world. The collection is in a rolling release of twelve modules over several years. The initial four collection modules (called “Archives” by Gale) were released in spring 2012. The next four modules are scheduled for release in spring 2013.

This review is in progress. CRL is working to gather more information and critical assessment. We encourage our community to submit comments on their experiences with the database.

This resource is currently offered to CRL libraries at favorable terms. Representatives at each library have been sent login access to view purchase or subscription offers.

 

Sources

Sources for this review include information publicly posted or obtained directly from the publisher, data collected by CRL staff and members, and examination of the digital collection when possible. Other sources are noted where cited.

NCCO has also been reviewed in The Charleston Advisor1.

Reviewers

Center for Research Libraries

  • Virginia Kerr - Digital Program Manager

Analysis

Collection Content

Gale Cengage has designed Nineteenth Century Collections Online (NCCO) to be an extensive database with multiple content types, covering most regions of the world. The collection is in a rolling release of twelve modules over several years. The initial four collection modules (called “Archives” by Gale) were released in spring 2012. The next four modules are scheduled for release in spring 2013.

NCCO content sources include monographs, newspapers, manuscripts, photographs, maps, ephemera, and statistical data. The collection modules are topically oriented and intended to integrate in varying degrees sources from world regions including the Americas (North, Central, and South), Africa, East and South Asia, and the Middle East, as well as Europe. Most of the content in the initial four modules will be English language, although future collection modules will expand coverage to other European and also some non-western language content.

The planned topics for the “Archives” are:

Module Description
NCCO 1

British politics and society (2012)

c. 1.7 M pages. Newspapers, periodicals, correspondence, pamphlets, etc.

NCCO 2

Asia and the West: diplomacy and cultural exchange (2012)

c. 2 M pages. Ministerial and consular papers; foreign missions.

NCCO 3

British theatre, music and literature: high and popular culture (2012)

c. 1 M pages. Annotated programs, compositions, correspondence, financial records.

NCCO4

European literature, 1790-1840: the Corvey collection (2012)

c. 5.2 M pages. Sourced  from microfiche and scans produced by Belser Wissenschaftlicher Dienst; c. 9500+ titles from a specialized European literature collection; languages evenly distributed among German, English, and French

NCCO 5

History of science, technology, and medicine (spring 2013)

c. 5 M pages; includes American medical periodicals, periodicals on biology, botany, chemistry, zoology, etc.; monographs on the theory of evolution, electricity, color theory, engineering, etc.

NCCO 6

Photography-the world through the lens (spring 2013)

c. 2 M pages and images. Documents photography used as historical record,  reproduction, art form, and scientific methodology.

NCCO 7

Women: transnational networks (spring 2013)

c. 1.5 M pages. The intersection of gender and class through the era of suffrage movement. Includes diaries and personal papers, periodicals, biographical entries.

NCCO 8

Europe and Africa: commerce, Christianity, civilization, and conquest (spring 2013)

c. 1.5 M pages. Consular and colonial office documents from the U.S., Britain, Germany etc.; colonial-era African newspapers; exploration narratives; missionary documents.

NCCO 9–12

To be determined “in collaboration with NCCO customers and partner libraries”

 

Microform and original sources

Modules 1-4

Gale projected that existing microfilm collections would account for approximately 65% of the content in the initial release (modules 1-4), with 35% scanned directly from original sources. (See Appendix A for a list of initially identified microform sources for these modules.)  Two significant sources of original material were the British Library and The National Archives. Gale also targeted previously uncataloged material from the British courts.

Modules 5-8

Gale has indicated that published microform sources will make up a smaller component of subsequent modules, with about 30% of modules 5-8 to come from Gale microfilm collections. (See Appendix B for a list of Gale Primary Source Media microform collections planned as sources for NCCO modules 5-8).

*History of science, technology, and medicine

Sources identified include: American Medical Periodicals, 1797-1900 from the National Library of Medicine; Scientific & Technical Periodicals from the Royal Society of London’s Catalogue of Scientific Papers, 1800-1900 from the National Academy of Natural Sciences in Philadelphia (as well as the Academy’s Minutes and Correspondence1812-1924); journal titles from the aggregated content in Landmarks of Science from Readex (about a 15% overlap with the microform in this collection). See Appendix C for specific journal titles sourced from microfilm.  Original content: rare scientific monographs from the Burndy Library of the Dibner Institute, acquired by The Huntington Library in 2006.

* Photography-the world through the lens

History of Photography from Gale’s Primary Source Media microform collections. Original content: of fourteen photography source collections under consideration from the Library of Congress (as of November 2012) only two are also represented in the online open access site American Memory.

* Women: transnational network

History of Women from Gale’s Primary Source Media microform collections

Other original content sources for modules 5-8 will include The National Archives, and National Archives (U.S.).

Overlap with other digital collections

There are questions about both comprehensiveness and duplication in NCCO, since many key documents and monographs are already available elsewhere. Gale has stated that a key selection criterion is avoidance of overlap with existing digitized 19th-century materials, especially the content scanned for the Google Books project and available in HathiTrust. Particular attention is paid to material having multiple potential sources.

On the other hand, Gale notes that collections of published documents in NCCO, such as the scientific journals,  have not been selected through  individual title checking against licensed or open access digital sources. For instance, at least seven of the American medical journals in module 5 can also be found in American Preiodicals from the Center for Research Libraries (APCRL), distributed by ProQuest. In a sampling of 20% of the science and technology journals included from microform sources (taken from the title lists in appendix C), CRL found 50% overlap with existing digital collections, including licensed content and open access sources.

Note that Gale has indicated plans to eventually integrate their previously digitized 19th- century content, which they expect will require re-indexing prior to mounting on the newly developed research platform.

Q & A with Gale

In 2011, prior to the initial launch of modules 1-4, questions about the product under development were submitted to Gale in 2011, with the following responses:

Q: What are the anticipated sizes of the collection modules, and the estimated overall size of the completed database?

A: “The plan for NCCO allows for a total of approximately eight to twelve million pages across four Archives per year, for the next three years, with an overall program size in excess of thirty million pages. It is not anticipated that each Archive will be equal to the next in size or document count, but rather that the total number of pages/documents will reflect the quantity of content needed to fit the research needs (and budgets) of the subject areas which the Archive is meant to support.” Note that Gale has further indicated that additional material could be developed after the initial twelve modules, depending on market response.

Q: What percentage of the overall digital collection will be European content? What percentage will be from North America?

A: “This cannot be determined at this point. This is an evolving program and the direction it takes, guided by our Advisory Board and ad hoc experts in the subject areas on which Gale will focus, will determine where Gale looks for sources and content. For example, if Gale focuses on photography [for one Archive] as currently planned, sources would include the U.S., Germany, France, and Japan—but MAY also include institutions outside of those geographies with significant collections. Gale will announce each library partner as agreements are signed. Gale intends to react to changing research needs and an ever-changing digital landscape, and so will not seek to define the program so far into the future as to be unable to be certain that we are bringing unique materials of value to researchers and students . . . at any given time.”

Q: What percentage of the documents will be unpublished/archival documents?

A: “At present, Gale is looking outside of monographs and newspapers in a more intensive way than at the corpus of published materials; manuscripts and ephemera will play a large role in the program going forward. However, it is impossible to say, at this point, what the percentages may be.”

Q: Will the item-level metadata be exposed in various web scale discovery tools?

A: “At present, Gale is working through how we are working with discovery tools and services per our Gale Digital Collections, however progress is being made and we anticipate a program to roll-out hopefully in 2013. Per exposed metadata in general, it is viewable and available in most viewing options of the NCCO platform itself, and the underlying scanned XML content is downloadable.”

Q: What metadata schema (or schemas) will be used for the archival materials?

A: “This will be very similar to our current collections structure.” Note that Gale uses metadata authority sources developed in-house.

Delivery

NCCO has been built on a new platform developed by Gale, intended to incorporate a range of tools and features that support current digital research methods. In a description provided prior to the launch in 2012, Gale noted the following distinguishing features:

  • Textual Analysis Tools
    Identify patterns, trends and relationships. In addition to providing new avenues for searching, underlying data is exportable for further analysis.
  • Subject Indexing
    Detailed subject indexing of manuscript materials, highlighting topics, people, places, and dates, and identifying relationships among documents
  • Zotero Compatible
  • User-Generated Tags and Annotations
    Users can add their own metadata for document categorization. Tags can be shared publicly or used privately to organize documents for particular research needs. Text can be highlighted and annotated with virtual notes for personal use.
  • User Accounts

         Personal accounts allow users to save and edit their tags and annotations.

  • Image Viewer
  • Flexible page image viewing

          Zoom, highlight, rotate, reverse (negative image), and view individual pages in full screen      mode, with the ability to adjust brightness and contrast to create a custom view.

Gale has extended their previous agreement with Portico to include archiving backup for NCCO content.

Strengths and weaknesses

The scope of this digitization project will make it a significant research resource for both scholars and undergraduates. There are several benefits to this scope: the sheer size of the collections; the geographic range including comparative materials from various cultures; and the variety of types of documents including a significant amount of unpublished material. At the same time, the large scale of NCCO can have drawbacks:

  • Some core resources will be excluded to serve the goal of providing unique content (these core resources may be available only in other licensed or open access sources, leaving the challenge of integrating discovery and access).
  • Some print content will nevertheless overlap with other digital collections.
  • Up to 40% of the content has been distributed in microform.
  • The thematic basis for building the modules, while appealing to curricular goals and potentially supporting resource discovery, may also result in truncating the full runs of some content (such as newspapers).
  • The heterogeneous variety of source material will result in an uneven level of discovery. While Gale has invested in extensive indexing of manuscript material to enhance access, and the platform also offers user tagging, the full texts of manuscript materials have not been transcribed.

The new platform tools developed for NCCO are impressive and for the most part should be useful to scholars as well as students.  These include the browse features for "Exploring Collections,"the search within results features, text analysis tools, and sophisticated image viewing tools. However, scholars wanting to apply extensive textual analysis methods will still want to download content for further research.

 

Terms

The individual modules are available for purchase. Discounts apply to purchase of multiple modules.

Endnotes

1 Brian Vetruba, “Nineteenth Century Collections Online,” The Charleston Advisor January, 2013, 14(3): 35-40, accessed January, 2013.

 

Details


Direct from Publisher

Sources

Sources for this review include information publicly posted or obtained directly from the publisher, data collected by CRL staff and members, and examination of the digital collection when possible. Other sources are noted where cited.

Collection Content

Subjects coveredPolitics, economics, diplomacy, international relations, imperialism, labor movements, nationhood, theater, literature, musical composition, history of science and engineering, history of medicine, women's studies, history of photography
Geographic coverageEurope, Africa, East Asia, South Asia, North America, South America, Central America, Australia, New Zealand
PChronological coveragemid-18th century to mid-20th century; bulk of materials are within the 19th century
Content types Monographs, manuscripts, newspapers, photographs, maps, ephemera, statistics
Source formats Microfilm, microfiche, paper
PTotal titlesunknown at this time
PTotal pagesc. 30 million; twelve "archives" or modules of varying size, with modules 1-8 averaging 1.85 M pages each
Digital collection launch dateMay 2012
Update frequency annual
Collection ongoingY
PCompletion dateMay 2014
PAvailable supplements Gale will consider marketing supplements depending on demand
PMajor languagesEnglish, French, German initially; other languages to be added as the publishing program progresses
Back to Top

Technical Platform

PBrowser compatibilityIE v7.x, 8.x, 9.x; Firefox v5.0.x+ and v3.5x-3.6.x; Safari 4.1x+; Chrome v9.0+
PAuthentication optionsIndustry Standard – IP Authentication / Proxy / User / Pass
PArchiving solution – master filesCovered under Gale's contract with Portico
PArchiving solution – derivative filesCovered under Gale's contract with Portico; Portico platform
PAvailability in web discovery toolsN (Gale continues to investigate viability)
POpen URL targetOpen URL may be available for appropriate content types later in 2013. It is now a higher technical priority for Gale.
Federated searching, z39.50NA
PLocal host optionY
PUsage statisticsY
Back to Top

Interface Tools

PFull text displayedY (for printed documents only)
PPage imagesY
PColor imagesY (% not finalized yet)
PSearch full textY (for printed documents only)
PAdvanced searchY
PSearch within resultsY
PLimit results by dates and/or document typesY
PDisplay highlighted search termsY
PDisplay snippet -- search term in contextY
PRelevance sortingY
PSave searchesY
PDownload PDFY
PDownload HTMLY
PPrint pageY
PPrint full documentY
PExport citationsY
PAnnotation toolsY
PCross-product searchingY (within NCCO; in 2013 will build cross searching with other Gale collections such as ECCO)
PILLY (“Limited amount to reasonable, controlled degree”)
PRestrictions on useY
Back to Top

Provider

Publisher / DistributorGale Cengage Learning
Address27500 Drake Rd., Farmington Hills, MI 48331
Contacthttp://gale.cengage.co.uk/product-highlights/history/nineteenth-century-collections-online/ncco-archives.aspx ; http://gdc.gale.com/: ray.abruzzi@cengage.com
Related product(s) multiple
CRL Profile of Publisher
Back to Top

Terms

Options
Subscription optionN
Purchase optionY
PMultiple year payments optionY
PHosting chargesY
List of purchasers availablenot released yet
PSample license availableY
MARC records purchase feeY
PPrice tier basisFTE tiers
Back to Top

Use Cases

Use Cases

Mining content from NCCO

Gale has contracted with NINES (Ninetenth-century Scholarship Online) to provide metadata from NCCO (as well as ECCO) in support of the NINES effort to provide federated searching across a range of key scholarly databases. Because of the complications of clearing rights with various content providers, as of November, 2012, Gale is not currently able to offer extensive downloads of NCCO content for text mining.  They are examining the feasibility.

Follow CRL

Stay informed by subscribing
to our e-mail list and social
media outlets.