Can I use Dataverse for HIPAA-protected data?

The Harvard instance is not for PII/PHI; however, the software can be self-hosted in a secure environment for such data.

Harvard Dataverse

Overview

Harvard Dataverse is a research data repository hosted by Harvard University and built on the open-source Dataverse software, which is designed for sharing, preserving, and citing scholarly data. The software runs on a Java stack (Payara and PostgreSQL), and the Harvard installation is one node in the global Dataverse Project network. As of 2026, it is a widely used implementation of the FAIR (Findable, Accessible, Interoperable, and Reusable) data principles, and it mints a DOI (Digital Object Identifier) for each published dataset through DataCite. Metadata is organized in a modular schema that supports domain-specific standards such as DDI, Dublin Core, and Schema.org. The API-first design allows integration with computational tools such as Jupyter and RStudio, and with institutional identity providers via Shibboleth and OAuth2. It is positioned as a non-profit, community-governed alternative to commercial repositories, with fine-grained metadata management and long-term digital preservation through integration with Archivematica. It suits both individual researchers who need to meet funder mandates and large institutions that need scalable data infrastructure.
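As a rough illustration of the API-first design mentioned above, the sketch below builds request URLs for two Dataverse endpoints: the Search API and the dataset metadata export. The helper names (`search_url`, `export_url`) are hypothetical, the DOI in the usage lines is a placeholder rather than a real dataset, and the base URL assumes the public Harvard installation; a self-hosted installation would use its own. The code only constructs URLs and sends nothing over the network.

```python
from urllib.parse import urlencode

# Assumption: the public Harvard installation. Self-hosted sites differ.
BASE_URL = "https://dataverse.harvard.edu"


def search_url(query, item_type="dataset", per_page=10, base_url=BASE_URL):
    """Build a Search API URL (GET /api/search) for the given query."""
    params = {"q": query, "type": item_type, "per_page": per_page}
    return f"{base_url}/api/search?{urlencode(params)}"


def export_url(doi, exporter="ddi", base_url=BASE_URL):
    """Build a metadata export URL for a published dataset.

    `exporter` selects the output format, for example "ddi",
    "oai_dc" (Dublin Core), or "schema.org".
    """
    params = {"exporter": exporter, "persistentId": doi}
    return f"{base_url}/api/datasets/export?{urlencode(params)}"


# Placeholder DOI for illustration only.
print(search_url("climate"))
print(export_url("doi:10.5072/FK2/EXAMPLE", exporter="oai_dc"))
```

The resulting URLs can be passed to any HTTP client, or pasted into a notebook cell, to retrieve JSON search results or an exported metadata record.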

Common tasks

Research Data Archiving, Persistent Identifier Generation, Metadata Harvesting, Restricted Data Access Control, Data Versioning

FAQ

Is there a cost to store data on Harvard Dataverse?

No. Storage is currently free for researchers worldwide, up to 1 TB per dataverse.

Can I keep my data private?

Yes. A dataset can stay unpublished in the 'Draft' state, and files can be marked 'Restricted' even after the dataset is published.
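For readers who access restricted files programmatically, here is a minimal sketch of how such a request is typically assembled. It assumes the Dataverse Data Access API path `/api/access/datafile/{id}` and the `X-Dataverse-key` header for the API token; the function name, file ID, and token value are hypothetical placeholders. The function only prepares the URL and headers and does not perform the download.

```python
def restricted_file_request(file_id, api_token,
                            base_url="https://dataverse.harvard.edu"):
    """Return the (url, headers) pair for downloading one file.

    The API token identifies the user, so the server can check whether
    that user has been granted access to a restricted file.
    """
    url = f"{base_url}/api/access/datafile/{file_id}"
    headers = {"X-Dataverse-key": api_token}
    return url, headers


# Placeholder values for illustration only.
url, headers = restricted_file_request(12345, "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx")
print(url)
```

Keeping the token in a header rather than in the query string helps keep it out of logs and browser history.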

Does Dataverse support big data?

Yes. The Dataverse software integrates with Globus for large-scale file transfers, which lets an installation work with storage at up to petabyte scale.

Who owns the data I upload?

The researcher/depositor retains ownership; Dataverse acts as the host and archive.

Compare with top alternatives

Tool	Pricing	Rating	Visits
Harvard Dataverse (current)	Freemium	-	-
Collibra	Paid	★ 0.0	-
Cambridge Core	Paid	★ 0.0	-
DOAJ (Directory of Open Access Journals)	Freemium	★ 0.0	-
