Dataverso: Open Source Research Data Repository Software

Dataverso Project: Research data repository SW

Dataverso Project: Research data repository SW

From time to time, we usually walk through the scientific field to learn about and disseminate developments, research and scientific initiatives related to or based on the use of free software and open source. For this reason, on other occasions, in a timely and appropriate manner, we have managed to make known not only about news related to the creation or use of free and open operating systems (distributions) based on GNU/Linux for the scientific field, but on projects like OpenDreamKit and Project Jupiter.

And this time, the turn is for a great and very valuable scientific initiative called, The "Project Dataverse". It is worth mentioning briefly before starting, for those who have never heard of it, that basically a sopen source research data repository software.

OpenDreamKit and Project Jupyter: 2 Open Source Scientific Projects

OpenDreamKit and Project Jupyter: 2 Open Source Scientific Projects

But, before starting this present post about this cool music player called "Project Dataverse" We recommend that you explore this other one later previous related post:

OpenDreamKit and Project Jupyter: 2 Open Source Scientific Projects
Related article:
OpenDreamKit and Project Jupyter: 2 Open Source Scientific Projects

Dataverso Project: Web application for research data

Dataverso Project: Web application for research data

What is the Dataverso Project?

According to his Official website, the "Project Dataverse" It is briefly described as follows:

The Dataverso Project is an open source web application for sharing, preserving, citing, exploring and analyzing research data. It makes data available to others and allows you to more easily replicate the work of others. Researchers, journals, data authors, publishers, data distributors, and affiliated institutions receive academic credit and web visibility.

so, doing similarities with the linux world would be something like a GitLab or GitHub, but for scientists.

Other important points about the operation of this open source scientific initiative to know are the following:

  1. A Dataverso repository (or Dataverse in English) is created or formed by installing the project software. And this created Dataverso can then host various virtual files called Dataverso collections.
  2. In turn, each Dataverso collection contains data sets, and each data set contains descriptive metadata and data files (including documentation and code that accompanies the data). In addition, recursively, this organization method allows Dataverso collections to also contain other Dataverso collections.
  3. The main objective of the Dataverso Project is to automate much of the manual work that a professional archivist has to do. In addition, to provide services and distribute credit to the creator of the research data generated. Thus correcting the bad effects of the previous practice or methodology created, which consisted of, the researchers decided between receiving credit for their data, controlling the distribution themselves, but without long-term retention guarantees, or having long-term retention guarantees, sending them to a professional archive but without receiving much credit.

Features

As can be seen in your GitHub website, the Dataverso Project software is active and in full development. And among his many characteristics It is worth noting the following:

  1. Includes support for FAIR data principles (Findable, Accessible, Interoperable, Reusable): i.e. Data Locatable, Accessible, Interoperable and Reusable.
  2. Has an API for interoperability and custom integrations: Which facilitates the use of Search API, Data Warehouse API (SWORD), Data Access API, Metrics API, Migration API, among others.
  3. Integrates support for API Client Libraries: Allowing you to Interact with the Dataverso APIs of Python, R, Javascript, Java, and Ruby.

Finally, currently this software has as latest stable version to number 5.13, dated February 14, 2023, which includes new features, improvements, and bug fixes, thanks to its active community that often contributes code, suggestions, bug reports, and other types of help to said project. And if you want to know more about it, you can explore its online demo to meet him

The Dataverse Project is being developed at Harvard's Institute for Quantitative Social Sciences (IQSS), together with many collaborators and contributors around the world. The Dataverso Project built on our experience with our previous Virtual Data Center (VDC) project, which ran from 1997 to 2006 as a collaboration between the Harvard-MIT Data Center (now part of IQSS) and the Library of the University of harvard. History (origins) of the Dataverso Project

Related article:
Open Science Project arrives, a free scientific program to expand knowledge

Roundup: Banner post 2021

Summary

In summary, the "Project Dataverse" is, without a doubt, one of many scientific initiatives developed with free software and open source that deserve to be known, disseminated and supported, for the benefit of human development. Therefore, we invite you to do so, and if necessary, tell us via comments what you think of this scientific initiative based on open source. Or if you know of another important one to review, it will also be a pleasure to meet her in order to give her space in a publication in the near future.

And if you liked this post, do not stop sharing it with others on your favorite websites, channels, groups or communities of social networks or messaging systems. Lastly, remember visit our home page en «FromLinux» to explore more news. And also, join our official channel of Telegram from DesdeLinux, West group for more information on today's topic.