European Organization for Nuclear Research (CERN)



CERN, “where the web was born”, is the leading High-Energy Physics (HEP) laboratory. Funded by 22 Member States with a budget of more than €800M/yr, it employs 2,200 permanent staff and hosts over 10,000 HEP scientists from 600 institutes in 113 countries.


CERN's flagship project, the Large Hadron Collider (LHC), produced over 100 Petabytes of data in its first run, allowing the discovery of the Higgs boson. The discovery was announced in July 2012 and celebrated by the award of the Nobel Prize to Peter Higgs and François Englert in October 2013. CERN's ambitious research goals have put the laboratory at the forefront of e-Infrastructures throughout its history. It has contributed prominently to dozens of EC co-funded Grid projects and operates the largest multi-disciplinary Grid infrastructure in the world; it is also a founding member of ORCID and has contributed to several FP7 and H2020 projects relevant to the topics of this project.


Open Science is enshrined in the words of the CERN Convention: “… the results of its experimental and theoretical work shall be published or otherwise made generally available”. This vision and IT innovation come together through the development of Invenio, an Open Source digital library platform. Invenio powers Zenodo, a multi-disciplinary data repository that acts as a cornerstone of Open Science for H2020, and CERN Open Data, which delivers large-scale data releases from the LHC experiments to the world. Both data repositories support the integration of persistent identification for all content types. The closed counterpart of CERN Open Data, CERN Analysis Preservation, uses Invenio to build a service that enables researchers to preserve and persistently link all the materials associated with a physics analysis. CERN Open Data and CERN Analysis Preservation are two pillars of the CERN pilot applications for the FREYA project.


CERN and collaborating partners in Europe, the United States, China, and Japan use Invenio to power INSPIRE, a one-stop-shop digital library for HEP. Serving over 1 million records to 50,000 scientists worldwide, INSPIRE increasingly hosts additional content types, such as data and software snippets, to facilitate their discoverability and reuse. It is a strategic goal of INSPIRE to encourage the attribution and citation of datasets, code, and any other Open Science material. Furthermore, INSPIRE is particularly interested in the advancement of persistent identifiers for organizations.