CERN
Long before the term ‘big data’ entered public domain, the European Organization for Nuclear Research (CERN), founded in 1954, was standing up to the challenge of processing unthinkable amounts of information.

Yandex started to collaborate with the LHCb experiment at CERN in 2011 on a number of projects in various areas of research, and has been part of CERN openlab as an Associate member in 2013 and 2014. A single experiment can generate tens of petabytes of information per year. Yandex has developed an efficient mechanism for navigating datasets of such a scale. Another Yandex-CERN project is focusing on optimization of highly distributed data storage based on dataset popularity patterns. Distributed execution of user tasks for simulation or data processing in GRID resources, clouds and individual clients is another area of our interest.

We are sharing our collaborative practices and expertise in improving efficiency of computational experiments with CERN. Implementing our technologies in high-energy physics gives us invaluable feedback, which we use to make these technologies more flexible and powerful. Reproducible Experiment Platform, a collaborative software infrastructure, is our most important means of communication. This platform is a bridge that brings together data scientists and high-energy physicists by giving them an opportunity to meet, speak a common language and work together to push the frontiers of fundamental science to new, bright and fascinating discoveries of the Universe.

See also

pdf

Large Hadron Collider Reaches New Level of Cost-Effectiveness With YDF’s Big Data Storage Solution

Read the full case study
pdf

YDF’s Solution to the Large Hadron Collider’s Hunt for Critical Data

Read the full case study
pdf

Large Hadron Collider Research Speeds Up With YDF’s Event Analysis Solution

Read the full case study
pdf

YDF Cuts Large Hadron Collider Costs by Detecting Anomalies in Data

Read the full case study