EFRA Facts: 10 Facts about Big Data You Don’t Know

In the digital age, Big Data reigns supreme: information now arrives in massive volumes, from diverse sources, and at rapid velocity. From driving business insights to powering artificial intelligence, these 10 facts highlight the multifaceted impact of Big Data on our rapidly evolving digital landscape.


Explosive Data Growth

The term “big data” is often traced to the 1990s and came into wide use in the early 2000s. It refers to datasets so vast and complex that they cannot be processed using traditional methods. By one widely cited estimate, over 2.5 quintillion bytes of data are created every day, and this rate is accelerating.

The Three Vs

Big data is often characterized by the three Vs: Volume (large amount of data), Velocity (high speed at which data is generated and processed), and Variety (diversity of data types and sources).



Data Growth Rate

As of 2021, an estimated 90% or more of the world’s data had been generated in just the preceding two years. This rapid growth is driven by the digitalization of more and more aspects of our lives.

Real-Time Analytics

Big data analysis allows businesses to perform real-time analytics, enabling them to make informed decisions based on up-to-the-minute insights. For example, financial institutions use it to detect fraud in real time, preventing monetary losses.
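To make the fraud example concrete, here is a minimal sketch of one simple real-time rule: flag a card that makes too many transactions within a short time window. The class name, the thresholds, and the rule itself are illustrative assumptions, not a description of how any particular institution works; production systems typically combine many signals and machine-learned models.

```python
from collections import defaultdict, deque


class VelocityChecker:
    """Hypothetical sliding-window rule: flag bursts of transactions per card."""

    def __init__(self, max_txns=3, window_seconds=60):
        # Both thresholds are assumed values chosen for illustration.
        self.max_txns = max_txns
        self.window_seconds = window_seconds
        self._recent = defaultdict(deque)  # card_id -> recent timestamps

    def check(self, card_id, timestamp):
        """Record a transaction; return True if it should be flagged."""
        window = self._recent[card_id]
        # Drop timestamps that have fallen out of the time window.
        while window and timestamp - window[0] > self.window_seconds:
            window.popleft()
        window.append(timestamp)
        return len(window) > self.max_txns


checker = VelocityChecker(max_txns=3, window_seconds=60)
# Four transactions in 30 seconds, then one much later (times in seconds).
flags = [checker.check("card-1", t) for t in (0, 10, 20, 30, 200)]
print(flags)  # [False, False, False, True, False]
```

Because each transaction is checked the moment it arrives, the fourth purchase in the burst is flagged immediately rather than in an end-of-day batch report.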



Data Size in Perspective

If the roughly 2.5 quintillion bytes generated in a single day were burned onto standard 4.7 GB DVDs, it would take more than 500 million discs, and the stack would stand roughly 640 kilometers tall.
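The arithmetic behind a DVD-stack comparison can be checked in a few lines. The sketch below uses the daily figure of 2.5 quintillion bytes cited earlier and assumes single-layer DVDs holding 4.7 GB and measuring 1.2 mm thick; the capacity and thickness are assumptions, and changing them changes the result.

```python
import math

BYTES_PER_DAY = 2.5e18        # 2.5 quintillion bytes per day, as cited in this article
DVD_CAPACITY_BYTES = 4.7e9    # assumed: single-layer DVD, 4.7 GB
DVD_THICKNESS_M = 1.2e-3      # assumed: 1.2 mm per disc


def dvd_stack_height_km(total_bytes,
                        capacity=DVD_CAPACITY_BYTES,
                        thickness_m=DVD_THICKNESS_M):
    """Height in kilometers of the DVD stack needed to hold total_bytes."""
    discs = math.ceil(total_bytes / capacity)  # whole discs only
    return discs * thickness_m / 1000          # meters -> kilometers


height_km = dvd_stack_height_km(BYTES_PER_DAY)
print(f"{height_km:.0f} km")  # about 638 km under these assumptions
```

Under these assumptions, one day of data fills about 532 million discs.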

Machine Learning and AI


Machine learning and artificial intelligence play a crucial role in big data analysis, helping to uncover patterns, correlations, and insights that humans might miss.




Data Creation Every Minute

Every minute, users upload roughly 500 hours of video to YouTube and send approximately 41.6 million messages on WhatsApp. Twitter, meanwhile, handles around 500 million tweets per day, or roughly 350,000 per minute.

Speed of Data Generation

The Large Hadron Collider (LHC) at CERN generates about 30 petabytes of data annually, highlighting the extreme data generation rates in advanced scientific experiments.


The Large Hadron Collider (LHC) at CERN. Image by Anna Pantelia/CERN


Data Lakes

A data lake is a repository for storing vast amounts of raw data in its native format, to be processed and analyzed as needed. It offers an alternative to traditional databases and data warehouses, which require data to be structured before it is stored.

Weather Data

Weather forecasting relies heavily on big data analysis. The National Oceanic and Atmospheric Administration (NOAA) collects over 20 terabytes of data every day for weather prediction.

