3Vs (volume, variety and velocity)

Contributor(s): Ivy Wigmore
This definition is part of our Essential Guide: From data gathering to competitive strategy: The evolution of big data

3Vs (volume, variety and velocity) are three defining properties or dimensions of big data. Volume refers to the amount of data, variety refers to the number of types of data and velocity refers to the speed at which data is generated and processed. According to the 3Vs model, the challenges of big data management result from the expansion of all three properties, rather than from the volume alone -- the sheer amount of data to be managed.
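
To make the three dimensions concrete, here is a minimal sketch in Python (a hypothetical illustration, not part of the model; the field names and thresholds are assumptions chosen only for readability) of how a dataset might be profiled along the 3Vs:

    from dataclasses import dataclass

    @dataclass
    class DataProfile:
        volume_bytes: int              # volume: how much data there is
        variety_formats: int           # variety: how many distinct data types/formats
        velocity_records_per_s: float  # velocity: how fast data arrives and must be processed

    def management_pressure(p: DataProfile) -> str:
        # The 3Vs model's point: pressure grows when any of the three expands,
        # not just volume. The cutoffs below are arbitrary, illustrative values.
        score = 0
        score += p.volume_bytes > 10**12             # roughly 1 TB
        score += p.variety_formats > 5
        score += p.velocity_records_per_s > 10_000
        return ["low", "moderate", "high", "severe"][score]

    print(management_pressure(DataProfile(5 * 10**12, 12, 50_000)))  # -> severe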

Doug Laney, then an analyst with META Group (since acquired by Gartner), introduced the 3Vs concept in a 2001 research note, 3D Data Management: Controlling Data Volume, Velocity and Variety. More recently, additional Vs have been proposed for the model, including variability -- the increase in the range of values typical of a large data set -- and value, which addresses the need for valuation of enterprise data.

The infographic below (reproduced with permission from Diya Soubra's post, The 3Vs that define Big Data, on Data Science Central) illustrates the expansion of the 3Vs.

The 3Vs of big data

This was last updated in February 2013


Join the conversation



This is ridiculous. Coming up with abstract buzzwords to sell tools to distill garbage data into garbage results to CIOs reading about it in SkyMall.
Great to see the industry finally adopting the "3Vs" of Big Data that Gartner first introduced over 12 years ago! Here's a link to the original piece I wrote on "The Three Dimensional Data Challenge" back in 2001 positing them: http://goo.gl/wH3qG. Interesting also to see others lop on additional "V"s that, while interesting, are decidedly not definitional characteristics of Big Data. --Doug Laney, VP Research, Gartner, @doug_laney
I agree; as far as definitions go, this is pretty useless. If one has never heard of the three V's before, it is a good way to understand the idea. What we really need is some kind of empirical definition that transcends time, sort of like Moore's Law.

Here's my suggestion: "Data is Big Data when it is too big to work on any one commonly available computer, but rather requires a cluster of computers". "Commonly available" would then have to be defined somehow, for example "computers available in the majority of large and medium-sized businesses" so that mainframes would be eliminated.

The reason a "cluster of computers" is important is that it requires a fundamental change in the underlying architecture of how mathematical functions are designed in order to perform acceptably when network communication is part of the system.

The amount of data that one computer can process has certainly changed over the years and will continue to do so. Therefore this kind of definition should be useful moving forward.
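
As a minimal sketch of that architectural shift (a hypothetical Python illustration, not taken from the comment above; the shard data is made up), even a function as simple as the mean has to be restructured so that each node reduces its own shard locally and only small partial results cross the network:

    def mean_single_machine(values):
        # Straightforward when the whole dataset fits on one commonly available computer.
        return sum(values) / len(values)

    def mean_cluster(partitions):
        # Each partition stands in for a shard held by one node in the cluster.
        # Every node reduces its shard to a tiny (sum, count) summary, and only
        # those summaries -- not the raw data -- travel over the network.
        partials = [(sum(part), len(part)) for part in partitions]
        total = sum(s for s, _ in partials)
        count = sum(c for _, c in partials)
        return total / count

    shards = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
    assert mean_cluster(shards) == mean_single_machine([x for p in shards for x in p])

The same pattern of combining local partial results globally is what cluster-oriented frameworks impose on more complex computations.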
